What is a long-term identifier?
An identifier is an association between a character string and an object. Objects can be files, parts of files, names of persons or organizations, abstractions, etc. Objects can be online or offline. Character strings include URLs, serial numbers, names addresses, etc. A "persistent identifier" is an identifier that is available and managed over time; it will not change if the item is moved or renamed. This means that an item can be reliably referenced for future access by humans and software. EZID currently supports persistent for two kinds of identifiers: DataCite Digital Object Identifiers (DOIs) and lower-cost Archival Resource Keys (ARKs).
What is a DOI?
DOI stands for "Digital Object Identifier." It is an identifier originating from the publishing world and in widespread use for journal articles. DOIs become persistent when the objects and identifier forwarding information is maintained.
What is an ARK?
ARK stands for "Archival Resource Key." It is an identifier originating from the library, archive and museum community. ARKs become persistent when the objects and identifier forwarding information is maintained.
Can DOIs and ARKs be used together?
ARK identifiers have certain features that can be very useful for:
- Keeping track of many granules of a dataset (ARKs will be able to "pass through" a suffix, so many thousands of items can be referenced on the basis of a single registration);
- Keeping track of data before a decision has been made about whether or not it is going to be retained (ARKs can be deleted).
This can make it attractive to use ARKs during the early part of a dataset's "life" or the early stages of the research process. Then, when the time comes to begin writing up results and it becomes clear which object(s) will be cited, it may appropriate to get DOIs for those objects.
DOI identifiers have the citation-level "reputation" and:
- It is possible to use the ARK identifier of the cited object as the suffix for the DOI so that there is a traceable connection between the two.
In this way, the two identifier schemes can work together to support the research data life cycle. With EZID, clients get access to both of these identifiers and can take best advantage of both approaches.
What is identifier resolution?
Instead of leading directly to an object, one identifier frequently points to another, or "target URL", that leads directly to the object. The process of getting to the final target name, possibly via a chain of intermediate names, is called "resolution." Resolution on the web is usually fast and invisible. It is done behind the scenes on your behalf by web browsers. Unsuccessful resolution, however, usually means visible failure to access the object that you were expecting, resulting in a "broken identifier." Objects tend to move, so identifier persistence depends on resolution using up-to-date target URLs. To make this happen, EZID provides a way for people to update target URLs as they change when objects move around. This is very similar to leaving a forwarding address when you change your residence. As the starting point for resolution, the resolver effectively lets you publicize an unchanging identifier that you maintain so that it will consistently hit a target that may be moving. EZID currently updates two resolvers: N2T (Name-to-Thing) based at n2t.net and the DOI (Digital Object Identifier) resolver based at doi.dx.org.
What is metadata?
Metadata is information (data)about the object, such as the name of the object's creator, the date of creation, the target URL, the version of the object, its title, and so on. EZID allows the user to enter metadata at the same time as an identifier is requested. Associating metadata with identifiers enables more sophisticated mechanisms for digital content discovery and higher-level assurances of long-term persistence.