Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.

Similar presentations


Presentation on theme: "1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links."— Presentation transcript:

1 1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links

2 2 Desirable Properties of Identifiers Location independent name Globally unique Persistent across time Choice of human generated or automatic generation Fast resolution Decentralized administration Supported from standard user interfaces

3 3 Syntax / or hdl: / Examples 10.1234/1995.02.12.16.42.21;9 (date-time stamp) cornell.cs/cstr-94.45 (mnemonic name) loc/a43v-8940cgr(random string) Syntax of Handles

4 4 Examples of DOIs 10.156 / catalog-96 Publisher ID assigned by DOI Agency Item ID assigned by Publisher 10.1048 / 872 10.1532 / PII 10.18698 / SICI

5 5 Elements of the Handle System Handle services: global handle service local handle services caching services Clients: client libraries browser extension WWW proxy servers Handle administration System utilities

6 6 Hierarchy of Naming Authorities loc 10cornell loc.cords 10.1234 cornell.temp cornell.cs cornell.cs.d

7 7 Handle Servers and Handle Service The Global Handle Service provides central coordination for all handle services. Each naming authority has a home handle service (which may be Global) where its handles are maintained. Each handle service may be implemented as several handle servers. A hashing algorithm determines the server used to store a given handle.

8 8 Handle Record for a Digital Object Adm Admin Data URL cnri.dlib/arms-09 Adm Admin Data http://www.cnri/xyz RAP merlin.dlib.org NEW orb:#cornell[]norb

9 9 Address Rules The Global Handle Service stores: a record for each naming authority a record for each local handle service The record for each naming authority includes: the home handle service for that naming authority For each handle, the home handle service stores: the handle record

10 10 Resolving a Handle Without Caches Handle cnri.dlib/wya in Global G Global Client ? cnri.dlib/wya ? handle data G cnri.dlib/wya

11 11 Resolving a Handle Without Caches Handle cnri.dlib/wya in Home Service abc Global Home HS for cnri.dlib Client ? cnri.dlib/wya ? G abc pointer to abc ? cnri.dlib/wya ? handle data cnri.dlib/wya

12 12 Caching Handle Service Client Caching Server Handle Servers Hash Cache Hash table

13 13 Replication All data is replicated at several sites for performance and reliability Washington, DC Los Angeles, CA

14 14 Applications of Identifiers The challenges: Persistent, unique identifiers Eliminate broken links Control duplicates Applications: On-line publication Registration Citation (reference links) Collection management Archives

15 15 User Handle System DOIs and URNs in Action Publisher DOI

16 16 Flexibility for Publisher Warehouse Database Repository Every publisher can have a different system. DOI

17 17 Reorganization by Publisher Database Repositories The publisher can create a new system. DOI

18 18 Change of Publisher Halfmoon Millenium User DOI Handle System

19 19 Handle System Citation Publisher User 1 DOI User 2 DOI

20 20 User Handle System Publisher Search System DOI Catalogs and Indexes

21 21 Copyright Registration Copyright Registry User Handle System Halfmoon DOI

22 22 Multiple Copies Halfmoon Europe User DOI Handle System Halfmoon USA

23 23 User Handle System Archives Archive DOI

24 24 Reference Linking: The Problem Generic Given the information in a standard citation, how does one get to the thing to which the citation refers? Specific Given the information in a citation to a journal article, how does a user get from the citation to an appropriate copy of the article?

25 25 The General Model Reference database Location database Content Publisher Client Publisher places information in databases

26 26 The General Model Reference database Location database Content Publisher Client Citation Identifier s

27 27 The General Model Reference database Location database Content Publisher Client URL s Identifier

28 28 The General Model Reference database Location database Content Publisher Client URL Content

29 29 The General Model Reference database Location database Content Publisher Client Citation Identifiers URLs Identifier URL Content

30 30 Target of Citations Work Expression Manifestation Item IFLA model Citations can refer to any specific creation but for journals usually refer to the work.

31 31 Identifiers Are identifiers necessary? –Persistence –Flexible targets Examples: –PubMed ID, BibCode, DOI, etc.

32 32 How are Identifiers Obtained? Often the client knows the citation, but not the identifier. In the general model identifiers are obtained by searching the reference database. In limited domains, identifiers can be calculated from metadata. The identifier may be embedded in the citation.

33 33 Reference Database Lookup Static: Reference links are established once for all time. –Current model in journal publishing –Not suitable for general user queries Dynamic: Reference links are established on demand. –Provides link based on most recent information –Success can not be guaranteed Quality of metadata in reference database(s) is crucial.

34 34 Metadata in Reference Database Existing schemes –Considerable agreement on minimal elements –Considerable differences in details and syntax

35 35 Minimal Metadata Elements for Journal Article Title of journal article Creator(s) Journal title Date of publication Enumeration (e.g., volume and issue) Location (e.g., page or article number) Type (e.g., "journal article")

36 36 Resolution of Identifier Choice of resolver (distributed resolution) –Simple model: identifier determines resolver Selection from multiple copies (selective resolution) –Performance criteria –Economic and related criteria –User requirements

37 37 Interoperability Several reference linking services under development: PubMed Astrophysics Data Center DOI reference service Los Alamos National Laboratory internal reference service What levels of agreement and tools are needed for cross- linking?


Download ppt "1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links."

Similar presentations


Ads by Google