Presentation is loading. Please wait.

Presentation is loading. Please wait.

Doi> Norman Paskin, International DOI Foundation Digital Object Identifiers for Science Data.

Similar presentations


Presentation on theme: "Doi> Norman Paskin, International DOI Foundation Digital Object Identifiers for Science Data."— Presentation transcript:

1 doi> Norman Paskin, International DOI Foundation Digital Object Identifiers for Science Data

2 doi> Digital Object Identifier = DOI A name (not a location) for an entity on digital networks A system for persistent and actionable identification and interoperable exchange of managed information on digital networks –Standards-based components (detail in a moment) –Now to become an International Standard (in ISO TC46) Developed as cross-industry, cross-sector, not-for-profit effort managed by an open membership collaborative development body –International DOI Foundation (IDF) In widespread use now: –Over 15 million assigned, over 1000 naming authorities (users) –Key feature of scientific primary publishing as part of CrossRef system –Adopted for government documents (EC, OECD, UK, etc) In use, is a mechanism behind the scenes, –e.g. looks like a URL in a web context Offers interoperable common system for identification of science data: two projects considered as examples: –TIB project (citation of primary data sets) –Names for Life (biological taxonomy)

3 doi> The word identifier can mean several different things, e.g.: –Labels : Output of numbering schemes e.g. ISBN 3-540-40465-1 –Specifications for using labels: e.g. on internet URL, URN, URI (URI = Uniform Resource Identifier) –Implemented systems: Labels, following a specification, in a system e.g. DOI system. Packaged system offering label + tools + implementation mechanisms Requirements: reliability, automated global access, and interoperability –Interoperability = the possibility of use in services outside the direct control of the issuing assigner. Persistence implies interoperability (with the future) Interoperability implies extensibility (do not know future uses) Hence DOI is a generic framework applicable to any digital object –Digital object can be a representation of any entity Identifiers

4 Data Model Internet Resolution Numbering scheme Policies DOI is the combination of these four components doi>

5 DOI syntax can include any existing identifier label formal or informal, of any entity An identifier container e.g. –10.1234/NP5678 –10.5678/ISBN-0-7645-4889-4 –10.2224/2004-10-ISO-DOI NISO Z39.84, DOI Syntax

6 Internet resolution allows a DOI to link to any & multiple pieces of current data Resolve from DOI to data –initially to location (URL) – persistence May be to multiple data: –Multiple locations –Metadata –Services –Extensible user-defined Uses the Handle system -Implementing URI/URN concept -Running on TCP/IP (common co-inventor) -IETF RFCs 3650, 3651, 3652 -See Release 1.0, September 2003 "Online Registries: The DNS and Beyond... [doi:10.1340/309registries ]

7 Data Dictionary + DOI AP framework DOI Data Model = Metadata tools: –a data dictionary to define + –a grouping mechanism to relate Necessary for interoperability –Enabling information that originates in one context to be used in another in ways that are as highly automated as possible. Able to use existing metadata –Mapped using a standard dictionary –Can describe any entity at any level of granularity –indecsDD which incorporates ISO MPEG 21 RDD IDF is the MPEG21 RDD registration authority

8 DOI policies allow any model for practical implementations Implementation through IDF –Governance and agreed scope, policy, rules of the road –Technical infrastructure: resolution mechanism, proxy servers, mirrors, back-up, central dictionary, –Social infrastructure: persistence commitments, fall-back procedures, cost-recovery (self-sustaining), shared use of system –Not a standard but a Registration Authority/maintenance agency IDF delegates through Registration Agencies –Each can develop own applications –Use in own brand ways appropriate for their community

9 DOI to become ISO TC46/SC9 standard Home of identification numbering: identifiers for semantically meaningful entities: ISO 2108 International Standard Book Numbering (ISBN) ISO 3297 International Standard Serial Number (ISSN) ISO 3901 International Standard Recording Code (ISRC) ISO 10444 International Standard Technical Report Number (ISRN) ISO 10957 International Standard Music Number (ISMN) ISO 15706 International Standard Audiovisual Number (ISAN) ISO 15707 International Standard Musical Work Code (ISWC) ISO Project 20925 Version identifier for Audiovisual Works (V-ISAN) ISO Project 21047 International Standard Text Code (ISTC) http://www.collectionscanada.ca/iso/tc46sc9/index.htm Information and Documentation - Identification and Description

10 doi> Resolve The Handle resolution technology allows you to access any kind of Service associated with your DOI. eg Services can include metadata services Identify DOI syntax can include any existing identifier, formal or informal, of any entity eg 10.2341/0-7645-4889-1 10.5678/978-0-7645-4889-4 10.1000/ISBN 0764548891 10.1234/Norman_presentation 10.2224/2004-10-28-ISO-DOI Describe DOI metadata can be of any type, standard or proprietary eg OnixForBooks OnixForSerials IEEE/LOM MARC Dublin Core Proprietary scheme (to interoperate with anyone else in the DOI network, map to the Data Dictionary (iDD). DOI combination of components A package of services is an Application Profile

11 doi> DOI and scientific data DOI is already the core technology for maintaining cross-reference –persistent links between a citation and internet access to article CrossRef system used by 350+ publishers representing bulk of STM articles (as pre-publication link builder) www.crossref.org 9,000 DOIs per day added to CrossRef. –Over 12 million DOIs now registered with CrossRef, –Over 850,000 assigned to books and conference proceedings. Several projects suggested to IDF using DOIs for data (not connected with CrossRef) –physico-chemical property data; biological microscopy images. –See Paskin, ICSTI 2002 paper Some projects have developed their own identifiers, very useful for their own area –E.g. Life Science Identifier (I3C/IBM): simple URN mechanism, non- generic, non-global –These can be incorporated into a DOI if needed to make globally interoperable and extensible Two projects in particular have developed DOI applications:

12 doi> (1) TIB: Citation of Primary Data Problem: re-use of existing data sets –Attribution of data source: make data publications citable in a standard way (cf. articles Citation Index) –Archiving of data in context so as to be discoverable and interoperable (usable by others) Background –CODATA National Committee WG, grant-aided by DFG (Sept 2001 to May 2002): Report "Concept of Citing Scientific Primary Data –Continuation as project for pilot implementation funded by DFG Oct 2003 to Oct 2005 at TIB (German National Library of Science & Technology) –Development of DOI registration agency for Data Solution: -DOIs for data sets, with associated metadata -Core management metadata applicable to all datasets -Structured metadata extensible to specific science disciplines

13 doi> (1) Citation of Primary Data: illustration of solution During her research for the World Data Center Climate (WDCC) Dr. Weather gains primary data about the weather in Hannover in the year 2003. –Primary data is tested, evaluated, stored and administrated at the WDCC. –Primary data is registered and allocated DOI at the TIB –With quality control of metadata, no change once allocated, etc Dr Weather can now cite this with a resolvable DOI e.g DOI:10.1594 /WDCC/W_Han_2003_MMB_2 10.1594 (Prefix) = TIB as the registration agency. WDCC = research institute. W_Han_2003_MMB_2 = internal name of the Data DOI is resolvable directly, or via http as http://dx.doi.org/10.1594/WDCC/W_Han_2003_MMB_2

14 doi> (1) Citation of Primary Data: illustration of solution Usage scenario 1: Dr. Storm is reading publications from Dr. Weather in a journal and would like to analyse her data under different aspects. Can resolve the DOI to obtain the data set for use In his publication Comparison of the weather from Hannover and Miami Dr. Storm cites Dr. Weathers data using its DOI, referring to the uniqueness and own identity of the original data. Citation example: Weather, 2003: Weather in Hannover for 2003 doi: 10.1594/WDCC/W_Han_2003_MMB_2 Usage scenario 2: Mr. Nice is writing a paper about the sales figures of ice cream in Hannover in 2003, but he has no information about the weather. Searches via TIB central registration agency metadata search Result is doi:10.1594/WDCC/W_Han_2003_MMB_2 He resolves the DOI to find the data. The metadata refers him to the WDCC as publisher and data archive. In his paper he cites the data using the DOI.

15 doi> (2) Names for life: Biological taxonomy Problem: Future-proofing biological nomenclature –See Garrity and Lyons, OMICS, 2003 For a given nomenclature in a biological taxonomy, change occurs –e.g. new species recognised, species reassigned as the founding species of new genera; synonyms; species split into subspecies which later became separate species; –resulting in changes of names, genera, families, classes, relationships over time –How does researcher keep track? Solution: DOI proposed as tool –a data model of nomenclature and taxonomy –enabling disambiguation of synonyms and competing taxonomies –a metadata resolution service –enabling dissemination of archived and updated information objects through persistent links

16 macleodii (T) communis Alteromonas vaga nomenclature (2) Names for Life: illustration of problem doi>

17 macleodii (T) communis Alteromonas 1972 vaga nomenclature

18 communis vaga haloplanktis Alteromonas macleodii (T) 1972 1973 nomenclature

19 communis vaga haloplanktis rubra Alteromonas 1972 1973 1976 macleodii (T) nomenclature

20 communis vaga haloplanktis rubra citrea Alteromonas 1972 1973 1976 1977 macleodii (T) nomenclature

21 communis vaga haloplanktis rubra citrea esperjiana undina Alteromonas 1972 1973 1976 1977 1978 macleodii (T) nomenclature

22 communis vaga haloplanktis rubra citrea esperjiana undina aurantia Alteromonas 1972 1973 1976 1977 1978 1979 macleodii (T) nomenclature

23 communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai Alteromonas 1972 1973 1976 1977 1978 1979 1981 macleodii (T) nomenclature

24 communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae Alteromonas 1972 1973 1976 1977 1978 1979 1981 1982 macleodii (T) nomenclature

25 communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae vaga communis (T) MarinomonasAlteromonas commune vagum 1972 1973 1976 1977 1978 1979 1981 1982 1984 multiglobiferum japonicum minutium biejerinckii maris hiroshimense pelagicum pusillum jannaschii kreigii Oceanosprillum maris williamsae linum (T) macleodii (T) nomenclature

26 communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai vagabenthica hanedai MarinomonasAlteromonas putrifaciens (T) Shewanella japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum Oceanosprillum maris williamsae 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 luteoviolaceae communis (T) linum (T) macleodii (T) nomenclature

27 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 communis vaga haloplanktis rubra citrea esperjiana undina aurantia hanedai luteoviolaceae denitrificans vagabenthica hanedai MarinomonasAlteromonasShewanella japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum Oceanosprillum maris williamsae putrifaciens putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

28 communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans vagabenthica hanedai MarinomonasAlteromonasShewanella japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum Oceanosprillum maris williamsae 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 colwelliana putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

29 vagabenthica hanedai MarinomonasShewanella japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 colwelliana putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

30 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

31 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis putrifaciens hanedai denitrificans rubra citrea esperjiana undina aurantia luteoviolaceae tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra haloplanktis haloplanktis (T) Pseudoalteromonas undina haloplanktis tetradonis putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

32 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 1997 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra Pseudoalteromonas undina antartica elyakoviii haloplanktis tetradonis haloplanktis haloplanktis (T) putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

33 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 1997 2000 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra Pseudoalteromonas undina antartica elyakoviii fridgidimarina geldimarina woodyii amazonensis baltica oneidensis pealeana violacea bacteriolytica prydzensis tunicata distincta elyakovii peptidolytica haloplanktis tetradonis mediterannea haloplanktis haloplanktis (T) putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

34 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 1997 2000 2001 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra Pseudoalteromonas undina antartica elyakoviii fridgidimarina geldimarina woodyii amazonensis baltica oneidensis pealeana violacea bacteriolytica prydzensis tunicata distincta elyakovii peptidolytica tetrodonis japonica haloplanktis tetradonis mediterannea haloplanktis haloplanktis (T) putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

35 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 1997 2000 2001 2002 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea Pseudoalteromonas elyakoviii fridgidimarina geldimarina woodyii amazonensis baltica oneidensis pealeana violacea japonica denitrificans livingstonensis alleyanna atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra undina antartica bacteriolytica prydzensis tunicata distincta elyakovii peptidolytica tetrodonis haloplanktis tetradonis mediterannea haloplanktis haloplanktis (T) putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

36 vagabenthica hanedai colwelliana algae MarinomonasShewanella communis vaga haloplanktis rubra citrea esperjiana undina aurantia putrifaciens hanedai luteoviolaceae denitrificans tetradonis atlantica carageenovora Alteromonas colwelliana 1972 1973 1976 1977 1978 1979 1981 1982 1984 1986 1987 1988 1990 1992 1995 1997 2000 2001 2002 2004 japonicum minutium biejerinckii maris hiroshimense multiglobiferum pelagicum pusillum commune jannaschii kreigii vagum biejerinckii pelagicum maris hiroshimense Oceanosprillum maris williamsae distincta fulginea Pseudoalteromonas elyakoviii fridgidimarina geldimarina woodyii amazonensis baltica oneidensis pealeana violacea japonica denitrificans livingstonensis alleyanna atlantica aurantia carrageenovora citrea esperjiana luteoviolacea nigrifaciens pisicida rubra undina antartica bacteriolytica prydzensis tunicata distincta elyakovii peptidolytica tetrodonis haloplanktis tetradonis 11 others mariniintestina saire schlegeliana gaetbuli mediterannea primoryensis haloplanktis haloplanktis (T) putrifaciens (T) communis (T) linum (T) macleodii (T) nomenclature

37 name taxon combined name exemplar nomos journal article gene annotation any online information strain record links from the web journal article strain record gene annotation journal article links to the web DOI (2) Names for Life: illustration of solution

38 dissemination nametaxon combined name exemplar nomos By reasoning over information objects, construct services that can be offered through multiple resolution. Look up this name and all its synonyms in PubMed Determine whether this exemplar is part of a taxon in another nomos Compare this name to the current state (contents) of the taxon (2) Names for Life: illustration of solution doi>

39 Summary: DOI A system for persistent and actionable identification and interoperable exchange of managed information on digital networks –Standards-based components (detail in a moment) –Now to become an International Standard (in ISO TC46) Developed as cross-industry, cross-sector, not-for-profit effort managed by an open membership collaborative development body –International DOI Foundation (IDF) In widespread use now: –Over 15 million assigned, over 1000 naming authorities (users) –Key feature of scientific primary publishing as part of CrossRef system –Adopted for government documents (EC, OECD, UK, etc) In use, is a mechanism behind the scenes, –e.g. looks like a URL in a web context Offers interoperable common system for identification of science data: two projects considered as examples: –TIB project (citation of primary data sets) –Names for Life (biological taxonomy)

40 doi> n.paskin@doi.org www.doi.org


Download ppt "Doi> Norman Paskin, International DOI Foundation Digital Object Identifiers for Science Data."

Similar presentations


Ads by Google