Persistent Identifiers. Herbarieworkshop, Kongsvoll Fjellstue, September 2014. Dag Endresen, NHM.


Name ambiguity: George

Identify the thing that you care about:
The specimen itself (the physical entity)
Image of the specimen
Description of the specimen
Location where the specimen was captured
The occurrence event when the specimen was captured…

Record-level Terms: dcterms:type | dcterms:modified | dcterms:language | dcterms:rights | dcterms:rightsHolder | dcterms:accessRights | dcterms:bibliographicCitation | dcterms:references | institutionID | collectionID | datasetID | institutionCode | collectionCode | datasetName | ownerInstitutionCode | basisOfRecord | informationWithheld | dataGeneralizations | dynamicProperties

Occurrence: occurrenceID | catalogNumber | occurrenceRemarks | recordNumber | recordedBy | individualID | individualCount | sex | lifeStage | reproductiveCondition | behavior | establishmentMeans | occurrenceStatus | preparations | disposition | otherCatalogNumbers | previousIdentifications | associatedMedia | associatedReferences | associatedOccurrences | associatedSequences | associatedTaxa

MaterialSample: materialSampleID

Event: eventID | samplingProtocol | samplingEffort | eventDate | eventTime | startDayOfYear | endDayOfYear | year | month | day | verbatimEventDate | habitat | fieldNumber | fieldNotes | eventRemarks

dcterms:Location: locationID | higherGeographyID | higherGeography | continent | waterBody | islandGroup | island | country | countryCode | stateProvince | county | municipality | locality | verbatimLocality | verbatimElevation | minimumElevationInMeters | maximumElevationInMeters | verbatimDepth | minimumDepthInMeters | maximumDepthInMeters | minimumDistanceAboveSurfaceInMeters | maximumDistanceAboveSurfaceInMeters | locationAccordingTo | locationRemarks | verbatimCoordinates | verbatimLatitude | verbatimLongitude | verbatimCoordinateSystem | verbatimSRS | decimalLatitude | decimalLongitude | geodeticDatum | coordinateUncertaintyInMeters | coordinatePrecision | pointRadiusSpatialFit | footprintWKT | footprintSRS | footprintSpatialFit | georeferencedBy | georeferencedDate | georeferenceProtocol | georeferenceSources | georeferenceVerificationStatus | georeferenceRemarks

GeologicalContext: geologicalContextID | earliestEonOrLowestEonothem | latestEonOrHighestEonothem | earliestEraOrLowestErathem | latestEraOrHighestErathem | earliestPeriodOrLowestSystem | latestPeriodOrHighestSystem | earliestEpochOrLowestSeries | latestEpochOrHighestSeries | earliestAgeOrLowestStage | latestAgeOrHighestStage | lowestBiostratigraphicZone | highestBiostratigraphicZone | lithostratigraphicTerms | group | formation | member | bed

Identification: identificationID | identifiedBy | dateIdentified | identificationReferences | identificationVerificationStatus | identificationRemarks | identificationQualifier | typeStatus

Taxon: taxonID | scientificNameID | acceptedNameUsageID | parentNameUsageID | originalNameUsageID | nameAccordingToID | namePublishedInID | taxonConceptID | scientificName | acceptedNameUsage | parentNameUsage | originalNameUsage | nameAccordingTo | namePublishedIn | namePublishedInYear | higherClassification | kingdom | phylum | class | order | family | genus | subgenus | specificEpithet | infraspecificEpithet | taxonRank | verbatimTaxonRank | scientificNameAuthorship | vernacularName | nomenclaturalCode | taxonomicStatus | nomenclaturalStatus | taxonRemarks

ResourceRelationship (Auxiliary Terms): resourceRelationshipID | resourceID | relatedResourceID | relationshipOfResource | relationshipAccordingTo | relationshipEstablishedDate | relationshipRemarks

MeasurementOrFact (Auxiliary Terms): measurementID | measurementType | measurementValue | measurementAccuracy | measurementUnit | measurementDeterminedDate | measurementDeterminedBy | measurementMethod | measurementRemarks

Term name: occurrenceID
Identifier:
Class:
Definition: An identifier for the Occurrence (as opposed to a particular digital record of the occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the occurrenceID globally unique.
Comment: For a specimen in the absence of a bona fide global unique identifier, for example, use the form: "urn:catalog:[institutionCode]:[collectionCode]:[catalogNumber]". Examples: "urn:lsid:nhm.ku.edu:Herps:32", "urn:catalog:FMNH:Mammal:145732". For discussion see
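The fallback form in the occurrenceID comment can be sketched in a few lines of Python. This is a minimal illustration, not part of the Darwin Core standard: the function name `build_occurrence_id` and its validation rule (all three parts must be non-empty) are assumptions made for the example.

```python
def build_occurrence_id(institution_code: str, collection_code: str, catalog_number: str) -> str:
    """Build a fallback occurrenceID of the form
    urn:catalog:[institutionCode]:[collectionCode]:[catalogNumber].

    Hypothetical helper: the name and the validation are illustrative only.
    """
    parts = [institution_code.strip(), collection_code.strip(), catalog_number.strip()]
    # An empty part would make the identifier ambiguous, so reject it.
    if not all(parts):
        raise ValueError("institutionCode, collectionCode and catalogNumber are all required")
    return "urn:catalog:" + ":".join(parts)


# Reproduces the second example given in the term definition.
print(build_occurrence_id("FMNH", "Mammal", "145732"))  # urn:catalog:FMNH:Mammal:145732
```

Such an identifier is only as unique as the combination of the three codes, which is why the definition treats it as a substitute for a real persistent identifier rather than a replacement.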

Persistent Identifier (PID)
Globally Unique Identifier (GUID)
Uniform Resource Identifier (URI)
Persistent Uniform Resource Locator (PURL)
Life Science Identifier (LSID)
Digital Object Identifier (DOI)
Handle system (Handle)
Archival Resource Key (ARK, EZID)
Universally Unique Identifier (UUID)
Reuse existing identifiers!

Photo: Smithsonian National Museum of Natural History, USNM Eutoxeres-aquila
urn:lsid:Orthoptera.speciesfile.org:TaxonName:xx
urn:lsid:catalogueoflife.org:taxon:d755ba3e-29c1-102b-9a4a f820:ac2009
PURL
Reuse existing identifiers

Illustration by Miroslav Šašek (1963)

Globally unique
Scalability, number of IDs
Community acceptance
Long-term life-cycle
Resolvable, resolution service(s)
Cost per identifier
People-friendly or machine-friendly
Solution for the generation of new IDs
– Central generation, PID issuer
– Distributed generation at source

A UUID is a 16-octet (128-bit) number, usually written as a 36-character string (32 hexadecimal digits and 4 hyphens). Example: C37E3F9B-BCAF EB7-3346A2DB2373 The probability of one duplicate would be about 50% if every person on earth created 600 million UUIDs. UUIDs allow easy generation at source in a distributed network.
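Generation at source needs no central issuer, as the short Python sketch below illustrates with the standard `uuid` module. The collision estimate uses the usual birthday-bound approximation and assumes random (version 4) UUIDs, which carry 122 random bits; the slide does not say which UUID version it has in mind, so that choice and the helper name `ids_for_collision_probability` are assumptions.

```python
import math
import uuid

# Generate a random (version 4) UUID locally, with no central issuer.
new_id = uuid.uuid4()
text = str(new_id).upper()  # 36 characters: 32 hex digits and 4 hyphens
print(text, len(text), len(new_id.bytes))

# Assumption: version 4 UUIDs, where 6 of the 128 bits are fixed and 122 are random.
RANDOM_BITS = 122
space = 2 ** RANDOM_BITS


def ids_for_collision_probability(p: float, space_size: int) -> float:
    """Birthday-bound approximation: number of random IDs needed before the
    chance of at least one duplicate reaches p."""
    return math.sqrt(2 * space_size * math.log(1 / (1 - p)))


n_half = ids_for_collision_probability(0.5, space)
print(f"IDs needed for a 50% chance of one duplicate: {n_half:.2e}")
print(f"That is 600 million IDs each for about {n_half / 600e6:.1e} people")
```

Under these assumptions the 50% point falls at a few times 10^18 identifiers, which is the same order of magnitude as the figure quoted on the slide.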

Identifier – Resolver – Location – Specimen

http – PURL – UUID
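One way to read the http, PURL and UUID combination is: mint a UUID at source, embed it in a stable HTTP address, and let a resolver map that address to wherever the record currently lives. The Python sketch below illustrates only the idea. The base address `http://purl.example.org/object/`, the `Resolver` class and the target URLs are all hypothetical, and a real PURL service would answer with an HTTP redirect rather than a dictionary lookup.

```python
import uuid
from typing import Dict, Optional

# Hypothetical base address, for illustration only.
BASE = "http://purl.example.org/object/"


def mint_uri(object_uuid: uuid.UUID) -> str:
    """Combine a stable HTTP prefix with a locally generated UUID."""
    return BASE + str(object_uuid)


class Resolver:
    """Toy resolver: the identifier stays fixed, the location may change."""

    def __init__(self) -> None:
        self._locations: Dict[str, str] = {}

    def register(self, uri: str, location: str) -> None:
        # Re-registering the same URI simply updates its current location.
        self._locations[uri] = location

    def resolve(self, uri: str) -> Optional[str]:
        return self._locations.get(uri)


resolver = Resolver()
specimen_uri = mint_uri(uuid.uuid4())

# The record moves to a new address, but the identifier does not change.
resolver.register(specimen_uri, "http://old-portal.example.org/specimen/1")
resolver.register(specimen_uri, "http://new-portal.example.org/specimen/1")
print(specimen_uri, "->", resolver.resolve(specimen_uri))
```

The point of the indirection is that labels, publications and other databases only ever store the identifier, so a change of hosting does not break them.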

Including machine readable formats

Quick Response Code (QR code). A type of matrix barcode (or two-dimensional code). Popular due to its fast readability and large storage capacity. The use of QR Codes is free of any license. The QR Code is clearly defined and published as an ISO standard. Invented in Japan by the Toyota subsidiary Denso Wave in 1994.

UUID QR codes for museum objects at NHM-UiO provide:
Machine-readable identifiers (readable with a simple smartphone or a barcode reader).
New and efficient workflows for collection management.
Stable identifiers appropriate for databasing.

Peer review option for biodiversity data sets.
Authors get scientific credit for data publication.
Meeting concerns over data quality.
Meeting concerns over data citation mechanism.
Towards → each data set published through GBIF accompanied by a data paper…?
Data set identifier DOI → dwc:datasetID …?

Why publish your data:
Citable publication
Establish scientific priority
Increase collaboration
Link data to bigger network
Re-use and multiply effect
Respond to funding requirements

Smith V, Georgiev T, Stoev P, Biserkov J, Miller J, Livermore L, Baker E, Mietchen D, Couvreur T, Mueller G, Dikow T, Helgen K, Frank J, Agosti D, Roberts D, Penev L (2013) Beyond dead trees: integrating the scientific process in the Biodiversity Data Journal. Biodiversity Data Journal 1: e995. DOI: /BDJ.1.e995

Globally unique identifiers are one of the three core components in the TDWG technical architecture.

Status 27. August 2014 GBIF enables free and open access to biodiversity data online. We are an international government-initiated and funded initiative focused on making biodiversity data available to all and anyone, for scientific research, conservation and sustainable development.

GBIF provides:
A data discovery system
A global registry
A data portal

Thanks for listening! Dag Endresen