Introduction to Persistent Identifiers

Slides:



Advertisements
Similar presentations
The Corporation for National Research Initiatives The Handle System Persistent, Secure, Reliable Identifier Resolution.
Advertisements

Basic Internet Terms Digital Design. Arpanet The first Internet prototype created in 1965 by the Department of Defense.
GUIDs in EMu Ian Turnbull KE Software. GUID? UUID? A Globally Unique Identifier (GUID) is a persistent unique reference number used as an identifier.
What are GUIDs and Why Do We Need Them ??? Steve Baskauf Vanderbilt Dept. of Biological Sciences
DDI3 Uniform Resource Names: Locating and Providing the Related DDI3 Objects Part of Session: DDI 3 Tools: Possibilities for Implementers IASSIST Conference,
Web development  World Wide Web (web) is the Internet system for hypertext linking.  A hypertext document (web page) is an online document. It contains.
Hypertext Transfer Protocol Kyle Roth Mark Hoover.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Creating your website Using Plain HTML. What is HTML? ► Web pages are authored in HyperText Markup Language (HTML) ► Plain text is marked up with tags,
GLOBAL BIODIVERSITY INFORMATION FACILITY Greg Riccardi Co-chair 9 November Outcomes of the GBIF LSID-GUID Task Group.
National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.
UNDERSTANDING WEB AND WEB PROJECT PLANNING AND DESIGNING AND EFFECTIVE WEBSITE Garni Dadaian.
LC and the W3C: History b Attended two W3C Workshops Indexing/Distributed Search Indexing/Distributed Search Query Language Query Language.
EPICUR Kathrin Schroeder ERPANET-Workshop „Persistent Identifiers“ (17th June 2004) Uniform Resource Names (URN) – Overview Die Deutsche Bibliothek.
Why identifiers? To access resources To cite resources To unambiguously identify a resource –To register it as intellectual property –To record changes.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
Locating objects identified by DDI3 Uniform Resource Names Part of Session: Concurrent B2: Reports and Updates on DDI activities 2nd Annual European DDI.
Tobias Weigel (DKRZ) Tobias Weigel Deutsches Klimarechenzentrum (DKRZ) Persistent Identifiers Solving a number of problems through a simplistic mechanism.
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
IDs in and out of the database Entomological Collections Network (ECN) 2012 November 10 – 11, Knoxville, TN Debbie Paul, Greg Riccardi.
DATA COMMUNICATION DONE BY: ALVIN SAMPATH CARLVIN SAMPATH.
AON Data Questionnaire Results 21 Respondents Last Updated 27 March 2007 First AON PI Meeting Scot Loehrer, Jim Moore.
Linking resources Praha, June 2001 Ole Husby, BIBSYS
OCLC Online Computer Library Center Erpanet Symposium on Persistent Identifiers PURLs Stuart Weibel Senior Research Scientist June 17, 2004.
WEB PROGRAMMING  Introduction to World Wide Web  Lecture #: 1.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
European Endeavor Users Group Meeting Helsinki, Sept Esa-Pekka Keskitalo, System Analyst Helsinki University Library OpenURL 1.0.
LSIDs in a Nutshell Jun Zhao University of Manchester 1 st December, 2005.
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008.
LSIDs and RDF in TDWG Roger Hyam, TDWG, RBGE Donald Hobern, GBIF June 7-9, Edinburgh, UK.
CS 6401 The World Wide Web Outline Background Structure Protocols.
LTER, PASTA, and persistent identifiers LTER IMC Water Cooler Series January 2011.
Interoperability How to Build a Digital Library Ian H. Witten and David Bainbridge.
Low-Risk Persistent Identification: the “Entity” (N2T) Resolver 10 October 2006 John Kunze, California Digital Library, University of California.
Globally Unique Identifiers: What, why, when, which and what now? Dave Thau University of Kansas
1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.
Web Design – Week 2 Introduction to website basics Website basics: How the Web Works Client / server architecture Packet switching URL components.
Course on persistent identifiers, Madrid (Spain) Information architecture and the benefits of persistent identifiers Greg Riccardi Director Institute for.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
HTML Paragraphs Paragraphs are defined with the tag. Example This is a paragraph This is another paragraph.
COMP3121 E-Commerce Technologies Richard Henson University of Worcester December 2009.
Instructor Materials Chapter 5 Providing Network Services
How HTTP Works Made by Manish Kushwaha.
Introduction to Persistent Identifiers
Improving searches through community clustering of information
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
Chapter Eight Interoperability How to Build a Digital Library
E-commerce | WWW World Wide Web - Concepts
E-commerce | WWW World Wide Web - Concepts
Naming in Distributed Web-based Systems
Processes The most important processes used in Web-based systems and their internal organization.
Introduction to Persistent Identifiers
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
CS222 Web Programming Course Outline
Persistent identifiers in VI-SEEM
What is the Internet? Global system of interconnected computer networks – a network of networks! Hartland Consolidated Schools network Network at your.
1 Introduction to the Internet.
Application layer Lecture 7.
CNIT 131 HTML5 – Anchor/Link.
Cataloging the Internet
COMP 150-IDS: Internet Scale Distributed Systems (Spring 2016)
Web Server Technology Unit 10 Website Design and Development.
HTTP and Abstraction on the Internet / The Need for DNS
IPFS: Interplanetary File Systems
Introduction to World Wide Web
MVC Controllers.
Lesson 19 Internet Basics.
INTRODUCTION TO THE INTERNET AND WEB
Presentation transcript:

Introduction to Persistent Identifiers Summary Definition of persistent identifiers Which objects – physical / digital Proliferation of identifiers What do identifiers allow us to do The challenge of real persistence Social contracts Present, real examples Course on persistent identifiers, Madrid (Spain) Type of Persistent Identifiers Kevin Richards Software Developer Landcare Research February 8th, 2012

Introduction This presentation explains some characteristics of a typical identifier scheme, some examples of types of identifiers and briefly looks at some issues that may arise when the information associated with an identifier changes This session is presented by Kevin Richards, Software Developer, at Landcare Research in New Zealand.

Summary Types of Persistent Identifiers Characteristics of an ideal identifier scheme Commonly used identifier types Points for consideration Events that impact the maintenance of identifiers Discussion

Characteristics of an Identifier scheme Universally unique Unchanging Independent generation Opaque Associated services

Characteristics of an Identifier Universally Unique A mechanism is required to ensure global uniqueness Integers obviously not good here Internet example – domain names, URI UUID ensures uniqueness with no tie to resolution protocol

Characteristics of an Identifier Unchanging The resource the identifier refers to must remain the same The link between the identifier itself and what it identifies is the key point Draw diagram to illustrate – eg ID – refers to this object, etc

Characteristics of an Identifier Scheme Independent generation, ideally It should be possible to create identifiers without relying on a central service If a generation mechanism has been defined to ensure uniqueness, then the creation of identifiers can be done by anyone, anywhere c.f. DOI We are mobilising millions of objects

Characteristics of an Identifier Scheme Opaque, ideally In theory, should not be possible to determine any information about a resource by looking at the identifier Often not easy to achieve May be more of a sociological issue

Characteristics of an Identifier Associated Services Not required for a persistent identifier but makes the identifier more useful Resolution of identifier Other functions may include metadata requests and generation Explain uselessness of a UUID by itself

Resolution of Identifiers URLs, URIs, URN E.g. http, lsid, mailto, ftp HTTP Protocol for resolving a web resource which may have many formats, e.g. image, html page, text, xml May have redirects, HTTP 303 Resolution proxies

Types of Identifier Schemes UUID (sometimes GUID) Assured unique E.g. d6610130-5248-11e1-b86c-0800200c9a66 Hard to type in Not resolvable Not always DB friendly Opaque

Types of Identifier Schemes HTTP URI Uniform Resource Identifier Web based – uses HTTP and DNS In common use Promoted by Linked Data advocates May not be opaque due to semantics of domain names E.g. http://example.org/speciman/123 A generalisation of URLs.

Types of Identifiers PURL Persistent Uniform Resource Locator Eg http://purl.oclc.org/example/specimen/12921 Web based using HTTP and HTTP redirect Resolved through PURL resolver May not be opaque due to domain names and paths

Types of Identifiers DOI Digital Object Identifier Eg doi:10.1000/182 http://dx.doi.org/10.1006/jado.1998.0204 Managed by DOI Foundation (commercial) Generated by DOI Foundation Resolved through DOI resolution service Very opaque

Types of Identifiers LSID Life Science Identifier (URN) Developed by OMG for “name” based identification E.g. urn:lsid:example.org:specimen:12921 Resolution protocol independent (i.e. does not rely upon HTTP) 3 step resolution mechanism May not be opaque due to domain names

When things change Change of institution name (domain name for URLs) E.g. old ID http://www.example.org.nz/specimen/12921 new ID http://www.egHerbarium.org.nz/specimen/12921 Broken Identifiers ? Avoid by avoiding institution names for authority, use Project names. E.g use NZspecimens.org.nz instead

When things change Transfer of dataset (or split) New owner for dataset Eg specimens of NHM given permanently to regional herbarium -> new institution name Similar solution to last problem, but if project name changes too, then still an issue, e.g. NZspecimens.org.nz project is migrated to wellingtonSpecimens.org.nz 2 options new owner maintains old Identifiers Old Identifiers redirected to new identifiers

When things change Resource destroyed When the resource the identifier refers to has been deleted or removed from the system Preferred results is to maintain the resolution of the identifier and state it is deprecated in the metadata for that resource

Summary/conclusions Characteristics of an Identifier Universally unique, independent generation, unchanging, opaque and actionable Types of Identifiers (some of them) URI, PURL, DOI, LSID When things change Split or transfer of dataset, change of institution name, destroyed resource

Introduction to Persistent Identifiers Summary Definition of persistent identifiers Which objects – physical / digital Proliferation of identifiers What do identifiers allow us to do The challenge of real persistence Social contracts Present, real examples Course on persistent identifiers, Madrid (Spain) Type of Persistent Identifiers Kevin Richards Software Developer Landcare Research February 8th, 2012