Persistent identifiers – an Overview Juha Hakala The National Library of Finland 2011-02-01.

Slides:

Advertisements

Similar presentations

Serials identification and the electronic environment F. Pellé, ISSN IC Cairo, October 2001.

Advertisements

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.

IDF open meeting 2007 doi>. Eight possible innovations doi> Innovative uses of the DOI System.

Doi> DOI – new applications panel IDF Annual Members meeting Bologna 2005.

THE DONOR PROJECT Titia van der Werf-Davelaar. Project Financed by: Innovation of Scientific Information Provision (IWI) Duration: –phase 1: 1 may 1998.

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.

Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.

Documenting the Resource Malcolm Polfreman

Project Proposal.

Publication History Diane I. Hillmann. Background  Formally the CONSER Task Force to Explore the Use of a Universal Holdings Record  In the process.

DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,

Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.

Page 1 Building Reliable Component-based Systems Chapter 18 - A Framework for Integrating Business Applications Chapter 18 A Framework for Integrating.

IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.

1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.

National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.

Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.

1 APARSEN - WP2200 Identifiers and Citability Interoperability Framework for PI systems Webinar on PI - 15 February 2013 Maurizio Lunghi.

Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library

XP New Perspectives on XML Tutorial 4 1 XML Schema Tutorial – Carey ISBN Working with Namespaces and Schemas.

Locating objects identified by DDI3 Uniform Resource Names Part of Session: Concurrent B2: Reports and Updates on DDI activities 2nd Annual European DDI.

Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.

Rfc2141bis, rfc3406bis and the ISBN + NBN namespaces IETF 83, Paris, France Juha Hakala The National Library of Finland.

8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.

Copy cataloguing in Finland Juha Hakala The National Library of Finland

BEYOND THE OPAC: FUTURE DIRECTIONS FOR WEB-BASED CATALOGUES Martha M. Yee September 11, 2006 draft.

METADATA QUALITY IN EUROPEANA , Den Haag.

DOI Workshop, Luxembourg - 20 May Identifiers in Context Andy Powell UKOLN University of Bath UKOLN.

Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.

Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.

European Endeavor Users Group Meeting Helsinki, Sept Esa-Pekka Keskitalo, System Analyst Helsinki University Library OpenURL 1.0.

Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.

RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen RDA Forum ALA Annual Meeting, New Orleans, June 24,

European Commission on Preservation and Access Preservation of digital heritage Yola de Lusenet Lisbon, November

Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.

1 Strategic Plan for Digital Archives Programme DAP PROJECT SCOPE OVERVIEW STATUS.

Primary funding is provided by the JISC and ESRC. Based at Manchester Computing, The University of Manchester. 1 1 Getting Technical - Linking UKSG Serial.

Catherine Tabone 04 June ELI Compliant URI Scheme Implementation.

VIVO and Scholarly Repositories: Synergistic Opportunities.

Evidence from Metadata INST 734 Doug Oard Module 8.

RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.

TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.

Planning your Project Managing your 333T project is like managing any professional project.

Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.

Building the digital world from local to universal Adolf Knoll National Library of the Czech Republic

Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.

Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008.

A Overview of Standards and Technologies in Identification of Archival Information Lou Reich CSC/NASA AWIICS 13-Oct-99.

Digital Object Identifier doi> Norman Paskin The International DOI Foundation W3C DRM workshop January 22/

Interoperability How to Build a Digital Library Ian H. Witten and David Bainbridge.

A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.

Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.

CNR – National Research Council, Rome (IT) Central Library ‘G. Marconi’ National Centre for Grey Literature and National ISSN Centre CNR – National Centre.

The Akoma Ntoso Naming Convention Fabio Vitali University of Bologna.

Low-Risk Persistent Identification: the “Entity” (N2T) Resolver 10 October 2006 John Kunze, California Digital Library, University of California.

DC Architecture WG meeting Wednesday Seminar Room: 5205 (2nd Floor)

1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.

The Linking ISSN (ISSN-L): Crossing the Bridge to the Future Crossing the Bridge to the Future The Linking ISSN (ISSN-L): Crossing the Bridge to the Future.

URN resolution via Z39.50 August 1999 Z39.50 Tutorial, Stockholm Juha Hakala Helsinki University Library

Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.

Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.

Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.

Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.

PIDs and National PID Services

Chapter Eight Interoperability How to Build a Digital Library

Metadata for research outputs management

Prepared by Elena Escolano

An Open Archival Repository System for UT Austin

FRBR and FRAD as Implemented in RDA

Presentation transcript:

Persistent identifiers – an Overview Juha Hakala The National Library of Finland

Traditional identifiers Traditional (bibliographic) identifiers are systems like ISBN (International Standard Book Number) which provide unique and persistent identification for certain types of resources (books, serials, etc.) They were designed for printed resources before the Internet was invented; thus the match with the digital resources and the Web may be a forced one These identifiers are well established international standards with relatively clear roles Not always clear how to apply them to the e-resources, except that identified resources themselves should be persistent

Persistent identifiers (PIDs) A new category of identifiers which are actionable in the Internet, that is, they enable persistent linking (resolution) to the resource or a surrogate such as a bibliographic description of the resource Most PIDs are also “traditional” identifiers When using a DOI, one can identify a book with DOI & an embedded ISBN or DOI with a local ID string URN is the only exception from this; URNs must include a traditional identifier URN namespaces inherit the rules of the traditional identifier used; there is no need to discuss the scope of the URN itself

Traditional versus persistent identifiers Assigning a traditional identifier such as ISBN is (should be?) a controlled process with precise rules What is identified, by whom Assigning a PID such as ARK may or may not be a controlled process and the rules of application may be vague Sometimes the rules are different: A book must have just one ISBN, but it may have two PIDs (for instance, ARK and DOI) The National Library of Finland uses Handles in its Dspace system, but URN is the ”official” identifier of these resources

Recommendations Conflicts between the two identifier groups should be avoided at all cost If a traditional identifier can be assigned to the resource, use that identifier as a part of the PID It follows that PIDs that cannot (easily) incorporate traditional identifiers may cause problems Any identifier (traditional / PID) should have explicit implementation guidelines If no general guidelines exist rules must be developed locally; such rules should eventually be aligned in the level of the PID community

Persistent identifiers and the Web: Cool URIs From the library point of view, cool URIs (URLs) are not proper identifiers at all The same resource may be available from many URLs Over time, different resources or variant versions of the same resource may be available in the same URI There is absolutely no control over cool URI assignment A user cannot know if a URI is cool or not (most of them aren’t) Instead, cool URIs are just shelf marks What is a realistic time frame for cool URI persistence? Cool URIs can support only resolution; persistent identifiers can be more versatile in this respect Match with the current / future long term preservation systems

Services provided by PIDs Basic question: what services do we need? Some examples: Find all locations (URLs) related to the PID Find bibliographic metadata related to the PID Retrieve the preservation commitment of the owning organization (concerning the resource at hand) There is no overall framework / context within which to design the resolution services Each PID provides a slightly different set

PID –based services in the future Theoretical basis could be twofold: Functional requirements for bibliographic records (FRBR) – model: work, expression, manifestation Current theory and practice of long-term preservation based on the migration strategy (and a long tail of manifestations for each work) This means it must be possible for instance to: Find all works related to the work at hand Find all expressions related to the work at hand Find all manifestations of the work at hand Find out differences between these manifestations

PID–based services in the future (2) It should also be possible to Find out who is preserving the resource Retrieve the rights metadata related to the resource Retrieve the preservation metadata related to the resource Retrieve the most original version (the eldest preserved manifestation) of the resource Retrieve the latest (and supposedly the easiest to use) manifestation of the resource …

Example: qualitative social scientific data set The work itself should be described; one metadata element should be the PID Expressions (translations to other languages) should have their own PIDs, linked to the work level record There may be multiple manifestations (relational database, Excel table, etc.) of each expression; each one should have its own PID, and there should be links to the work / expressions In this environment, it would make sense to provide links to the work, and let the users to choose the most appropriate manifestation Choice of the language, file format, etc.

Recommendations (2) Services supported by PID systems need a face lift Many systems were designed 10+ years ago, when digital object management systems were still in their infancy Upgrades must be done in a non-destructive manner (existing implementations must be compliant with the new version) All aspects of PID systems should be standardized Some PIDs (e.g. ARK and PURL) have never reached a standard status, and at best only one part of the system (identifier syntax) has been published as a standard More (and better) open source implementations are needed

Conclusion There will be multiple PIDs in existence in the future (just like there are now) Once a system has been chosen, you cannot give it up PID supporters and cool URI proponents will most likely continue talking past one another for quite some time, but: Given the time frame the national libraries & archives must preserve resources (centuries) and the technical complexity of this task, cool URIs fall short of the requirements in several ways; instead, PIDs must be used PID systems are to some extent ”work in progress”