GLOBAL BIODIVERSITY INFORMATION FACILITY Greg Riccardi Co-chair 9 November 2009 WWW.GBIF.ORG Outcomes of the GBIF LSID-GUID Task Group.

Presentation transcript:

GLOBAL BIODIVERSITY INFORMATION FACILITY
Outcomes of the GBIF LSID-GUID Task Group
Greg Riccardi, Co-chair
9 November 2009

Overview
• Task Group Overview
• The Characteristics of Effective Identifiers
• Benefits and Opportunities
• Recommendations
• Discussion Session
  • Thursday, 12 Nov

GUID Goals from GBIF Strategic Plans
• The GBIF strategic plans document includes these goals:
  • To consolidate the underlying enabling infrastructure and standardisation for global connectivity of biodiversity data and information
  • To develop a system of globally unique identifiers and encourage their use throughout biodiversity informatics
  • To use TDWG standards to allow all data objects to be identified using standard actionable globally unique identifiers
  • To provision GBIF web services and user interfaces to allow users to locate and view any data object with a standard globally unique identifier

Call to the Task Group
• GBIF convened a task group, the “LSID GUID Task Group” (LGTG),
  • to explore the issues and offer recommendations on the way forward, with particular reference to the GBIF network,
  • that will enable GBIF to provide architecture leadership and best practices for implementation.
• The principal objective of the group is
  • to provide recommendations and guidelines on deployment of identifiers on the GBIF network, with particular reference to the potential role of GBIF as a stable, long-term provider of identifier resolution services.

Members
• Phil Cryer (Missouri Botanical Garden)
• Roger Hyam (Natural History Museum and PESI)
• Chuck Miller (Missouri Botanical Garden)
• Nicola Nicolson (Royal Botanic Gardens, Kew)
• Éamonn Ó Tuama (GBIF)
• Rod Page (University of Glasgow)
• Jonathan Rees (Science Commons)
• Greg Riccardi (co-chair, Florida State University)
• Kevin Richards (Landcare Research, New Zealand)
• Richard White (co-chair, Cardiff University)

Results
• Report document
  • Draft written at the August 2009 workshop at GBIF
  • Revised for distribution in October 2009
• Contents of report
  • Overview of definitions and technology
  • Recommendations for the GBIF secretariat and for the biodiversity community
• Report delivered to GBIF Science Committee
  • Response of committee (at end of talk)

Preliminary Definition
• An identifier is a character string associated with an object.
• Identifiers are used in informatics to refer to objects in data sets, documents and repositories.
• Some identifiers are useful
• Some are more useful

Characteristics of Effective Identifiers
• Two use cases that make identifiers effective for users
• Uniqueness of reference to a single object
  • An identifier can be used to aggregate information about the identified object
  • For example, information received from multiple sources associated with a single identifier is information about a single object.
• Actions may be carried out using the identifier
  • An identifier can be used to find further information about the object, concept or data to which it refers.
  • This information might be interpreted directly or used to support services.
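The aggregation use case can be illustrated with a minimal Python sketch. The record layout, the provider names and the identifiers below are all hypothetical and are not taken from the task group report; the sketch only shows that records sharing an identifier can be filed together as information about one object.

```python
from collections import defaultdict


def aggregate_by_identifier(records):
    """Group (identifier, source, fields) records by their identifier."""
    merged = defaultdict(dict)
    for identifier, source, fields in records:
        # Records that carry the same identifier describe the same object,
        # so their fields are filed under that identifier, keyed by source.
        merged[identifier][source] = fields
    return dict(merged)


# Hypothetical records received from two providers.
records = [
    ("id:specimen-1", "provider_a", {"country": "Cuba"}),
    ("id:specimen-1", "provider_b", {"collector": "J. Smith"}),
    ("id:specimen-2", "provider_a", {"country": "Mexico"}),
]
aggregated = aggregate_by_identifier(records)
```

Keeping the fields separated by source, rather than merging them into one dictionary, is a design choice of this sketch: it avoids silently overwriting conflicting values from different providers.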

Problems with terminology
• The task group struggled with terms
• GUID is problematic
  • Used in IT to refer to the way that Microsoft uses 128 bit UUIDs
  • Used in biodiversity to refer to …
• Persistent, actionable identifier
  • The Task Group recommendation for terminology
  • Two required characteristics: persistent and actionable

Persistent Identifier
• Persistence: The property that an identifier always refers to a specific object.
  • All information associated with a persistent identifier is about the same object.
  • The properties of the object are subject to change, but once a persistent identifier is assigned to one object, it cannot be reused to refer to a different object.
• Example
  • ITIS TSNs are integers that are persistent identifiers for taxa
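The persistence rule can be sketched as a small registry that lets an object's properties change but refuses to reassign an identifier to a different object. The class name, the integer identifier and the object keys are hypothetical illustrations; this is not how ITIS or any particular provider implements persistence.

```python
class PersistentRegistry:
    """Toy registry: an identifier, once assigned, always means the same object."""

    def __init__(self):
        self._object_for = {}   # identifier -> object key (never reassigned)
        self._properties = {}   # identifier -> mutable properties

    def assign(self, identifier, object_key):
        existing = self._object_for.get(identifier)
        if existing is not None and existing != object_key:
            # Reusing the identifier for another object would break persistence.
            raise ValueError(f"{identifier!r} already refers to {existing!r}")
        self._object_for[identifier] = object_key
        self._properties.setdefault(identifier, {})

    def update_properties(self, identifier, **changes):
        # Properties of the object may change without affecting identity.
        self._properties[identifier].update(changes)

    def lookup(self, identifier):
        return self._object_for[identifier], dict(self._properties[identifier])


registry = PersistentRegistry()
registry.assign(1001, "taxon-A")                    # hypothetical integer identifier
registry.update_properties(1001, rank="species")    # allowed: same object, new data
```

Calling `registry.assign(1001, "taxon-B")` after this would raise `ValueError`, which is the behaviour the slide describes.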

Actionable Identifiers
• An identifier is actionable if there is a service that, given the identifier, provides information about the object identified
  • E.g., a resolution service maps an identifier into a Web service that provides information about the identifier and its associated object
• Example
  • An HTTP URI is actionable.
  • The HTTP system provides mechanisms for clients to access information about a data object from its associated identifier.
  • ITIS TSNs are actionable because ITIS supports services that provide information for TSNs.
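A resolution service of the kind described above can be sketched as a lookup from an identifier to the address of a service that describes it. The `scheme:value` identifier form, the resolver table and the `example.org` URL template are assumptions made for illustration; they do not describe any real GBIF or ITIS service.

```python
# Hypothetical table: identifier scheme -> URL template of a describing service.
RESOLVER_TABLE = {
    "tsn": "https://example.org/taxa/{value}",
}


def resolve(identifier):
    """Return a URL that provides information about the identifier, or None."""
    if identifier.startswith(("http://", "https://")):
        # An HTTP URI is already actionable: HTTP itself is the resolution service.
        return identifier
    scheme, _, value = identifier.partition(":")
    template = RESOLVER_TABLE.get(scheme)
    if template is None or not value:
        # No known service for this identifier, so it is not actionable here.
        return None
    return template.format(value=value)
```

For example, `resolve("tsn:1001")` yields the placeholder URL for that value, while an identifier whose scheme has no registered service yields `None`.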

Good Identifier Technologies
• HTTP URI: A fundamental technology of WWW
  • Persistence assured using DNS
  • Actionable through HTTP protocol
• LSID: Life Science Identifiers
  • Persistence assured by convention
  • Actionable according to the LSID services model
  • May be mapped into HTTP URI by resolution services
• Recommendation: Both are important to biodiversity and should be supported by GBIF
• UUID
  • Persistence assured by random assignment
  • Not independently actionable
  • Can be an effective part of HTTP URI and LSID technologies
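How the three technologies fit together can be sketched in Python. The sketch assumes the standard LSID form `urn:lsid:<authority>:<namespace>:<object>[:<revision>]`; the proxy base URL, the authority `example.org` and the namespace names are hypothetical placeholders, and prefixing an LSID with a proxy address is only one possible way a resolution service might map it to an HTTP URI.

```python
import uuid
from typing import NamedTuple, Optional


class LSID(NamedTuple):
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str] = None


def parse_lsid(text):
    """Split an LSID into its parts, raising ValueError if it is malformed."""
    parts = text.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"not an LSID: {text!r}")
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID: {text!r}")
    if not all(parts[2:5]):
        raise ValueError(f"empty LSID component in {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return LSID(parts[2], parts[3], parts[4], revision)


# Hypothetical HTTP proxy in front of an LSID resolution service.
PROXY_BASE = "https://resolver.example.org/"


def lsid_to_http_uri(text):
    """Map an LSID to an HTTP URI by prefixing it with the proxy address."""
    parse_lsid(text)  # validate before building the URI
    return PROXY_BASE + text


def mint_lsid(authority, namespace):
    """Use a random UUID as the object part of a new LSID."""
    return f"urn:lsid:{authority}:{namespace}:{uuid.uuid4()}"
```

A randomly generated UUID is not actionable on its own, but embedded in an LSID (or in an HTTP URI path) it inherits the resolution mechanism of the enclosing identifier.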

Example Benefits of IDs
• Tracking citation and impact
  • The association among objects might be contained in a blog post:
    • Joe writes “I searched the GBIF repository for all frogs from Cuba. The collection of objects that I found useful are in the collection [ID1]. I plotted the locations of the records [ID2] and reported the results in my paper [ID3].”
  • Such an association provides feedback and is used by search engines in rankings and ratings
• Management and disambiguation of taxon names
  • Disambiguation of taxon names requires services that support tests of difference as well as of equality.
  • Different identifiers do not necessarily refer to different objects.
  • Tests of inequality for objects must rely on evaluation of metadata or of the objects themselves.
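The asymmetry between tests of equality and tests of difference can be sketched as follows. The comparison rule used here (a conflicting value in any shared metadata field counts as evidence of difference) is an assumption made for illustration, not a rule from the task group report, and the identifiers and metadata fields are hypothetical.

```python
def compare_objects(id_a, meta_a, id_b, meta_b):
    """Return 'same', 'different' or 'undetermined' for two identified objects."""
    if id_a == id_b:
        # Equal persistent identifiers always refer to the same object.
        return "same"
    # Different identifiers prove nothing by themselves, so look at metadata.
    shared_fields = set(meta_a) & set(meta_b)
    if any(meta_a[field] != meta_b[field] for field in shared_fields):
        return "different"
    # No conflict found: the two identifiers may or may not denote one object.
    return "undetermined"


first = {"name": "Genus species", "year": 1900}
second = {"name": "Genus species", "year": 1900}
third = {"name": "Genus species", "year": 1950}
```

With these values, comparing `first` and `second` under two different identifiers gives `"undetermined"`, which reflects the slide's point that different identifiers do not necessarily refer to different objects.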

Opportunity
• Integrating identifiers with the Semantic Web and the Linked Data model
  • Linked Data (http://linkeddata.org) is a vision of a web of interconnected data, to be consumed by machines
  • HTTP URIs are used as identifiers, and the data is described using RDF
  • If we use HTTP URIs for identifiers, we will be part of Linked Data
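A minimal sketch of what "HTTP URIs as identifiers, described using RDF" can look like is given below, written without any RDF library. The `example.org` URIs are placeholders, and the choice of the Dublin Core `references` property to link a paper to a data collection is an assumption of this sketch. Only URI-valued objects are handled, serialised as N-Triples.

```python
DCTERMS_REFERENCES = "http://purl.org/dc/terms/references"


def to_ntriples(triples):
    """Serialise (subject, predicate, object) URI triples as N-Triples text."""
    lines = []
    for subject, predicate, obj in triples:
        # In N-Triples each statement is three terms followed by a full stop,
        # with URIs wrapped in angle brackets.
        lines.append(f"<{subject}> <{predicate}> <{obj}> .")
    return "\n".join(lines)


# Hypothetical statement: a paper references a collection of records.
triples = [
    ("http://example.org/id/paper-3", DCTERMS_REFERENCES,
     "http://example.org/id/collection-1"),
]
document = to_ntriples(triples)
```

Because subject, predicate and object are all HTTP URIs, a machine consuming this statement can follow any of them to find more data, which is the core idea of Linked Data.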

Potential Linked Data Model

Recommendations: GBIF Should
• Take the leadership role in driving the application and use of identifiers in biodiversity informatics,
• Provide materials such as an executive summary targeted to administrative leadership explaining the costs and benefits of implementing persistent identifiers,
• Educate the community in general persistent identifier principles and practices,
• Encourage, support and advise on the use of appropriate identifier technologies, in particular LSIDs and HTTP URIs, but not impose a requirement for one at the expense of the other, and provide specific advice for the issuing and use of LSIDs and of HTTP URIs,
• Support a promotional programme,
• Demonstrate good practice in its data portal,
• Assist providers that are not currently maintaining their own persistent identifiers to do so: this includes both education and technology,
• Make data more inter-connected,
• Start a programme to become an RDF consumer and encourage data providers to deploy RDF services,
• Provide services to support identifier resolution, redirection, metadata hosting, and caching,
• Provide additional services, including persistent identifier monitoring services,
• Extend the role of its data portal by hosting resources related to the use of identifiers, such as the TDWG vocabularies,
• Assist with the availability of software for data and service providers, and
• Continue to be funded to provide support to data providers for the foreseeable future.

Response of the GBIF Science Committee
• The SC reviewed and endorsed the report of the LSID GUID TG (LGTG).
• The SC recommends that
  • An additional full case study be developed in the document to highlight the new quality control mechanisms that can be established so that users can report and receive feedback on the quality of data being served.
• Additionally,
  • the LGTG report makes excellent “obligatory reading material” for the biodiversity informatics community in general and for GBIF Participants in particular.
  • The SC strongly recommends that all participants read it and be aware of the impact that the implementation of tools such as IPT and GBRDS will have in their local contexts as well as globally.

How to contact GBIF:
Web site: www.gbif.org
Data portal: data.gbif.org
GBIF Secretariat, Universitetsparken, Copenhagen, Denmark