DOIs for Tracking and Citing Scientific Data J. Klump, J. Wächter and M. Lautenschlager CODATA Conference 2006 Beijing, PR China.

Slides:



Advertisements
Similar presentations
The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising.
Advertisements

doi> Digital Object Identifier: overview
Pilot Implementation: Publication and Citation of Scientific Primary Data Result of CODATA WG, supported by DFG Jan Brase Learning Lab Lower Saxony, Uni.
The German National Library of Science and Technology as a DOI RA 2007.
Access to non-textual information 2008 Jan Brase IDF Open Meeting: Resource Access for a Digital World June 17th, 2008, Brussels.
IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
Ubiquity of Grey Literature in a Connected Content Context Julia Gelfand University of California, Irvine Paper presented at GL5 Conference.
Std-doi Publication of Climate Data at WDCC DataCite Summer Meeting 7./8. June 2010 Publication of climate data Heinke Höck World Data Center for Climate.
Introduction to DataCite Adam Farquhar PhD Head of Digital Library Technology, The British Library President, DataCite June 2010.
LOCALIZED REFERENCE LINKING PROJECT Dale Flecker NFAIS/NISO Linking Workshop February 24, 2002 Philadelphia.
CEOS Working Group on Information Systems and Services, WGISS-24, Oberpfaffenhofen, Oct , 2007 GFZ Representative Report Bernd Ritschel, GFZ ISDC.
Introduction to DataCite Adam Farquhar, PhD Head of Digital Library Technology, The British Library President, DataCite June, 2010.
Data Acquisition and Data Publishing with eSciDoc Matthias Razum DataCite Summer Meeting Hannover June 7-8, 2010.
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, M. Lautenschlager, H. Luthardt,
Pilot Implementation: Publication and Citation of Scientific Primary Data Result of CODATA WG, supported by DFG Jan Brase Learning Lab Lower Saxony, Uni.
M.Lautenschlager (WDCC, Hamburg) / / 1 Conception of Citing Scientific Primary Data (Result of CODATA WG, supported by DFG) Michael Lautenschlager.
Michael R. Lightner Candidate for 2005 IEEE President-Elect IEEE Computer Society Board of Governors Long Beach, CA June 10, 2004.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
German Cluster of WDCs for Earth System Research - Entwurf - Michael Lautenschlager 1, Michael Diepenbroek 2, Hannes Grobe 2, Michael Bittner 3, Jens Klump.
M. Diepenbroek (MARUM), M. Lautenschlager (MPI-M), E. Paliouras (DLR), H. Grobe (AWI) CODATA General Assembly, Berlin World Data Center Cluster.
Review on 5 Years DataCite and 10 Years DOI Registration for Data DataCite Annual Conference 2014 Nancy, August 25th – 26th Michael Lautenschlager (DKRZ.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
The Bremen core repositories and data curation with PANGAEA Hannes Grobe Alfred Wegener Institute for Polar and Marine Research.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
M.Lautenschlager (WDCC / MPI-M) / / 1 AGU Fall Meeting, San Francisco, December 2005 Michael Lautenschlager - WDC Climate (Max-Planck-Institut.
China’s Scientific Data Sharing Initiatives and Future Perspective Pro. Peng, Jie Dr. Liu, Runda 5 March 2012,
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
World Data Center for Marine Environmental Sciences.
E - Physical Sciences & Engineering Jeff Pache IEE
Publication and Citation of Scientific Primary Data at WDC Climate (WDCC ) Michael Lautenschlager (WDCC) Heinke Höck (WDCC) Jan Brase (TIB) Susanne Waszkewitz.
DOI uses cases for data Jan Brase DOI outreach meeting November 21 st Milano.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
EGY General Meeting, Boulder, March 2007 GFZ Potsdam contribution to eGY Bernd Ritschel electronic Geophysical Year.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
Semantic linking of data and journal publications in the STD-DOI project Jens Klump and STD-DOI Team European GeoInformatics Workshop Edinburgh, 7 March.
Data Attribution and Citation Practices and Standards Fifth China - U.S. Roundtable on Scientific Data Cooperation Beijing, China, October, 2011.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
Making Data Accessible Yolanda Gil USC/ISI February 20, 2015 "To deposit or not to deposit, that is the question - journal.pbio g001"
Every bit counts Data management and data publication in the earth sciences Jens Klump et al. International Data Exchange Workshop Kiel, 10 May 2007.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Responsible Data Use: Copyright and Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Integration of the Activity Research Database and the Institutional Repository at Carlos III University of Madrid Teresa Malo de Molina Head Librarian.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Hannes Thiemann Michael Lautenschlager Deutsches Klimarechenzentrum GmbH, Germany EGU 2010.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Marat Rakhmatullaev, professor of Tashkent University of Information Technologies DIGITAL INFORMATION RESOURCES FOR INNOVATIVE ACTIVITIES IN SCIENTIFIC.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
IST SciX project: Lowering the technical, economic and social barriers to open scientific publishing Žiga Turk University of Ljubljana, Slovenia.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Russian Academy of Sciences
Access to Global Research in Agriculture to Support National and Regional Research and Academic Programmes Gracian Chimwaza Executive Director, ITOCA.
Digital Repository Certification Schema A Pathway for Implementing the GEO Data Sharing and Data Management Principles Robert R. Downs, PhD Sr. Digital.
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
Non-profit DOI registration agency for Scientific primary data
What Are Institutional Repositories?
Theses and TDX: Legal aspects
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research data in library catalogues and the joint initiative of European technical libraries for data registration Jan Brase Workshop Primary data for.
Bird of Feather Session
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Presentation transcript:

DOIs for Tracking and Citing Scientific Data J. Klump, J. Wächter and M. Lautenschlager CODATA Conference 2006 Beijing, PR China

Data publication today

Data in the publication process today Manuscript Publication Library DataMetadata Private Files After Helly et al. (2003)

The consequences Most data remain underutilised because they are not accessible. → Unnecessary duplication Research results cannot be verified. → Falsification of results. Calls to make data accessible and share data were welcomed but did not give any results.

Specific situation at GFZ Potsdam GFZ produces not only closed data sets but also time series from monitoring systems and observatories. Satellites (CHAMP, GRACE, future missions) Earth magnetic field variations Seismology Geodetic services (e.g. rotation, GPS baseline) Operation of these systems is labour intensive but is not fully appreciated in the scientific literature.

Example CHAMP No citation, only acknowledgement. The data sources need to be deduced from the paper. No Metadata. Often the source of data is not acknowledged.

Why data are not made accessible Data publication is hampered by structural barriers in the publication process: Journals do not devote space to data tables due to economic constraints and have no interest in archiving data. Authors do not receive professional recognition for publishing data because the datasets cannot be cited in a reliable way. Data are not cited because their location (URL), in many cases, is transient.

Necessary steps Data need to be citeable to be „valuable“. „Reputation“ is the currency of science. Authors will only prepare data for publication if the effort is worthwhile. Data publication is labour intensive. Data must be accessible. Access through persistent indentifiers and long-term archives. Intellectual property rights need to be secured. Authors need full control other their publications.

Project “Publication and Citation of Scientific Primary Data” Funded by the German Science Foundation. Implementation of services for the publication of data. DOI registration agency at German National Library for Science and Technology (TIB Hannover). To date 6 DOI registration agents. Project partners: WDC-MARE (Bremen/Bremerhaven) WDC Climate (Hamburg) GFZ Potsdam (proposed WDC-TERRA) WDC-RSAT (Oberpfaffenhofen) Inclusion of data publications into library catalogues.

Was is a DOI? DOI = Digital Object Identifier, a persistent, digital identifier for an object. DOI = Name of object, URL = Location of object. The location may change, the name persists, irrespective of the location of the object. Global resolving mechanism (handle.net) “translates” DOIs to URLs.

STD-DOI system architecture

System architecture at GFZ Data Source DOIDB TIB Hannover Library RSS TIBORDER / GBV Catalogue

Example GFZ Library TIBORDER doi: /GFZ.SDDB.1043 doi: /GFZ.SDDB.1043

TIBORDER / GBV Catalogue

How to cite data

Fair Use “Fair Use” of electronic sources is one of the most contested issues surrounding the use of the internet. Scientific publication are acknowledged by a “citation”. The citation is part of good scientific conduct. In that sense, data publications are analogous to “classical” publications.

The Creative Commons Licence Toolbox for the configuration of a custom licence. Our recommendation for scientific data: By attribution (citation) Non-commercial Share alike (derivative works have to be published under the same licence)

Questions remain Data publication attempts to change existing scientific practice. How does review of data publications work? What do trusted data repositories look like? What are the requirements of different scientific disciplines?

Thank you! Please visit our project website at Thank you for your attention!