DataCite: Making Data Citable
Jan Brase (DataCite/TIB Hannover), Brigitte Hausstein (GESIS), Wolfgang Zenk-Möltgen (GESIS)

Introduction: Where do we stand?
- Data is difficult to manage after project funding ends
- No direct access to data
- No widely used method to identify datasets
- No widely used method to cite datasets
- No effective way to link between datasets and articles
- Datasets are not included in impact analysis

DataCite
- Establishes easier access to scientific research data
- Increases acceptance of research data
- Supports persistent identification of data using the DOI system
- Supports archiving of data for verification and re-use
DataCite is a global consortium founded in London on 1 December 2009.

Membership
- Fifteen members across ten countries
- Over 800,000 records registered with DOI names so far

Supporting the community
- Researchers, by enabling them to locate, identify, and cite research datasets with confidence
- Data centres, by providing workflows and infrastructure to identify and cite datasets
- Publishers, by enabling research articles to be linked to the underlying data

Structure and responsibilities
DataCite (registration agency):
- Maintains the resolution infrastructure
- Maintains a searchable database of metadata
- Manages DOIs over the long term
- Establishes best practice
Allocation agencies (DC member institutes):
- Create the identifier
- Quality assurance
- Maintain a searchable database of metadata
- Establish best practice
Publishing agents (data centers, data publishers):
- Data storage and access
- Creating and updating metadata

Registration agency for social science data: da|ra
- Since February 2010: GESIS is a member of DataCite
- Pilot project, March to December 2010
- Technical and organisational concept
- Metadata schema
- Technical implementation and registration of datasets (GESIS data archive: EVS, Eurobarometer etc.)
- Implementation of a registration portal for social and economic data, including upgrade of services

Technical system (SOA)
[Architecture diagram. Labelled components: user, publication agent, da|ra information system, registry service, DDI service, indexing service, metadata storage, resolving service, DataCite, DOI Foundation. Labelled interactions: search, edit/import.]

da|ra policy framework
- Service Level Agreement (SLA): basis for the cooperation with publication agents
- Guidelines & best practices
- da|ra policy: general policy for the assignment of Digital Object Identifiers (DOI)

Register: Who & what?
Who?
- Data archives
- Research data centers
- Service data centers
- Future: individual researchers (via self-archiving)
What?
- Survey data
- Aggregate data
- Micro data
- Qualitative data
- Future: pictures, further data formats, scales

DataCite metadata kernel
Goals:
- Recommend a citation format for datasets
- Provide the basis for interoperability
- Promote dataset discovery
- Lay the groundwork for future services
Status:
- August 2010: Draft kernel available for community review
- September 2010: Comment period ended; comments from 37 individuals, 24 of them outside DataCite institutions
- Until 1st quarter 2011: Publish final metadata kernel

DataCite metadata properties
Mandatory properties:
- Identifier (currently DOI)
- Creator (repeatable)
- Title (Subtitle, Alternative Title, Translated Title; repeatable)
- Publisher
- Publication Year
Optional properties (all repeatable):
- Discipline
- Contributors (of several types, like Contact Person, Data Collector etc.)
- Dates (of several types, e.g. Available, Created, Accepted etc.)
- Resource Types, Descriptions, AlternateIdentifiers
- Format, Version, Size, Language
- Relationship to other resources
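The mandatory list above lends itself to a simple completeness check. The following is a minimal sketch, not DataCite's own tooling: the dict-based record layout, the lower-case key names, and the example DOI are assumptions made for illustration.

```python
from typing import List

# The five mandatory properties named on the slide.
# The snake_case key names are an assumption of this sketch.
MANDATORY = ("identifier", "creators", "title", "publisher", "publication_year")


def missing_mandatory(record: dict) -> List[str]:
    """Return the mandatory property names that are absent or empty."""
    return [name for name in MANDATORY if not record.get(name)]


# Hypothetical record; the DOI and names are placeholders, not real data.
record = {
    "identifier": "10.1234/example",
    "creators": ["Doe, Jane"],
    "title": "Example Survey 2010",
    "publisher": "Example Data Archive",
}

print(missing_mandatory(record))  # ['publication_year']
```

A record would only be passed on for DOI registration once this list comes back empty.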

DataCite mandatory metadata properties I (work in progress)
Each entry gives ID, property name, definition, occurrence (Occ), and any constraint or note.
- 1 Identifier. Definition: A globally unique persistent identifier associated with a resource. This is the primary identifier of the resource, and the one that will be used in any citation of the resource.
- identifierScheme. Definition: The name of the persistent identifier scheme. Occ: 1. Controlled list; allowed values: DOI.
- 2 Creator. Definition: The main researchers involved in producing the data, or the authors of the publication in priority order. Occ: 1-n. Note: The personal name format may be distinguished by using the namePart attribute.
- 2.1 nameIdentifier. Definition: Uniquely identifies an individual or legal entity, according to various schemes. Occ: 0-1. Note: The format is dependent upon scheme.
- 2.2 nameIdentifierScheme. Definition: The name of the name identifier scheme. Occ: 1. Note: Examples are ORCID, ISNI.
- 2.3 namePart. Definition: The parts of a personal name. Occ: 0-1. Allowed values: family, given.
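The occurrence column uses three notations: "1" (exactly one), "0-1" (optional, at most one) and "1-n" (at least one, repeatable). As a rough sketch of how such constraints could be checked, the helper names below are hypothetical and not part of any DataCite software:

```python
from typing import Optional, Tuple


def parse_occurrence(occ: str) -> Tuple[int, Optional[int]]:
    """Parse '1', '0-1' or '1-n' into (minimum, maximum); None means unbounded."""
    low, _, high = occ.partition("-")
    if not high:
        # A single number such as "1" means exactly that many occurrences.
        return int(low), int(low)
    return int(low), None if high == "n" else int(high)


def occurrence_ok(occ: str, count: int) -> bool:
    """Check whether `count` values satisfy the occurrence constraint `occ`."""
    minimum, maximum = parse_occurrence(occ)
    return count >= minimum and (maximum is None or count <= maximum)


# Occurrence values as given for the Creator rows of the table.
OCCURRENCE = {
    "Creator": "1-n",
    "nameIdentifier": "0-1",
    "nameIdentifierScheme": "1",
    "namePart": "0-1",
}

print(occurrence_ok(OCCURRENCE["Creator"], 3))         # True
print(occurrence_ok(OCCURRENCE["nameIdentifier"], 2))  # False
```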

DataCite mandatory metadata properties II (work in progress)
Each entry gives ID, property name, definition, occurrence (Occ), and any constraint or note.
- 3 Title. Definition: A name or title by which a resource is known. Occ: 1-n. Note: The format is open.
- 3.1 titleType. Definition: The type of the title. Occ: 0-1. Controlled list; allowed values: AlternativeTitle, Subtitle, TranslatedTitle.
- 4 Publisher. Definition: A holder of the data (including archives as appropriate) or institution which submitted the work. Any others may be listed as contributors. This property will be used to formulate the citation, so consider the prominence of the role. In the case of datasets, "publish" is understood to mean making the data available to the community of researchers. Occ: 1.
- 5 PublicationYear. Definition: The year when the data was or will be made publicly available. If an embargo period has been in effect, use the date when the embargo period ends. Occ: 1. Format: YYYY.
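Because the mandatory properties are meant to feed a dataset citation, a short sketch can show how the pieces might be assembled. The slides do not spell out the citation layout, so the ordering and punctuation used here (creators, year, title, publisher, DOI) are an assumption for illustration, as are the function name and the sample values.

```python
import re
from typing import List


def format_citation(creators: List[str], year: str, title: str,
                    publisher: str, doi: str) -> str:
    """Build a one-line citation from the five mandatory properties.

    The layout is an assumed example, not a format taken from the slides.
    """
    # PublicationYear is specified as YYYY in the table above.
    if not re.fullmatch(r"\d{4}", year):
        raise ValueError(f"PublicationYear must have the form YYYY, got {year!r}")
    authors = "; ".join(creators)  # creators are kept in priority order
    return f"{authors} ({year}): {title}. {publisher}. doi:{doi}"


# Placeholder values; the DOI is hypothetical.
print(format_citation(
    ["Doe, Jane", "Roe, Richard"],
    "2010",
    "Example Survey 2010",
    "Example Data Archive",
    "10.1234/example",
))
# Doe, Jane; Roe, Richard (2010): Example Survey 2010. Example Data Archive. doi:10.1234/example
```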

da|ra metadata schema
Goals:
- Support the DataCite metadata kernel
- In addition: domain-specific possibilities for retrieval and discovery (social sciences, economics)
- Support German and English metadata
- To be further developed with publication agents

da|ra metadata properties
Mandatory properties:
- All DataCite mandatory properties
- Dates of Data Collection
- Topic Classification
- Language, Last Edition, Availability Status
- Other internally required properties
Optional properties:
- All DataCite optional properties
- Universe, Selection Method
- Area of Collection (repeatable)
- Collection Mode
- Publications (repeatable)
- Links (repeatable)

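da|ra mandatory metadata properties (work in progress)
Each entry gives ID, property name, mapping to DataCite (where one is given), definition, and occurrence (Occ).
- 1 Title. Definition: Title of the dataset. Occ: 1.
- 3 DOI. Mapping to DataCite: Identifier (type = DOI). Definition: Persistent Identifier (DOI) assigned to the resource. Occ: 1.
- 4 URL. Definition: Uniform Resource Locator that will be registered with the DOI. Occ: 1-n.
- 6 Internal ID. Mapping to DataCite: AlternateIdentifier. Definition: Internal ID for the da|ra system. Occ: 1. Note: Assigned by the da|ra system.
- 7 Publisher. Definition: Name of the publication agency for the resource. Occ: 1.
- 8 Registration Agency (Homepage, Contact, ). Mapping to DataCite: Contributor (type = Registration Agency). Definition: Name of the registration agency ("GESIS da|ra"). Occ: 1.
- 9 Dates of Data Collection. Mapping to DataCite: Date (type = Start/End). Definition: Description of the time the data was gathered. Occ: 1-n.
- 10 Principal Investigator (Name and/or Institution). Mapping to DataCite: Creator (type = Data Collector). Definition: Name and/or institution of the principal investigators. Occ: 1-n.
- 17 Topic Classification. Mapping to DataCite: Description (type = Keywords). Definition: Classification of the topics covered by the dataset. Occ: 1-n.
- 19 Language. Definition: Language of the dataset. Occ: 1.
- 20 Last Edition. Mapping to DataCite: Version. Definition: Version description of the dataset. Occ: 1.
- 21 Publication Date. Mapping to DataCite: Publication Year. Definition: Date the dataset was made publicly available. Occ: 1.
- 29 Availability Status. Mapping to DataCite: Rights. Definition: Description of the conditions under which the data is available. Occ: 1.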
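The mappings stated in the table can be written down as a lookup from da|ra property to DataCite property and type. This is only a sketch: it includes just the rows where the table names a mapping, and the function name, record layout and sample values are hypothetical.

```python
# da|ra property -> (DataCite property, DataCite type or None), as listed above.
# Rows for which the table shows no mapping are deliberately left out.
DARA_TO_DATACITE = {
    "DOI": ("Identifier", "DOI"),
    "Internal ID": ("AlternateIdentifier", None),
    "Registration Agency": ("Contributor", "Registration Agency"),
    "Dates of Data Collection": ("Date", "Start/End"),
    "Principal Investigator": ("Creator", "Data Collector"),
    "Topic Classification": ("Description", "Keywords"),
    "Last Edition": ("Version", None),
    "Publication Date": ("Publication Year", None),
    "Availability Status": ("Rights", None),
}


def to_datacite(dara_record: dict):
    """Split a da|ra record into mapped (property, type, value) triples
    and the names of properties without a stated mapping."""
    mapped, unmapped = [], []
    for name, value in dara_record.items():
        if name in DARA_TO_DATACITE:
            prop, prop_type = DARA_TO_DATACITE[name]
            mapped.append((prop, prop_type, value))
        else:
            unmapped.append(name)
    return mapped, unmapped


# Hypothetical record with placeholder values.
mapped, unmapped = to_datacite({
    "DOI": "10.1234/example",
    "Last Edition": "1.0.0",
    "Language": "English",
})
print(mapped)    # [('Identifier', 'DOI', '10.1234/example'), ('Version', None, '1.0.0')]
print(unmapped)  # ['Language']
```

Returning the unmapped names separately keeps the gaps visible rather than silently dropping those properties.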

da|ra mandatory metadata properties in DDI 3
[Annotated DDI 3 example. Labelled elements: internal ID, English title, German title, principal investigator name, publisher, registration agency, publication date, language, DOI, study description, UNIVERSE_REF, topic classification. Example content shown: "Study Documentation of GESIS1234".]

da|ra mandatory metadata properties in DDI 3 (cont.)
[Annotated DDI 3 example, continued. Labelled elements: start date, end date, last edition (version description not in format n.n.n), RecLayRef, DOI URL, ArchiveOrg, availability status. Example content shown: "GESIS".]

Metadata interoperability
Conclusions:
- DDI 3 can hold DataCite mandatory metadata properties
- DDI 3 can also hold da|ra mandatory metadata properties
- Mapping for optional properties has to be done
- Increased visibility for research data from social science and economics

da|ra: 4465 registered studies