NIH BioCADDIE / Force11 Data Citation Pilot Kickoff Meeting Nine Zero Hotel, Boston MA, 3 February 2016 Introduction: Tim Clark, Maryann Martone and Joan.

Slides:



Advertisements
Similar presentations
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Advertisements

VO Sandpit, November 2009 Data Citation, Principles and Practice Sarah DataCite Annual Conference, 2014.
Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information.
Global Alignment and Collaboration Jo
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
Data Publishing Workflows: Strategies and Standards
Vivien Bonazzi Ph.D. Program Director: Computational Biology (NHGRI) Co Chair Software Methods & Systems (BD2K) Biomedical Big Data Initiative (BD2K)
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
1 APARSEN - WP2200 Identifiers and Citability Interoperability Framework for PI systems Webinar on PI - 15 February 2013 Maurizio Lunghi.
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
Data Citation: the next big thing… ?!?! 1 Victoria University 20 Nov
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
SCIENCE, RESEARCH DATA, AND PUBLISHING Stewart Wills Editorial Director, Web & New Media, Science 26 February 2013.
Making Connections: SHARE and the Open Science Framework Jeffrey Open Repositories 2015.
Big Data to Knowledge (BD2K) Jennie Larkin, Ph.D. NIH RDA P5 March 10,2015.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Integrating Automated Data Deposit into the Journal Publishing Workflow Eleni Castro, Research Coordinator > IQSS Harvard Digital Publishing Collaborative,
(a)Live and kicking! OAI3 CERN, Geneva, February 2004 Lilian van der Vaart, programme manager DARE.
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Making Data Accessible Yolanda Gil USC/ISI February 20, 2015 "To deposit or not to deposit, that is the question - journal.pbio g001"
Summary of RDA Outputs so far dr. Ir. Herman Stehouwer 22 September 2015.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Data Citation: framing the discussion and global context Dr Simon Hodson Executive Director, CODATA Referencing data in publications: principles,
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
Persistent Identifiers (PIDs) & Digital Objects (DOs) Christine Staiger & Robert Verkerk SURFsara.
MPS Workshop 1: Gauging the Impact of Requirements for Public Access to Data November 19, 2015 Jennie Larkin, Ph.D. Office of the Associate Director for.
Entering the Data Era; Digital Curation of Data-intensive Science…… and the role Publishers can play The STM view on publishing datasets Bloomsbury Conference.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
The BioCADDIE / FORCE11 Data Citation Pilot © 2015 FORCE11.orgFORCE11.org Tim Clark, Ph.D. Harvard Medical School & Massachusetts General Hospital Maryann.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
Future Functionality and CrossRef Policy Special Member Meeting December 4th, 2001.
NIH: DATA SCIENCE & BD2K Jennie Larkin, PhD Senior Advisor, Extramural Programs and Strategic Planning Office of the Associate Director for Data Science,
Data Citation Implementation Pilot Workshop
Data Citation Dataverse Mercè Crosas Chief Data Science and Technology Officer, IQSS, Harvard Workshop: Data Citation.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
National Institutes of Health U.S. Department of Health and Human Services Planning for a Team Science Evaluation ∞ NIEHS: Children’s Health Exposure Analysis.
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Enhancements to Galaxy for delivering on NIH Commons
RDA WG on Dynamic Data Citation
Jennie Larkin, PhD Senior Advisor
NIH BioCADDIE / FORCE11 Data Citation Pilot (DCIP) Status Update
First Light for DOIs at ESO
INFRA : Scientific Information Repository supporting FP7
Designing a better future: Active, actionable DMPs
Paolo Budroni, University of Vienna
ACS 2016 Moving research forward with persistent identifiers
Publishing software and data
Policy and publishing developments for sharing data and code
Preprints and Other Interim Research Products NIH perspectives
OpenML Workshop Eindhoven TU/e,
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Bird of Feather Session
Dataverse for citing and sharing research data
How to Implement an Institutional Repository: Part II
Persistent identifiers for instruments (PIDINST) working group
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Presentation transcript:

NIH BioCADDIE / Force11 Data Citation Pilot Kickoff Meeting Nine Zero Hotel, Boston MA, 3 February 2016 Introduction: Tim Clark, Maryann Martone and Joan Starr

Big Data to Knowledge: Goals of BD2K Facilitate broad use of biomedical digital assets by making them discoverable, accessible, and citable. Conduct research and develop the methods, software, and tools needed to analyze biomedical Big Data. Enhance training in the development and use of methods and tools necessary for biomedical Big Data science. Support a data ecosystem that accelerates discovery as part of a digital enterprise. biocaddie.orgdatascience.nih.gov/bd2k

Goals of this Pilot 1. Biomedical articles published citing robustly archived data within 1 year. 2. Common use of Joint Declaration of Data Citation Principles (JDDCP) implementation guidelines (Starr et al. 2015); focus on primary data. 3. Publicize results in coordination with other related projects and organizations.

1. Publish biomedical articles citing robustly archived data a. Landscape of issues. b. “Matrix of goodness” - Good/better/best. c. Expert Groups - Virtual help desk. d. Goal: advance capabilities.

2. Use JDDCP implementation guidelines a. Publishers: JATS, workflow, identifiers. b. Repositories: resolvable identifiers, landing pages. c. Identifier services: harmonize. d. Citation managers: new metadata elements.

3. Publicize results. a. Perspective articles. b. Social media. c. Reports from Expert Groups. d. CODATA global seminars.

Data Citation Generic Example example of a data citation as it would appear in a reference list* Author(s), Year, Dataset Title, Data Repository or Archive, [Accession], Global Persistent Identifier, version or subset Principle 2: Credit and Attribution (e.g. authors, repositories or other distributors and contributors) Principle 4: Unique Identifier (e.g. DOI, Handle.). Principle 5, 6 Access, Persistence: A persistent link to a landing page with metadata and access information Principle 7: Version and granularity (e.g. a version number or a query to a subset) In addition, access to versions or subsets should be available from the landing page, *Note that the format is not intended to be defined with this example, as formats will vary across publishers and communities [Principle 8: Interoperability and flexibility].

Project Timeline Evaluation Planning OctJanApr Jul Pilot Kick off Meeting Outreach Phase II (funding dependent)

Good: endorse the JDDCP principles Better—level above and… Plan for adoption of JATS and identify / resolve obstacles Require data deposition by authors in general and/or domain repositories. Obtain accession number from authors, check that data is properly archived. Best:—levels above and… Publish articles w/ cited data using JATS 1.1d3. Incorporate citation info in JATS XML. Properly render JATS data citation in HTML and PDF. All levels: Participate in workshops and discussions Matrix of Goodness: Publishers

Matrix of Goodness: Repositories Good: endorse the JDDCP principles Better—level above and… Reliably archive data Support minimal metadata required for citation Get or provide actionable PIDs (more info later!) Best—levels above and Assign actionable globally unique PIDs Use PIDs that resolve to landing page then to data (via content negotiation, for machines) All levels: Participate in workshops, discussions, expert groups.

Participants And you!

Plan for Today Morning Early adopter panel JATS presentation Equal time for repositories Afternoon Getting to a resolvable ID Break out sessions and planning for action steps

Le mieux est l'ennemi du bien. - Voltaire, Dictionnaire Philosophique, 1770 DCIP Strategy: get started and improve over time.