
Presentation transcript:

JCDL136p
SCOR/IODE/MBLWHOI Library Collaboration on Data Publication

Lisa Raymond (1), Linda Pikula (2), Roy Lowry (3), Ed Urban (4), Gwenaëlle Moncoiffé (3), Peter Pissierssens (5), and Cathy Norton (6)
1. Woods Hole Oceanographic Institution (WHOI); 2. NOAA Central Library; 3. British Oceanographic Data Centre (BODC); 4. Scientific Committee on Oceanic Research (SCOR); 5. IOC Project Office of IODE; 6. Marine Biological Laboratory (MBL)

Acknowledgments
Funding provided by the Jewett Foundation.

Introduction
Data collected in ocean sciences, whether generated from research or operational observations, are not always deposited in national or international databases in a format that makes them retrievable and reusable, or that even allows the reproducibility of reported research to be tested. Often there are insufficient incentives for data submission, only punishments for not submitting data, resulting in low submission rates and, even when data are submitted, a bare minimum of metadata. This issue is not unique to the ocean sciences, but several ocean science organizations have begun an effort to stimulate the submission and availability of ocean data.

Method
The Scientific Committee on Oceanic Research (SCOR), the International Oceanographic Data and Information Exchange (IODE) of the Intergovernmental Oceanographic Commission, and the Marine Biological Laboratory/Woods Hole Oceanographic Institution (MBLWHOI) Library are working together to develop and execute pilot projects related to two use cases: (1) data held by data centers are packaged and served in formats that can be cited, and (2) data related to traditional journal articles are assigned persistent identifiers that are referred to in the articles, and are stored in data repositories, such as DSpace repositories provided by libraries and IODE's PublishedOceanData repository (to be launched in summer 2011). A key feature of this activity is that data centers, libraries, and ocean scientists are working together to test the processes related to these two use cases. Appropriate persistent identifiers will provide a linkage between data, publications, and scientists' CVs.

Conclusions
While authors agree that it would be ideal to begin collaboration early in the research process, we have found that adapting workflows is a challenge. Use case authors have found it difficult to submit data early in the publishing process, and QA/QC of datasets is also an issue. Depositing quality data in a timely manner, so that DOIs for datasets can be included in the final version of the article, continues to be an obstacle.

By combining their efforts and expertise, a number of oceanographic data centers and libraries are putting in place infrastructure and guidelines that will enable researchers to obtain citable references for their primary datasets prior to publication, and also to deposit processed data used in journal articles in permanent, well-managed repositories. Versioning and the management of the relationships between the components of datasets are key requirements of the data publication process. Recent developments by groups such as DataCite will provide the framework needed to make substantial progress towards the publication of the more dynamic datasets.

Collaboration between libraries and data centers, such as with the Woods Hole based data center BCO-DMO, has allowed for the deposit of high-quality datasets, and procedures are now being developed to automate the transfer of metadata and data to the repository and the assignment of DOIs. This collaboration is also giving the Library an opportunity to talk to researchers before papers are published, while data issues are being discussed with the data center. In the next year we also expect to renew conversations with publishers, as we now have a workable model for oceanographic datasets that can be enhanced by coordination with journal producers.

Use Cases
From Fieldwork to Citation: Diagram showing the evolution of a scientific dataset through its different stages, from raw data collection (fieldwork), to the generation of master files, which must be preserved from accidental loss, self-described, exchangeable, re-usable, and citable (by deposition in an accredited data centre), through to the preparation of analyzed data subsets for the purpose of scientific investigation and communication. These data subsets must in turn be openly accessible and citable for the purpose of traceability.

1. The primary issue under investigation in this use case is how to express the continuous nature of data dynamically managed in a relational database management system in the quantized manner required for citation. The group that is managing this use case (the British Oceanographic Data Centre) is also working to document best practice for the physical composition (e.g. file formats) and semantic description of the content of such snapshots, to ensure confident re-use of the data in decades to come.

2. The goal of this use case is to identify best practices for tracking data provenance and clearly attributing credit to data collectors/providers for data published in journal articles. To improve the efficacy of data directly associated with a scientific article, those data must be discoverable, citable, and freely available on the Internet. Resources, standards, and workflows must be defined to support publisher and funding agency mandates. For the data to be discoverable, appropriate metadata, defined using community-accepted metadata standards, must be associated with the data source. Data will be made citable by the assignment of a persistent identifier as well as provenance and attribution metadata. The availability of the data will be assured by submission to a data repository that has stability and permanence.
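The two use cases above can be made concrete with a small sketch. It is not the project's workflow: it only illustrates one way a live, changing table could be frozen into a fixed snapshot (use case 1) and given an identifier plus provenance and attribution metadata for citation (use case 2). All names, the `doi:10.0000/...` identifiers, the sample rows, and the citation layout are assumptions made for illustration.

```python
import hashlib
import json
from dataclasses import dataclass
from datetime import date


def canonical_bytes(rows):
    """Serialize rows deterministically, independent of row and key order."""
    lines = sorted(json.dumps(row, sort_keys=True) for row in rows)
    return "\n".join(lines).encode("utf-8")


@dataclass(frozen=True)
class Snapshot:
    """Metadata for one frozen, citable export of a live database."""
    title: str
    creators: tuple
    publisher: str
    version: str
    issued: date
    identifier: str    # placeholder, not a registered DOI
    derived_from: str  # provenance: the source the rows were exported from
    checksum: str      # SHA-256 of the exported rows


def freeze(rows, **metadata):
    """Record a checksum of the rows alongside the descriptive metadata."""
    digest = hashlib.sha256(canonical_bytes(rows)).hexdigest()
    return Snapshot(checksum=digest, **metadata)


def cite(snapshot):
    """Format a citation string (an assumed layout, not a project standard)."""
    names = "; ".join(snapshot.creators)
    return (f"{names} ({snapshot.issued.year}). {snapshot.title}. "
            f"Version {snapshot.version}. {snapshot.publisher}. "
            f"{snapshot.identifier}")


# Hypothetical rows exported from a live database on two occasions.
rows_v1 = [{"station": "A1", "depth_m": 10, "temp_c": 12.4}]
rows_v2 = rows_v1 + [{"station": "A1", "depth_m": 20, "temp_c": 11.9}]

common = dict(
    title="Hypothetical CTD casts",
    creators=("Example, A.", "Sample, B."),
    publisher="Example Data Centre",
    issued=date(2011, 6, 1),
    derived_from="example_db.ctd_casts",
)
snap_v1 = freeze(rows_v1, version="1", identifier="doi:10.0000/example-v1", **common)
snap_v2 = freeze(rows_v2, version="2", identifier="doi:10.0000/example-v2", **common)

print(cite(snap_v1))
print(snap_v1.checksum != snap_v2.checksum)
```

Each export gets its own version label, identifier, and checksum, so a citation always points at fixed content even though the underlying database keeps changing. The checksum lets a later reader confirm that the file they hold is the one that was cited.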
Related Websites
PublishedOceanData
Woods Hole Open Access Server
Biological and Chemical Oceanography Data Management Office
British Oceanographic Data Centre
Information about the project, including reports from project meetings, can be found at

By Tyler Goepfert