Preservation, access and re-use of Research Data The STM view on publishing datasets Presented at the DataCite Summer Meeting 2010 Hannover, 8 June 2010.

Presentation transcript:

Preservation, access and re-use of Research Data: The STM view on publishing datasets
Presented at the DataCite Summer Meeting 2010, Hannover, 8 June 2010
Eefke Smit, Director, Standards and Technology, International Association of STM Publishers

2 Context
"… increased availability of primary sources of data in digital form has the potential to shift the balance away from research based on secondary sources such as publications, thus positioning data as the central element in the scientific process." (a statement from the Director of the Directorate General for Information Society and Media of the European Commission, 2008)
"If the raw data doesn't form a central part of the scientific record then we perhaps need to start asking whether the usefulness of that record in its current form is starting to run out." (from a blog called Science in the Open: the-pain-and-embarassment-make-all-the-raw-data-available/)
"… let us get back to the days where observational scientists could justify peer reviewed publication primarily on the basis of collection, description and reporting of high quality data sets (usually with some basic level of interpretation) …" (quote taken from a discussion paper called The Risk-Reward Basis for Data Publication, marine sciences, 2007)
"Problem = scientific community does not see online data as publication" (from a presentation called How to motivate scientists to publish data online, Mark J. Costello, June 2008)

3 What do scientists want?

4 How to locate data?

5 Where to submit data?

6 Some numbers: preview of PARSE.Insight results
Researchers:
Only 20 % of researchers share data
40 % have problems sharing it (distrust, legal and privacy issues)
But 80 % of researchers like to use data from others
What publishers do:
70 % of publishers (= 90 % of journals) accept data and other supplementary material
95 % of publishers facilitate linking to datasets
Less than 5 % of publishers have special facilities for datasets
60 % see the researcher and research institute as the responsible party to maintain and curate datasets

7 What do Publishers currently do? Instructions to authors in Tetrahedron

8 Supplementary files are linked directly from an article's abstract page.

9 Supplementary files are referenced within the article text and linked via the article's abstract page using the DOI.

10

11 How do Publishers view research data in the context of IP?
The Publishing Industry (STM/ALPSP) position is:
"… believe that, as a general principle, data sets, raw data outputs of research, and sets or subsets of that data should wherever possible be made freely accessible to other scholars" (Statement from STM & ALPSP, June 2006)
It is also stated that:
"… articles published in scholarly journals often include tables and charts in which certain data points are included or expressed. Journal publishers often do seek the transfer of or ownership of the publishing rights in such illustrations …, but this does not amount to a claim to the underlying data itself …"

12 Research data and the Publishers' Mission
Can we contribute to the data dissemination/retrieval process?
Storing, linking
Search, discovery
Can we contribute to research workflows?
Metadata, collections, ontologies
Visualization, mining, etc.
Can we meaningfully contribute to an editorial process for data?
Submission processes
Editorial organization, review
Publishers are committed to making genuine contributions to the research communities:
Support to the scholarly communication process
Increased availability of research output
Increased citations to research output
Increased overall quality of research
Developing new means of knowledge discovery
Increase in research efficiency

13 Support through the journal networks and publishing platforms
Move from:
General instructions to make data available as supplementary information with the online article
Textual references to data repositories & datasets
Verbal instructions, limited support by editorial team
To:
More granular definition of research data and supplementary information
Specific instructions on how, when and where to submit, and how to cite
Specific sustainable destinations for research data
Agreed formats & metadata requirements for data submission
Expand editorial teams with a data-editor
Hyper-linking between articles and (final) dataset destinations and vice versa
Federated searching
Intelligent (contextual) referencing of datasets in articles
Note: a successful implementation requires a combination of domain-specific and generic solutions

14 Working examples

15 Vice versa

What Publishers are busy solving
Peer review practices
Readability, navigation, accessibility, presentation
Discoverability: search, metadata, linking, citability
Copyright issues
Preservation and long-term archiving
Version control / dynamic data
Access, permissions for re-use
Editorial practice and support
See joint NISO/NFAIS initiative:

To make solutions scalable and sustainable, we need: convergence
Willingness in abundance among publishers. What we now need is:
Good collaboration with all stakeholders in the chain: researchers, research institutes, safe data repositories, libraries, policymakers
Standards and common practice building on what is in place already: from persistent identifiers and citation conventions to submission guidelines across scholarly journals
Scalable solutions that work across disciplines
Infrastructure: TIB and DataCite are excellent initiatives to get the right infrastructure in place

18 In conclusion
Do Publishers recognise the importance of data publishing? YES
Can Publishers help to get research data in the open? YES
Will Publishers help to improve the discoverability of data? YES
… and YES:
Solutions must be scalable & sustainable
Existing capabilities should be used as much as possible
We need close collaboration across the whole chain of researchers and research communities, libraries and data centres as well as the policy makers … and support DataCite.