Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Slides:



Advertisements
Similar presentations
Building Support for a Discipline-Based Data Repository Ryan Scherle 1, Sarah Carrier 2, Jane Greenberg 2, Hilmar Lapp 1, Abbey Thompson 2, Todd Vision.
Advertisements

Ryan Scherle and Jane Greenberg. A Repository of Data Underlying Journal Articles.
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Mark Toole 25 March “the principle that the results of research that has been publicly funded should be freely accessible in the open domain is.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
PaN-data WP7 - Integration Brian Matthews STFC-e-Science.
US DOE’s Public Access Plan: A vision reaching fruition Ms. Deborah Cutler Alt. US INIS Liaison Officer Office of Scientific and Technical Information.
Current status Todd Vision (overview) Elena Feinstein (curation) Ryan Scherle (demo) 7/23/12Dryad Board of Directors1.
Data archiving in evolutionary biology Michael Whitlock.
Open Access Publishing with Wiley. Gold v Green Open Access Gold or pay to publish Open Access: Article is made freely accessible online to anyone anywhere.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
OPEN ACCESS PUBLICATION ISSUES FOR NSF OPP Advisory Committee May 30, /24/111 |
Introduction to the Dryad Digital Repository A nonprofit repository for data underlying the international scientific and medical literature. April 2013.
New business models for open research Todd Vision Jared Lyle Mark Hahnel 12-June-2014Open Repositories1.
Greater Reach for your Research: Author’s Rights & the Shifting Landscape of Scholarly Communication Lisa Goddard & Shannon Gordon Memorial University.
Data Publishing & Management Learning Objectives: 1.Introduce the advantages of publishing your data, the steps involved and how to publish to increase.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
Canadian Research Libraries: A History of Cooperation Canadian Research Libraries: A History of Cooperation Gwendolyn Ebbett Dean of the Library University.
EPSRC expectations on research data: What researchers need to know 12/03/2015 Masud Khokhar and Hardy Schwamm.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Open Access: An Introduction Edward Shreeves Director, Collections and Content Development University of Iowa Libraries
Literature/data integration and Ryan Scherle Data Repository Architect Dryad Digital Repository HighWire Fall Publishers’ Meeting November 20, 2013 You.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Evolving Roles in Scholarly Communications Susan Reilly, APA, Frascati, 7th Nov, 2012.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Managing Data: The Long View FORCE15 – 12 January 2015 Amy Friedlander, Ph.D.
Login / Upload / Share Deposit your scholarly research - it’s as easy as 1, 2, 3 MAIN MESSAGE key reasons enumerated ->please read speaker notes id / who.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
Supporting scientific communities by publishing data Dryad Digital Repository Peggy Schaeffer OpenAIRE/LIBER Workshop May 28, 2013 Ghent, Belgium.
The Information Challenge Exponential growth of resources New researchers with new needs Multiple communication options New expectations and opportunities.
A 40 Year Perspective Dr. Frank Scioli NSF-Retired.
What can publishers do to support data? Dryad’s perspective STM Annual US Conference - April 22, 2015 Meredith Morovati Executive Director Illustration.
Data archiving and curation Ryan Scherle Data Repository Architect Dryad Digital Repository CurateGear January 8, 2014 You may reuse any of the original.
Data Sharing and Archiving: A Professional Society View Clifford S. Duke Ecological Society of America September 9, 2010.
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
Preserving and Sharing Data: Best Practices & Requirements for Selecting a Data Sharing Repository
South Africa in the global knowledge arena: implications for academic libraries Andrew M. KANIKI Executive Director: Knowledge Management and Strategy.
BMJ and Data Sharing Claire Bower, Digital Communications
Can sharing research data raise your research profile and impact? Gerry Ryder Charles Darwin University, September 2015.
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
Elements of a Data Management Plan Bill Michener University of New Mexico
Data Citation: framing the discussion and global context Dr Simon Hodson Executive Director, CODATA Referencing data in publications: principles,
Research Information Management: Continuity, Change and Impact Michael Jubb Research Information Network UUK Workshop 5 December 2007.
Planning for School Implementation. Choice Programs Requires both district and school level coordination roles The district office establishes guidelines,
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
OA Challenges and expectations: th Sell Meeting, May 22-23rd Florence.
Managing Access at the University of Oregon : a Case Study of Scholars’ Bank by Carol Hixson Head, Metadata and Digital Library Services
1 Introducing the Australian National Data Service (ANDS) Research data as a scholarly output Options for data publishing and data discovery Make your.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
The R EPOSITORY AS P UBLISHER OPPORTUNITIES AND CHALLENGES IN A DUAL ROLE BEN HOCKENBERRY SYSTEMS LIBRARIAN | ST. JOHN FISHER COLLEGE.
Using the DMPTool for data management plans Kathleen Fear February 27, 2014.
Writing a Data Management Plan with the DMPTool Kathleen Fear January 15, 2015.
An Introduction to the USENIX Association The Advanced Computing Systems Association.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
NRF Open Access Statement
Robert R. Downs1and Robert S. Chen2
Witness Statement – TAIR
CNI Spring 2010 Membership Meeting
Getting Started with Data Management
Open Access to your Research Papers and Data
Research Data Management
Bird of Feather Session
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Getting Started with Data Management & DMPTool
Presentation transcript:

Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill Michener, Board of Directors, Meredith Morovati, Executive Director, Todd Vision, Chair, Board of

e-science us/collaboration/fourthparadigm/4th_paradigm_book_jim_gray_transcript.pdf

Data publication opportunities

Tenopir C et al. (2011) Data Sharing by [n=1329] Scientists: Practices and Perceptions. PLoS ONE doi: /journal.pone Researcher attitudes toward data reuse Agree strongly or somewhat I would be willing to share data across a broad group of researchers who use data in different ways 78.2% It is important that my data are cited when used by other researchers. 88.9% It is appropriate to create new datasets from shared data 73.7%

Volume Rank frequency of data type Specialized repositories (e.g. GenBank) After Heidorn (2008) Long tail data Dryad’s vision is a world where research data is openly available, integrated with the scholarly literature, and routinely re-used to create knowledge. Dryad’s mission is to provide the infrastructure for, and promote the re-use of, data underlying the scholarly literature.

Why archive data at the time of publication ? Vines TH et al. (2013) Current Biology DOI: /j.cub

Roadmap Dryad Lessons Challenges

Dryad

Joint Data Archiving Policy (JDAP) Data are important products of the scientific enterprise, and they should be preserved and usable for decades in the future As a condition for publication, data supporting the results in the article should be deposited in an appropriate public archive Authors may elect to embargo access to the data for a period up to a year after publication Exceptions may be granted at the discretion of the editor, especially for sensitive information

Credit 10

Integration of manuscript and data submission

Curation

Dryad links to journals Provides citation instructions

Datasets are being cited

Portals for integrated journals (beta)

All data are from a vetted scientific publication (such as a peer- reviewed article, thesis/dissertation, or book) and receive professional curation Submission integration with journals makes deposit easy for authors and curators provide user support Flexible to journal data policy (e.g. on embargoes, review, standards) Reciprocal linkage between article and data via a persistent, resolvable data DOI Data are citable, and preserved for the long term Data are free to download & reuse due to modest data publication charges Backed by a nonprofit organization sustained and governed by its diverse stakeholders Dryad Digital Repository

How to participate in Dryad Become a member –Elect the Board of Directors and approve changes to ByLaws –Stay informed through the Annual Community Meeting –Get discounts on submission fees –Financially sustain the repository –Help steer the future direction of the organization Integrate your journal with Dryad –Ensure the article and data are bidirectionally linked –Lower the burden on authors to make their data available –Improve compliance with the journal’s data policy –The process is tailored to each journal (e.g. embargo option) Sponsor data publication charges –As a service to your authors/researchers

Members NORDIC SOCIETY OIKOS

Metrics

Growth (2015 = ~3,800 +)

Building a valuable science resource

lessons

1. “You can't build a great building on a weak foundation.” Gordon B. Hinckley Gordon B. Hinckley

Understand costs & diversify funding streams Data publication charges (DPC) –primary source of revenue – enable free access in perpetuity Membership fees –fund annual membership meetings –provide a cost savings on DPC Project grants –support R&D activities

Sponsoring Data Publication Charges Individuals can deposit data associated with an article on their own, regardless of payment plan. If an author finds the journal it is submitting to does not have a payment plan, they can elect to pay $90 for deposit. Supporting payment plans on behalf of your authors makes it easy for authors and saves money.

Membership tiers

Partner & leverage

2. “Communication leads to community.” Rollo May Rollo May

Remote work

Skype, webex, google hangout, etc.

Slack

3. “Failing to plan is planning to fail.” Alan Lakein Alan Lakein

Turn the plan into action

Trello

Challenges

Table 1. Journal and publication year of 100 reviewed studies with associated data publicly archived in the digital repository Dryad ( Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e doi: /journal.pbio

Fig 2. Completeness and reusability scores. Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e doi: /journal.pbio

Student training in the classroom

Training resources & workshops

PLoS Comp. Biology, Oct. 22, 2015, “Ten Simple Rules for Creating a Good Data Management Plan,” WK Michener. DOI: /journal.pcbi (1)Determine the research sponsor requirements; (2)Identify the data to be collected; (3)Define how the data will be organized; (4)Explain how the data will be documented; (5)Describe how data quality will be assured; (6)Present a sound data storage and preservation strategy; (7)Define the project’s data policies; (8)Describe how the data will be disseminated; (9)Assign roles and responsibilities; and (10)Prepare a realistic budget.

NSF Public Access Plan Applies to proposals submitted in 2016 Restates longstanding policy that –“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants” –Allows costs of archiving within grants Restates 2011 Data Management Plan requirement –Further requires archiving plan in DMP to be followed Restates 2013 Biosketch policy allowing data to count as a product

Efficacy of funder vs journal data policy Figure 5. Availability of archived phylogenetic data as a function of age. After: Magee et al. (2014) The Dawn of Open Access to Phylogenetic Data. PLoS ONE 9(10): e doi: /journal.pone year Proportion of data available

Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e doi: /journal.pbio Other ideas ?

To learn more Dryad Digital Repository: Dryad News & Views blog: Feedback (Ideas Forum):