Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Similar presentations


Presentation on theme: "Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill."— Presentation transcript:

1 Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill Michener, Board of Directors, wmichene@unm.edu Meredith Morovati, Executive Director, mmorovati@datadryad.org Todd Vision, Chair, Board of Directors, tjv@datadryad.org @datadryad http://datadryad.org

2 e-science http://research.microsoft.com/en- us/collaboration/fourthparadigm/4th_paradigm_book_jim_gray_transcript.pdf

3 http://www.stm-assoc.org/2011_12_5_ODE_Report_On_Integration_of_Data_and_Publications.pdf Data publication opportunities

4 Tenopir C et al. (2011) Data Sharing by [n=1329] Scientists: Practices and Perceptions. PLoS ONE doi:10.1371/journal.pone.0021101 Researcher attitudes toward data reuse Agree strongly or somewhat I would be willing to share data across a broad group of researchers who use data in different ways 78.2% It is important that my data are cited when used by other researchers. 88.9% It is appropriate to create new datasets from shared data 73.7%

5 Volume Rank frequency of data type Specialized repositories (e.g. GenBank) After Heidorn (2008) http://hdl.handle.net/2142/9127 http://hdl.handle.net/2142/9127 Long tail data Dryad’s vision is a world where research data is openly available, integrated with the scholarly literature, and routinely re-used to create knowledge. Dryad’s mission is to provide the infrastructure for, and promote the re-use of, data underlying the scholarly literature.

6 Why archive data at the time of publication ? Vines TH et al. (2013) Current Biology DOI:10.1016/j.cub.2013.11.014

7 Roadmap Dryad Lessons Challenges

8 Dryad

9 Joint Data Archiving Policy (JDAP) - 2011 Data are important products of the scientific enterprise, and they should be preserved and usable for decades in the future As a condition for publication, data supporting the results in the article should be deposited in an appropriate public archive Authors may elect to embargo access to the data for a period up to a year after publication Exceptions may be granted at the discretion of the editor, especially for sensitive information http://datadryad.org/jdap

10 Credit 10

11 Integration of manuscript and data submission

12 Curation

13 Dryad links to journals Provides citation instructions

14 Datasets are being cited

15 Portals for integrated journals (beta) http://datadryad.org/journal/amNat

16 All data are from a vetted scientific publication (such as a peer- reviewed article, thesis/dissertation, or book) and receive professional curation Submission integration with journals makes deposit easy for authors and curators provide user support Flexible to journal data policy (e.g. on embargoes, review, standards) Reciprocal linkage between article and data via a persistent, resolvable data DOI Data are citable, and preserved for the long term Data are free to download & reuse due to modest data publication charges Backed by a nonprofit organization sustained and governed by its diverse stakeholders Dryad Digital Repository

17 How to participate in Dryad Become a member –Elect the Board of Directors and approve changes to ByLaws –Stay informed through the Annual Community Meeting –Get discounts on submission fees –Financially sustain the repository –Help steer the future direction of the organization Integrate your journal with Dryad –Ensure the article and data are bidirectionally linked –Lower the burden on authors to make their data available –Improve compliance with the journal’s data policy –The process is tailored to each journal (e.g. embargo option) Sponsor data publication charges –As a service to your authors/researchers

18 Members NORDIC SOCIETY OIKOS

19 Metrics

20 Growth (2015 = ~3,800 +)

21 Building a valuable science resource

22 lessons

23 1. “You can't build a great building on a weak foundation.” Gordon B. Hinckley Gordon B. Hinckley

24 Understand costs & diversify funding streams Data publication charges (DPC) –primary source of revenue – enable free access in perpetuity Membership fees –fund annual membership meetings –provide a cost savings on DPC Project grants –support R&D activities

25 Sponsoring Data Publication Charges Individuals can deposit data associated with an article on their own, regardless of payment plan. If an author finds the journal it is submitting to does not have a payment plan, they can elect to pay $90 for deposit. Supporting payment plans on behalf of your authors makes it easy for authors and saves money.

26 Membership tiers

27 Partner & leverage

28 2. “Communication leads to community.” Rollo May Rollo May

29 Remote work

30 Skype, webex, google hangout, etc.

31 Slack

32 3. “Failing to plan is planning to fail.” Alan Lakein Alan Lakein

33

34 Turn the plan into action

35 Trello

36 Challenges

37

38 Table 1. Journal and publication year of 100 reviewed studies with associated data publicly archived in the digital repository Dryad (http://datadryad.org/). Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295 http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295

39 Fig 2. Completeness and reusability scores. Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295 http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295

40 Student training in the classroom

41 Training resources & workshops

42 PLoS Comp. Biology, Oct. 22, 2015, “Ten Simple Rules for Creating a Good Data Management Plan,” WK Michener. DOI:10.1371/journal.pcbi.1004525 (1)Determine the research sponsor requirements; (2)Identify the data to be collected; (3)Define how the data will be organized; (4)Explain how the data will be documented; (5)Describe how data quality will be assured; (6)Present a sound data storage and preservation strategy; (7)Define the project’s data policies; (8)Describe how the data will be disseminated; (9)Assign roles and responsibilities; and (10)Prepare a realistic budget.

43 NSF Public Access Plan Applies to proposals submitted in 2016 Restates longstanding policy that –“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants” –Allows costs of archiving within grants Restates 2011 Data Management Plan requirement –Further requires archiving plan in DMP to be followed Restates 2013 Biosketch policy allowing data to count as a product http://www.nsf.gov/pubs/2015/nsf15052/nsf15052.pdf

44 Efficacy of funder vs journal data policy Figure 5. Availability of archived phylogenetic data as a function of age. After: Magee et al. (2014) The Dawn of Open Access to Phylogenetic Data. PLoS ONE 9(10): e110268. doi:10.1371/journal.pone.0110268 200220112014 year Proportion of data available 200220112014

45 Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295 http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295 Other ideas ?

46 To learn more Dryad Digital Repository: http://datadryad.orghttp://datadryad.org Dryad News & Views blog: http://blog.datadryad.orghttp://blog.datadryad.org Twitter: @datadryad Feedback (Ideas Forum): http://datadryad.org/feedbackhttp://datadryad.org/feedback


Download ppt "Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill."

Similar presentations


Ads by Google