Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones

Similar presentations


Presentation on theme: "The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones"— Presentation transcript:

1 The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones
Digital Curation Centre, Glasgow

2 DCC curation lifecycle model
Activities / phases to cover are: storing and preserving data accessing data licensing data citing data

3 Ingest: where should you data go?
Key questions: which repository is right for your data? what are the policies of the repository? how long will your data be kept? who will have access to your data? how can you make the most impact with your data? 3

4 Guidance: Think about where you will deposit your data from the outset (what policies does the repository have that may affect your data?) Determine how you will license your data early on. This is especially important to clarify early on in collaborative research endeavours. Think about your impact from the outset. Make potential citations as easy as possible.

5 Where will you store your data?
Exeter University has three repositories Exeter Research and Institutional Content archive (ERIC) – theses and publications Digital Collections Online (DCO) – images and multimedia Exeter Data Archive EDA - research data DataCite list of data repositories

6 Exeter Data Archive (EDA)
EDA enables searches by subject authors Collections It can be a place to store your data but also a good way to find potential collaborators and gaps.

7 Policies Sustainability
Prior to deposit, check the data repository’s sustainability claims – both for your data and for the repository itself. EDA Example EDA regularly backs up its files according to current best practice. In the event of Exeter Data Repository being closed down, the database will be transferred to another appropriate archive. 7

8 Policies Formats Prior to deposit, check to make sure that the data repository accepts the format(s) you will be working with. Check to see if there are normalisation procedures (ingest, preservation). EDA Example The Exeter Data Archive (EDA) collects, preserves and makes available the University's research data. The content policy states that EDA will accept all types of data. 8

9 Policies What information do you need to provide?
Most repositories have a clearly defined set of minimum information requirements. EDA Example Title Data creator Department Date of publication Dataset description 9

10 General guidance details of how the data have been encoded (database
structures, file formats); a list of software known to work with the data and their supporting information; indications of how the data relate to other data assets; administrative information (identifiers, checksums); 10

11 General guidance explanations of what the data represent (e.g. for sensor data, what the sensor was measuring and in what units); the processing history of the data (how they were generated and subsequently transformed, when and by whom); a narrative describing the context (why the data were generated/collected, what methodology was used and why). This information is particularly important for users as they interpret the data, and determine whether and how they can be integrated with other data. 11

12 Policies Access Prior to deposit, check to make sure that the data repository’s policy on access meets your needs. EDA Example Anyone may access full items free of charge. Copies of full items generally can be: reproduced, displayed or performed, given to third parties, and stored in a database in any format or medium (b) for personal research or study, educational, or not-for-profit purposes without prior permission or charge 12

13 Policies Restrictions on Access
Are there any restrictions on access to your data that the repository should be made aware of? EDA Example Items can be deposited at any time, but will not be made publicly visible until any publishers' or funders' embargo period has expired. This repository is not the publisher; it is merely the online archive. 13

14 Policies Licensing Prior to deposit, think about how you will license your data. Make sure that your data license respects limits associated with any external data you are using in your work. EDA Example Full items must not be sold commercially in any format or medium without formal permission of the copyright holders. Any copyright violations are entirely the responsibility of the authors/depositors Some full items are individually tagged with different rights permissions and conditions . 14

15 General guidance for data licensing
Taken from DCC How-to guide on licensing data Two key issues to consider: Licensing – legal instrument stating what people can and can’t do with your data Waivers – legal instrument allowing author to give up rights 15

16 Attribution Non-Commercial Share Alike (CC-BY-NC-SA)
Creative Commons Attribution condition - allows others to copy, distribute, display, and perform the work as long as the creator is given due credit. Non-commercial – users cannot use the work for commercial purposes Share-alike – all derivative works must be released under the same licence as the original work Attribution Non-Commercial Share Alike (CC-BY-NC-SA) 16

17 http : / /creativecommons .org /publicdomain/zero/1.0/.
CC0 The most permissive way of releasing data is under a dedication to the public domain. This is where all copyright interests and database rights are waived, allowing the data to be used as freely as possible. Creative Commons Zero (CC0) is the Creative Commons tool for dedicating works to the public domain. It works on two levels: as a waiver of a person’s rights to the work, and in case that is not effective, as an irrevocable, royalty-free and unconditional licence for anyone to use the work for any purpose. http : / /creativecommons .org /publicdomain/zero/1.0/. 17

18 Policies Citation Make sure that your data is citable to increase your potential impact. EDA Example Once your work has been approved for entry into the EDA, you will receive a notification via . This will contain a permanent link to your work - you should cite this link in preference to the URL of the item as it provides continuing persistent access in case the URL should ever change. 18

19 General guidance for data citation
Taken from DCC How-to guide on data citation If you have generated/collected data to be used as evidence in an academic publication, you should deposit them with a suitable data archive or repository as soon as you are able. If they do not provide you with a persistent identifier or URL for your data, encourage them to do so. 19

20 General guidance for data citation
When citing a dataset in a paper, use the citation style required by the editor/publisher. If no form is suggested for datasets, take a standard data citation style and adapt it to match the style for textual publications. Give dataset identifiers in the form of a URL wherever possible, unless otherwise directed. Include data citations alongside those for textual publications. Some reference management packages now include support for datasets, which should make this easier. 20

21 General guidance for data citation
Cite datasets at the finest-grained level available that meets your need. If that is not fine enough, provide details of the subset of data you are using at the point in the text where you make the citation. If a dataset exists in several versions, be sure to cite the exact version you used. When you publish a paper that cites a dataset, notify the repository that holds the dataset, so it can add a link from that dataset to your paper. 21

22 Using Exeter Data Archive (EDA)
EDA is currently being piloted. If you would like to place any of your data in the repository please and a member of the team will be in contact. 22

23 The validity and authenticity of the content of
But remember! The validity and authenticity of the content of submissions is the sole responsibility of the depositor. This is true for any data repository.

24 Useful resources - DCC Tools catalogue
Managing data Sharing and tracking reuse 24

25 Other ideas Consider sharing negative results.
Journal of Negative Results in BioMedicine Journal of Negative Results The All Results Journals Consider publishing on your RDM activity in curation journals 25

26 Any questions? For DCC guidance, tools and case studies see:
Follow us on and #ukdcc


Download ppt "The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones"

Similar presentations


Ads by Google