Presentation is loading. Please wait.

Presentation is loading. Please wait.

Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson.

Similar presentations


Presentation on theme: "Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson."— Presentation transcript:

1 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones Digital Curation Centre, Glasgow joy.davidson@glasgow.ac.uk sarah.jones@glasgow.ac.uk

2 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 DCC curation lifecycle model Activities / phases to cover are: storing and preserving data accessing data licensing data citing data

3 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Key questions researchers may have: what are the policies of the Exeter repositories? how long will researchers’ data be kept? who will have access to the data? how can researchers make the most impact with their data? Ingest: where should data go?

4 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Guidance you may need to provide: Suggesting where researchers deposit their data at the bid stage (what policies does the repository have that may affect their data?) Advising on the type of license researchers may need for their data. This is especially important to clarify early on in collaborative research endeavours. Helping researchers to plan their impact from the outset. Helping to make potential citations as easy as possible.

5 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Where will researchers store their data? Exeter University has three repositories http://as.exeter.ac.uk/library/resources/openaccess/repositories Exeter Research and Institutional Content archive (ERIC) – theses and publications Digital Collections Online (DCO) – images and multimedia Exeter Data Archive EDA - research data DataCite list of data repositories http://datacite.org/repolisthttp://datacite.org/repolist

6 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Exeter Data Archive (EDA) EDA enables searches by subject authors Collections It can be a place to store research data but also a good way for researchers to find potential collaborators and gaps.

7 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Sustainability Prior to advising on places of data deposit, check the data repository’s sustainability claims – both for researchers’ data and for the repository itself. EDA Example EDA regularly backs up its files according to current best practice. In the event of Exeter Data Repository being closed down, the database will be transferred to another appropriate archive. Policies

8 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Formats Prior to advising on a place of deposit, check to make sure that the data repository accepts the format(s) researcher will be working with. Check to see if there are normalisation procedures (ingest, preservation). EDA Example EDA collects, preserves and makes available the University's research data. The content policy states that EDA will accept all types of data. Policies

9 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 What information do researchers need to provide? Most repositories have a clearly defined set of minimum information requirements. EDA Example Title Data creator Department Date of publication Dataset description Policies

10 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 details of how the data have been encoded (database structures, file formats); a list of software known to work with the data and their supporting information; indications of how the data relate to other data assets; administrative information (grant info, identifiers, checksums); General guidance

11 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 explanations of what the data represent (e.g. for sensor data, what the sensor was measuring and in what units); the processing history of the data (how they were generated and subsequently transformed, when and by whom); a narrative describing the context (why the data were generated/collected, what methodology was used and why). This information is particularly important for users as they interpret the data, and determine whether and how they can be integrated with other data. General guidance

12 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Access Prior to advising on place of deposit, check to make sure that the data repository’s policy on access meets researchers’ needs. EDA Example Anyone may access full items free of charge. Copies of full items generally can be: (a)reproduced, displayed or performed, given to third parties, and stored in a database in any format or medium (b) for personal research or study, educational, or not-for-profit purposes without prior permission or charge Policies

13 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Restrictions on Access Are there any restrictions on access to researchers’ data that the repository should be made aware of? EDA Example Items can be deposited at any time, but will not be made publicly visible until any publishers' or funders' embargo period has expired. This repository is not the publisher; it is merely the online archive. Policies

14 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Licensing Prior to deposit, work with researchers to determine the best license for their data. Make sure that researchers’ data license respects limits associated with any external data they are using. EDA Example Full items must not be sold commercially in any format or medium without formal permission of the copyright holders. Any copyright violations are entirely the responsibility of the authors/depositors Some full items are individually tagged with different rights permissions and conditions. Policies

15 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Two key issues to consider: Licensing – legal instrument stating what people can and can’t do with data Waivers – legal instrument allowing author to give up rights General guidance for data licensing Taken from DCC How-to guide on licensing data www.dcc.ac.uk/resources/how-guides www.dcc.ac.uk/resources/how-guides

16 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Attribution condition - allows others to copy, distribute, display, and perform the work as long as the creator is given due credit. Non-commercial – users cannot use the work for commercial purposes Share-alike – all derivative works must be released under the same licence as the original work Attribution Non-Commercial Share Alike (CC-BY-NC-SA) http://creativecommons.org/ Creative Commons

17 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Citation Work with researchers to help make their data citable to increase your potential impact (for researchers and REF). EDA Example Once your work has been approved for entry into the EDA, you will receive a notification via email. This email will contain a permanent link to your work - you should cite this link in preference to the URL of the item as it provides continuing persistent access in case the URL should ever change. Policies

18 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 If you have generated/collected data to be used as evidence in an academic publication, you should deposit them with a suitable data archive or repository as soon as you are able. If they do not provide you with a persistent identifier or URL for your data, encourage them to do so. General guidance for data citation Taken from DCC How-to guide on data citation www.dcc.ac.uk/resources/how-guides www.dcc.ac.uk/resources/how-guides

19 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 When citing a dataset in a paper, use the citation style required by the editor/publisher. If no form is suggested for datasets, take a standard data citation style and adapt it to match the style for textual publications. Give dataset identifiers in the form of a URL wherever possible, unless otherwise directed. Include data citations alongside those for textual publications. Some reference management packages now include support for datasets, which should make this easier. General guidance for data citation

20 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Cite datasets at the finest-grained level available that meets your need. If that is not fine enough, provide details of the subset of data you are using at the point in the text where you make the citation. If a dataset exists in several versions, be sure to cite the exact version you used. When you publish a paper that cites a dataset, notify the repository that holds the dataset, so it can add a link from that dataset to your paper. General guidance for data citation

21 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Using Exeter Data Archive (EDA) EDA is currently being piloted. If researchers you work with would like to place any of their data in the repository please email eda-pilot@exeter.ac.uk and a member of the team will be in contact.eda-pilot@exeter.ac.uk

22 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Feedback from PhD student training session Discipline specific support is better than generic Advice on where to store data during the active phase of research is needed Key messages should be introduced early on in the PhD process and reiterated at strategic points over the next few years Examples of good practice are helpful

23 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 But remember! The validity and authenticity of the content of submissions is the sole responsibility of the depositor. This is true for any data repository. Make sure researchers are aware of their responsibility.

24 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 But remember! You can’t do the work for researchers – this is a collaborative process. Start conversations at the bid stage/start of PhD and keep communications going. Feed into the Open Exeter project as it progresses to help shape effective support services.

25 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Managing data http://www.dcc.ac.uk/resources/external/tools- services/managing-active-research-data Sharing and tracking reuse http://www.dcc.ac.uk/resources/external/tools-services/sharing- output-and-tracking-impact Useful resources - DCC Tools catalogue

26 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Other ideas Researchers might consider sharing negative results. Journal of Negative Results in BioMedicine www.jnrbm.com/ Journal of Negative Results www.jnr-eeb.org/ The All Results Journals www.arjournals.com/ Consider co-authoring papers with researchers on their RDM activity in curation journals http://www.dcc.ac.uk/resources/curation-journals. http://www.dcc.ac.uk/resources/curation-journals

27 Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Any questions? For DCC guidance, tools and case studies see: www.dcc.ac.uk/resources Follow us on twitter @digitalcuration and #ukdcc


Download ppt "Because good research needs good data The DCC lifecycle model, Exeter Uni, 18-19 May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson."

Similar presentations


Ads by Google