Presentation on theme: "Managing your research data"— Presentation transcript:
1Managing your research data Ben TaylorsonAcademic Liaison Librarian
2Outline Why manage data? What is data management? Data life cycle Putting together a planActively managing dataMetadataBackupsVersionsStoring and sharing
3What are data?Research data, unlike other types of information, is collected, observed, or created, for purposes of analysis to produce original research results.Qualitative or quantitativeAnalogue or digital – both have challenges
4Reasons to manage your data Responsible conduct of researchFunding body grant requirementsResearch integrity and replicationIncrease research efficiencySave time and resourcesEnhance data securityPrevent duplication of effort by enabling others to use your data
5Climategate1,000 private s and many other documents were stolen or leaked from the University of East Anglia's (UEA) Climatic Research Unit (CRU) in November 2009While HoC Select Committee cleared them of scientific failings, it did find room for improvement in research practices.
6Activity What is data management? Think about A definition Elements of data managementQuestions/issues
7A definition“actively managing data for as long as it continues to be of scholarly, scientific, research and/or administrative interest […] managing it from its point of creation until it is determined not to be useful, and ensuring its long-term accessibility and preservation, authenticity and integrity.Adapted from Digital Curation Centre definition for digital curationIt is not just archiving or preservation“”
8What is data management? PlanningCreationProcessingDescribing, archiving and organisationAnalysingPreservation and securityAccess and reuseEthics and privacyDisposal
9Data life cycle www.data-archive.ac.uk/create-manage/life-cycle creatingprocessinganalysingpreservinggiving accessreusingData life cycle
10Specific PlansICPSR FrameworkDigital Curation Centre Data Management PlanIndividual institutions e.g. Oxford and MIT
11Creating dataTypes of data e.g. text, numerical, models, multimedia, softwareFormat e.g. Word or PDF, XML or Excel? Consider longevity and choose open formatsHow much data will you produce?How will you document it?Will the data change or be updated? Tracking?Will it be reproducible?What if it was lost?
12Metadata Accurately describing your data so you can find and understand it again efficientlyothers can reuse your data easilyDescriptive, administrative, structuralBasic – files and folders in WindowsComplex – XML, Dublin CoreWhere will this be stored? With the data? Will you need additional storage/software?
13Basic metadata Flickr For own use Project level descriptor then breakdown into useful groupingsUnique element including datePhD\Primary Research\Interviews\phase 1\ Government officials\Highlevel\MrSmith mp3PhD\Primary Research\Interviews\phase 1\ Government officials\Highlevel\ MrSmith docxFlickr
14More complex metadata Dates Funders Language Location Rights List of file names and relationshipsFormatsMethodology
15More complex metadataWorkflowsSourcesVersionsChecksumsExplanation of codes used in file namesList of codes used in filesStore metadata in a text file (such as a readme file or codebook) in the same directory as the data
16Version control Will you retain originals or overwrite as you go? Will anyone else be editing the information and do you need to track these changes?Need to consider this before deciding on naming conventions
17Storage Short term Think about volume of data Which media you will use do you need something more than DVD/portable hard driveSecurityCost
18BackupsMake 3 copies which are geographical distributed (original + external/local + external/remote)ITS will do much of this for you but what if remote from Durham?How frequently?Analogue dataConsider digitising if unique
19Preservation Long-term, more strategic Selection criteria Time-scale – how long will it be saved for?DisposalAdditional information necessary for deposit?Does it need to be migrated?Where will it be deposited? Will they manage it for you?
20Where to preserve your data UK Data ArchiveArchaeology Data Service, History DS, Economic and Social DS, Oxford Text ArchiveNo one repository at Durham University for data, only outputs; speak to Sebastian Palucha at Main Library
21Sharing Will you share it? Are you obliged to share it? Who will be interested in it? How might they use it?Are there reasons not to fully disclose data?How will it be accessed?When will you make it available? Embargo?Will you publish findings that rely on the data?Consider FOI
22DisseminationDeposit in a specialist data centre, dedicated to archiving digital dataSubmitting to a journal (may be required)Deposit in a self-archiving system or an institutional repositoryVia a project or institutional websiteInformally on a peer-to-peer basis e.g.
23ActivityThinking about the data life cycle, look at the ICPSR guidanceTry and fill in some of the sections of the DCC Data Management PlanHave you identified any areas on which you will need to seek further advice?
24Sources of guidance Durham University UK Data Archive (Social Sciences and Humanities)Create and Manage DataDigital Curation CentreResearch Information NetworkFunders’ web sites
25Conclusions Good data management = good research practice Needs management throughout its life cyclePlanning helpful and possibly a requirement of fundersDepositing data for preservation and accessSlides available at