Presentation on theme: "GRAD 521, Research Data Management Winter 2014 – Lecture 6 Amanda L. Whitmire, Asst. Professor."— Presentation transcript:
GRAD 521, Research Data Management Winter 2014 – Lecture 6 Amanda L. Whitmire, Asst. Professor
Roadmap 1.Characteristics & content of a Data Curation Profile (DCP) 2.The components of the DCP Toolkit 3.How these components fit together 4.Discussion: your ideas for the assignment
Characteristics of the DCP Tells “the story” of the data Focused on a specific data set: provides depth, not breadth Interview-based Meant to be “discipline neutral” & widely applicable to different types of data Modular – allows for flexibility & tailoring to specific situations and uses
Characteristics of the DCP Represents the researcher’s needs & perspectives A concise, structured document suitable for sharing & annotation A resource for librarians, archivists, IT professionals, data managers & others
DCP Sections Information about the data & its context 1. Summary of data curation needs 2. Overview of the research Focus Intended audience Funding 3. Data types, formats & stages Data narrative (data lifecycle) Target data for sharing Use/re-use value Contextual narrative
Data StageOutputTypical File SizeFormatOther / Notes Primary Data RawSensor data100k in 1 file per day proprietary to the sensor FTP downloads are mostly automated. Processing Stage 1 Sensor data – open/accessible format Roughly 6kb.csv /.xls Data are formatted into.csv before bring reformatted into a mySQL database. ProcessedData vectors 800 records per intersection per day. SQL /.xls Data are extracted from the mySQL database for analysis purposes. Analyzed charts/ Graphs.xls /.emf charts and graphs used for interpretation. Published charts/ graphs.ppt Data are presented via power point. Ancillary Data Image Stills taken from video.gif /.jpg /.pptImages generated from video. Example: Data types & formats
DCP Sections 4.Intellectual property 5.Organization & description of data 6.Ingest 7.Sharing & access 8.Discovery 9.Tools 10.Interoperability 11.Measuring impact 12.Data management 13.Preservation Information about practices & needs
The DCP Toolkit The Data Curation Profile Toolkit consists of 4 components: 1.User Guide 2.Interviewer’s Manual 3.Interview Worksheet 4.DCP Template
The User Guide Describes the rationale for the DCP Describes the process through which a DCP is generated Stage 1 – Preparation Stage 2 – Worksheet & interviews Stage 3 – Constructing the Profile Provides guidance & advice
Interview Worksheet & Manual Meant to be used in tandem The Interview Worksheet is given to the researcher to fill out. The Interviewer’s Manual contains follow up questions for the interviewer to ask once the researcher has filled out a module.
DCP Template Provides the structure of the Data Curation Profile Each section or sub-section contains a brief definition of the information that is needed to populate a Data Curation Profile
Module 2 Example Interview Worksheet Interview Manual
DCP Template Completed Profile Module 2 Example
Stages of the DCP Preparation Interviewing Connections between the components, pulling it all together
How to Develop a DCP A Data Curation Profile is developed through 3 stages: Stage 1 – Preparation Stage 2 – Interviews Stage 3 – Constructing the Profile
Stage 1 – Preparing Investigate researchers’ work and use of data (e.g., a recent article/grant) Faculty’s website Faculty publications Seminars Press releases Review of grants that have been awarded
Stage 2 – Interviewing Introduction to the Interview Need for two interviews? Time required Coverage
Stage 2 – Interviewing Using the Interview Manual & Worksheet 1.Read any introductory statement listed in the “Interviewer’s Manual” (if any) 2.Then have the researcher complete the list of questions for the module in the “Interview Worksheet” 3.Review the responses and ask the questions listed in the “Interviewer’s Manual” as appropriate 4.Ask any follow up questions you feel are needed 5.Move on to the next module
Stage 2 – Interviewing Types of Worksheet Questions: Free text Short answer (text) Selecting from a range of possible responses Yes/No Likert Scale
Stage 2 – Interviewing Types of Questions: Manual Explanatory – “Tell me why you selected “x” as your response” Clarifying – “Could you explain what you mean by “x”? Probing – “Could you tell me more about “x”? Relational – “Could you tell me how “x” relates to your earlier response of “y”?
Discussion 1. Modifications to the DCP 2. Audio recording 3. Biggest challenges? 4. Realistic timeframe
Modifications to the DCP How many of the profile modules do you want to include? Required interview modules: Background demographic Q’s Mod.01: The dataset Mod.02: Lifecycle of dataset Mod.03: Sharing Mod.04: Access Mod.06: Organization & description of data Mod.12: Data Management Optional interview modules: Mod.05: Data Xfer/ingest Mod.07: Discovery Mod.08: Intellectual property Mod.09: Tools Mod.10: Linking/Interop. Mod.11: Measuring impact Mod.13: Data Preservation
Audio recording Is it necessary & possible? Would you & your interviewee be OK w/it?