DATA MANAGEMENT PLANNING: THE BASICS NCSU Libraries.

Slides:



Advertisements
Similar presentations
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
Advertisements

OPEN ACCESS PUBLICATION ISSUES FOR NSF OPP Advisory Committee May 30, /24/111 |
Data Management Plans PAUL H. BERN, PH.D. APRIL 3, 2014.
The Data Curation Profile IASSIST 2010 Jake Carlson Data Research Scientist Purdue University Libraries.
Data Management What? Why? How?. 2 What do we mean by … Managing your Research (aka Data) … Ensuring physical integrity of files and helping to preserve.
NSF Data Management Plan Requirements Alex Kanous
Archiving Data. Essential stuff to know Why deposit? Digital repositories ADS Guidelines Deposit evaluation & requirements Deposit checklist & template.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Open Exeter Project Team
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
Elements of a Data Management Plan Alison Boyer Environmental Sciences Division Oak Ridge National Laboratory.
Elements of a Data Management Plan
DMPTool Expert Resources and Support for Data Management Planning Tao Zhang Michael Witt Purdue University Libraries 1.
EPSRC expectations on research data: What researchers need to know 12/03/2015 Masud Khokhar and Hardy Schwamm.
What are research data? July 2015 This work is licensed under a Creative Commons Attribution 4.0 International LicenseCreative Commons Attribution 4.0.
Data Management Planning and DMPonline
U.S. Department of the Interior U.S. Geological Survey Planning for Data Management Creating data management plans for your project.
The importance of DART for funding agencies Dr. Ingrid Kissling-Näf.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
Research Data Management System project: Best Practices in Research Data Management* *Adaptation of the NECDMC.
UVa Library Research Data Services
Data Management Planning
A 40 Year Perspective Dr. Frank Scioli NSF-Retired.
The Data Management Plan Amanda Rinehart, Data Librarian Illinois State University The Data Management Plan by Amanda K Rinehart is licensed under a Creative.
NSF Data Management Plan Requirement Presentation May 25, 2011 William Mischo & Mary Schlembach.
Meet and Confer Rule 26(f) of the Federal Rules of Civil Procedure states that “parties must confer as soon as practicable - and in any event at least.
Data Management 101 for Earth Scientists Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory.
How to Publish Your Code on COIN-OR Bob Fourer Industrial Engineering & Management Sciences Northwestern University COIN Strategic Leadership Board.
Institute Repositories and Digital Preservation : Assessing Current Practices at Research Library Rathachai Chawuthai Information.
Choosing Between Data Sharing Repositories for Engineering Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
DMPTool and Data Management Basics Hannah Norton July 29, 2014 Image modified from :
Elements of a Data Management Plan Bill Michener University of New Mexico
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
How Not to Lose Track of Your Research Organization and Planning Resources at Brandeis Melanie Radik and Raphael Fennimore Library & Technology Services.
Primer on Data Management Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory American Meteorological Society.
DOE Data Management Plan Requirements
Data Management Lesley A. Brown Director of Proposal Development.
11 Researcher practice in data management Margaret Henty.
Copyright Issues in Data Management CHRISTINE FRUIN / SCHOLARLY COMMUNICATIONS LIBRARIAN.
Managing Access at the University of Oregon : a Case Study of Scholars’ Bank by Carol Hixson Head, Metadata and Digital Library Services
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Data Management Plans PAUL H. BERN, PH.D. APRIL 3, 2014.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Digital Stewardship Lee Dotson Digital Initiatives Librarian University of Central Florida John C. Hitt Library Presentation available at
Preserving your research data for future use This work is licensed under a Creative Commons Attribution 3.0 Unported License.Creative Commons Attribution.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Using the DMPTool for data management plans Kathleen Fear February 27, 2014.
Writing a Data Management Plan with the DMPTool Kathleen Fear January 15, 2015.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Writing a successful data management plan Kathleen Fear October 17, 2013.
Why ANDS? 16 May, 2011 Mathew Wyatt. Trends towards open data  Data science  Gov 2.0  Research 2.0  Open Science  Freedom of Information.
Preservation Planning Bojana Tasić FORS SEEDS Workshop I Belgrade, October.
Introduction to Managing Research and Personal Data.
Jeff Moon Data Librarian &
Open Exeter Project Team
Open Access and Research Data Management: An Overview for LLOs
Data Management What? Why? How?.
Institutional role in supporting open access, open science, open data
CFI John R Evans Leaders Fund Digital Data Management
General Finnish DMP Guidance
Getting Started with Data Management
Digital Project Lifecycle Curating Across the Curriculum
Getting Started with Data Management
Research Data Management
Research Data Management
Getting Started with Data Management & DMPTool
Research Data Dr Aoife Coffey, Research Data Coordinator
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

DATA MANAGEMENT PLANNING: THE BASICS NCSU Libraries

Why talk about Data Management?

Why manage Data? NCSU Data Mgt. Planning

AGENDA, OBJECTIVES & OUTCOMES Agenda & objectives –What is a DMP? –Elements of a DMP –Services and support at NCSU Outcomes –Understand components of a data management plan –Learn about DMPTool –Learn about available resources –Handouts – Checklist, Resources, NSF FAQ, Example DMP, NSF Template

Why “manage” data? go.ncsu.edu/data-snafu Why manage Data? NCSU Data Mgt. Planning

WHAT IS A DATA MANGEMENT PLAN (DMP)? WHAT: A DMP is a document that describes how you will organize, store, and share your research data – during and after you complete your research. WHO REQUIRES A DMP: ● Funders (e.g., NIFA?, NSF, NEH, NOAA, NIH, GBMF) ● Journals require data sharing (e.g., PLoS, PNAS, Nature) ● Institutional Review Boards WHY: ● Preserve your data for transparency and replication ● Facilitate new discoveries via re-use ● Increase visibility of your work

NCSU Data Mgt. Planning Why manage Data?

PRODUCTS OF RESEARCH or DATA TYPES What kinds of data (observations) are you generating? How much? What types of data? How are you creating or capturing the data? Are you using existing data?

WHAT IS DATA? Raw “stuff,” lacking context “Essence of science” One person’s data is another’s information Adapted from Greenberg & Baker (2012) -- * Observational * Experimental * Simulation * Derived/compiled

PRODUCTS OF RESEARCH or DATA TYPES POSITIVE: “This research will generate data resulting from sensor recordings (i.e. earth pressures, accelerations, and wall deformation) during the centrifuge experiments. In addition to the raw, uncorrected sensor data, converted and corrected data (in engineering units) as well as other forms of derived data will be produced. Metadata that describes the experiments with their materials, loads, experimental environment, and parameters will be produced. The experiments will be recorded with still cameras and video cameras. Photos and videos will be part of the data collection.” NEGATIVE: “The nature of the data or other materials produced under this NSF-sponsored project will include data characteristics such as observational, experimental, reference, derived, simulated and/or other. The data types referenced could include data generated by computer, data collected from sensors or instruments, images, video files, reports and/or other.”

DATA STORAGE & PRESERVATION How long should data be kept? Where will you deposit your data? What procedures are in place for preservation and backup? What transformations will be necessary to prepare data for preservation or sharing? What documentation is needed to make the data reusable? Courtesy of Flickr user Ian-S

DATA STORAGE & PRESERVATION POSITIVE: “For archiving, the data along with any related publications will be deposited in [___] data repository with an appropriate licensing statement. DOIs will be attached to all data stored for this project. Since the current preservation plan for the repository is indefinite data storage, preservation of access is assured.” NEGATIVE: “All data will be available at request immediately after the scientific results are published and will be stored at least another five years on magnetic and optical storage devices (hard disks, CDs, DVDs). Optical storage devices will serve the purpose of the Disaster Recover Plan.” MIXED: “The infrastructure provided by Agency B is not yet either designed or suitable for distribution of software that evolves and changes over time. Instead, we will release and distribute GPL versions of data analysis and numerical modeling software as tarballs directly from our respective webpages.”

DATA FORMATS & METADATA Which file formats will you use and why? How will you create or capture the contextual details that describe your data (“metadata”)? What metadata standards will you use? What other details will be necessary for others to work with your data? Courtesy of John Vickery

FILE FORMATS Will I be able to open this file in ten years? Is this format proprietary or open? Does this format preserve as much of the original information as possible? Is this format widely used and supported?

SOME OPEN FILE FORMATS For text: TXT, RTF, XML (DOCX, ODT, etc.) For tabular data: CSV, TSV For databases: CSV, SQL For images: TIFF, JPG 2000 (lossless), SVG For audio: MP3, FLAC (lossless), WAV For video: AVI, MP4 with H.264 Uncompressed, where possible

DATA FORMATS POSITIVE: “The data will be in standard, non-proprietary file formats to facilitate both data sharing and long-term preservation. The simulation code will be developed in C and provided to the public in source code format for non-commercial use under GNU GPL. The numerical data will be in TXT format so that they are readable by any text editing software. Data and documents in these formats (C, TXT) will likely be standard for a long time due to their wide use. In the situation that these formats become obsolete, we will reformat the documents.” NEGATIVE: “The data format includes digital data recorded by computers and instruments and metadata recorded in lab notebooks and reports.” POSITIVE: “Whenever possible, standard formats of data will be used, e.g.: images: TIFF, BMP, JPG.” NEGATIVE: “Data will be stored in the following formats: Acrobat PDF and Microsoft Excel, PowerPoint, and Word.”

METADATA

FILE NAMING AND VERSIONING Keep names short, if possible Use underscores ( _ ) or hyphens (-) instead of spaces to separate words Include a date in YYYY-MM-DD format ISO Standard 8601 Don’t include the folder name in the file name Include a version number (v1, v01) at the end For “final” version, include the word FINAL

FILE NAMING AND VERSIONING C:\Folder1\Folder2\Nov22007CopperData1.xls C:\Experiment\RawData\ _Cu_Run1.xls C:\Folder1\Folder2\2NOV2007CopperDataRepeat.xls C:\Experiment\RawData\ _Cu_Run2.xls

METADATA POSITIVE: “Metadata will include time, date, and location of measurement, object measured, THz equipment used, and personnel present.” NEGATIVE: “In addition to data, metadata concerning how the data was generated will be maintained and disseminated on request.” POSITIVE: “For each code made available, a user's manual will be provided with instructions for compiling the source codes, installing and running the codes, formulating input data streams, and visualizing the output. Documentation will be in TXT format.” NEGATIVE: “Embedded comments in English will be used to provide sufficient metadata to interpret the meaning of the Microsoft Word files. We have chosen this metadata format because it is flexible and enables later interpretation.”

DATA DISSEMINATION, SHARING & ACCESS How/when/how long will data be available? Are there any reasons not to limit or withhold the data? Are there ethical or privacy issues? If so, how will these be resolved? What have you done to comply with NCSU's IRB protocols? Who will hold the intellectual property rights to the data? Courtesy of commercialintegrator.com

DATA DISSEMINATION, SHARING & ACCESS POSITIVE: “These data will provide a detailed experimental look at the mechanical regulation of mesenchymal stem cell osteogensis. The data will further delineate the functional role of the cytoskeletal-focal adhesion- extracellular matrix signaling axis in the mechanoresponsive mesenchymal stem cell osteogenesis, as described in the main body of the proposal. As such, they will be of interest to the tissue engineering and regenerative medicine communities.” NEGATIVE: “No, or very little, derivative use of the data is expected since the research is aimed at developing machining process models and the data gathered is directed towards this end.” POSITIVE: “Data will be uploaded into the Project A repository and will conform to Project Data Sharing and Archiving Policies. These policies also define data confidentiality during the research, but require the data be available to the broader community 12 months after experiment completion.” NEGATIVE: “Data will be shared by mailing a CD to the requestor.”

ROLES & RESPONSIBILITIES Who is responsible for implementing the DMP? How will responsibility for the data be transferred? Who is responsible for long- term decisions about the data? What training is necessary to manage the data? Courtesy of conferencenike.com

ROLES & RESPONSIBILITIES POSITIVE: “For the proposed collaborative research, Dr. A from University B will take the lead and responsibility for coordinating and assuring data storage and access. However, Dr. C from University D will also be involved in managing, storing, and disseminating the results of the project, particularly in regards to data acquired through testing associated with the E device.” NEGATIVE: “All the investigators involved in the proposed project have equal rights to access the data generated through this sponsored project. They also have the same obligations to share data with each other and to publish the results in a timely manner.” POSITIVE: “Should the PI leave University F, the grant would likely be transferred. If not, the co-PI will assume leadership of the project and responsibility for the data.” NEGATIVE: “There will be no change to the scope of this data management plan should the PI or co-PI leave University G.”

Why manage Data? NCSU Data Mgt. Planning

NCSU DATA MANAGEMENT PLANNING GUIDE lib.ncsu.edu/guides/datamanagement

DMPTOOL Free scratch space for writing a DMP Step-by-step guidance Log in with NCSU Unity ID Template for each funding agency Links embedded

NCSU SERVICES & SUPPORT: IP & COPYRIGHT What do you own? –No © for raw data –Supplementary materials How will you share? –Preference for open access NCSU's Copyright & Digital Scholarship Center –lib.ncsu.edu/cdsc

NCSU SERVICES & SUPPORT NCSU Libraries –DMP Guide & DMPTool: lib.ncsu.edu/guides/datamanagement –DMP Review service, Consultation and guidance for storage & file naming, metadata, citing data, sharing data, data repositories –Subject librarians ready to help! Office of Information Technology –Shared services: oit.ncsu.edu/ac –Storage & backup, HPC, software licensing, IT security, web hosting, networking Research Administration –SPARCS: research.ncsu.edu/sparcs –Research integrity, compliance, proposal development, IRB, REACH NC, funding opportunities, scholarly data ownership

SUMMARY & QUESTIONS NCSU DMP Guide: lib.ncsu.edu/guides/datamanagement DMP Review service DMPTool Contact us at Image Courtesy of Flickr member Gonzalo AR Positive and negative examples via “NSF ENG Data Management Plan Template.” ©2011, Regents of the University of Michigan and subject to a Creative Commons Attribution 3.0 license.