The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Section: The Case for Data Stewardship.

Slides:



Advertisements
Similar presentations
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
Advertisements

DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Elements of a Data Management Plan: Identifying the materials to be created Ruth Duerr National Snow and Ice Data Center Data Management Plans Copyright.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
Agency Requirements: NASA Data Management Plans Ronald Weaver National Snow and Ice Data Center W. Christopher Lenhardt Renaissance Computing Institute.
EMu and Archives NA EMu Users Conference – Oct Slide 1 EMu and Archives Experiences from the Canada Science and Technology Museum Corporation.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
Creating documentation and metadata: Introduction to metadata and metadata standards Lynn Yarmey National Snow and Ice Data Center Version 2.0 Review Date.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
Preserving the Scientific Record: Establishing Relationships with Archives Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review.
Providing Access to Your Data: Rights Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International Earth Science.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date Section: Data Management Plans.
Advertising your data: Using data portals and metadata registries Nancy Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright.
Elements of a Data Management Plan: Identifying the materials to be created Ruth Duerr National Snow and Ice Data Center Version Review Date Section:
Citing Data Sets in the Literature: ORNL DAAC Practices Robert Cook, Suresh SanthanaVannan, and Daine Wright Environmental Sciences Division Oak Ridge.
Creating Documentation and Metadata: Metadata for Discovery Lola Olsen 1, Tyler Stevens 2, 1 National Aeronautics and Space Administration (NASA) 2 Wyle.
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review.
Large Scientific Databases. Large scientific datasets are those which are systematically collected and organized and which stretch the technical capabilites.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
Managing Your Data: Backing Up Your Data Robert Cook Oak Ridge National Laboratory Section: Local Data Management Version 1.0 October 2012.
The Case for Data Management Ruth Duerr National Snow and Ice Data Center.
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
NOAA Administrative Order : Management of Environmental and Geospatial Data and Information Jeff Arnfield NOAA’s National Climatic Data Center Version.
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Copyright 2012 Matthew Mayernik. Version 1.0 October 2012 Section:
Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical.
Advertising your data Nancy Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright 2012 Nancy J. Hoebelheinrich.
The Case for Data Stewardship: Enhancing Your Reputation Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review Date]
Responsible Data Use: Data Restrictions Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International Earth Science.
Creating Documentation and Metadata: Introduction to Metadata and Metadata Standards Lynn Yarmey National Snow and Ice Data Center Version 1.0 February.
Managing Your Data: Assign Descriptive File Names Robert Cook Oak Ridge National Laboratory Section: Local Data Management Version 1.0 October 2012.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Data Matthew Mayernik National Center for Atmospheric Research Version.
Examples for Open Access Scholar Electronic Repository by New Bulgarian University IP LibCMASS Sofia 2011 Contract № 2011-ERA-IP-7 Sofia, September,
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Advertising your data: Agency requirements for submitting metadata Nancy J. Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright.
Why Create a Data Management Plan? Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
Federated Space-Time Query for Earth Science Data Using OpenSearch Conventions ESIP Federated Search Cluster Chris Lynnes Bruce Beaumont Ruth Duerr Hook.
A Proposed Short Course on Data Stewardship Scott Hausman Deputy Director NOAA’s National Climatic Data Center Preparing Scientists to Steward Their Data.
Responsible Data Use: Copyright and Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
NATIONAL TREASURES DATA PRESERVATION WITH METADATA Sharon Shin Metadata Coordinator Federal Geographic Data Committee Secretariat ASPRS-Reno 2006.
Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
Data Management Plans: Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
The Case for Data Stewardship: Enhancing Your Reputation Matthew Mayernik National Center for Atmospheric Research Version 1.0 September 2012 Section:
Creating Documentation and Metadata: Creating a Citation for Your Data Robert Cook Oak Ridge National Laboratory Section: Local Data Management Copyright.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Preservation Strategies: What goes into a long term archive? Ronald Weaver National Snow and Ice Data Center Version 1.0 Review Date.
Advertising your data Alecia Aleman 1, Ruth Duerr 2 1 National Aeronautics and Space Administration (NASA) 2 National Snow and Ice Data Center, University.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
Providing access to your data: Handling sensitive data Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
Data Formats: Choosing and Adopting Community Accepted Standards
Copyright 2013 Matthew Mayernik.
Preserving the Scientific Record: Case Study 1 – NSIDC Glacier Photos
Copyright 2012 Matthew Mayernik.
Copyright 2012 Lola Olsen & Tyler Stevens.
Bird of Feather Session
Presentation transcript:

The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Section: The Case for Data Stewardship Copyright 2012 Matthew Mayernik. Version 1.0 September 2012

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 The Scientific Record The scientific record is an aggregation of scientific journals conference presentations and proceedings technical reports and pre-prints the underlying data, software, and other evidence to support published findings This aggregation is highly distributed across Libraries Archives Museums Data Centers Academic publishers Investigator web sites

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Purpose of the Scientific Record Communicating findings, hypotheses, and insights from one person to another, across space and time Organizing scientific communities establishing common nomenclature and terminology connecting related work developing disciplines Documenting, managing, and resolving controversies and disagreements Establishing precedence for ideas and results Offering evidence for the quality and significance of scientific work through bibliometrics

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Challenges to the Scientific Record - 1 Increasing complexity of experiments and data cause the linkages between evidence and writings to become more complex and elusive Data sets are often extractions or compilations of other data sets Tracking the provenance of digital resources is very difficult Example* – computation-based scientific research Many data sets are now created through computational methods Different software packages (or custom-built code) might also be used to access, analyze, deposit, format, compile, and/or filter data. Metadata required to execute scientific software, including the libraries, compilers, operating system, and hardware description, might be orders of magnitude larger than the software itself. * Example based on Stodden, Mitchell, & LeVeque, 2012.

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Challenges to the Scientific Record - 2 Increasing rate of the growth of the literature and data Disciplines and sub-specialties branch and evolve continuously Example* – Most schools of Meteorology have been renamed in the past few decades. New names include: Atmospheric and Ocean Sciences Earth and Atmospheric Sciences Geological and Atmospheric Sciences Earth, Ocean, and Atmospheric Sciences Environmental Sciences Earth, Atmospheric, and Planetary Sciences Tools and practices that help manage literature are either non-existent or just beginning for data Specialized journals Citations Indices Controlled vocabularies and taxonomies * Example from Ramamurthy, 2012.

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Preservation of the Scientific Record Tenets of the scientific record That scientific products are trustworthy That scientific products enable results to be reproducible and/or transparent Preserving the scientific record: Data considerations Are data stored in a trustworthy institutional setting? Are data documented in a way that ensures understandability, reproducibility, and transparency over time?

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Other Preservation Modules Establishing relationships with archives Preserving a record of environmental change Case studies National Snow & Ice Data Center Glacier Photos Arctic Temperature Variability Data Image: Field, William Osgood Columbia Glacier: From the Glacier Photograph Collection. Boulder, Colorado USA: National Snow and Ice Data Center/World Data Center for Glaciology. Digital media. Image from:

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 References Hanson, B., A. Sugden, and B. Alberts “Making data maximally available.” Science 331(6018): Lynch, C “Jim Gray’s Fourth Paradigm and the Construction of the Scientific Record.” In The Fourth Paradigm: Data-Intensive Scientific Discovery, edited by T. Hey, S. Tansley, & K. Tolle, Redmond, WA: Microsoft. us/collaboration/fourthparadigm/4th_paradigm_book_part4_lynch.pdf us/collaboration/fourthparadigm/4th_paradigm_book_part4_lynch.pdf Ramamurthy, M "Data Management: Progress, Opportunities and Challenges." Presentation at 2012 Unidata Users Workshop, 12 June 2012, Boulder, CO. Stodden, V., I. Mitchell, R. LeVeque "Reproducible Research for Scientific Computing: Tools and Strategies for Changing the Culture," Computing in Science and Engineering 14(4): Uhlir, P.F. and P. Schröder “Open data for global science.” Data Science Journal 6.

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Resources Data Journals Earth System Science Data - Geoscience Data Journal - Data Citations Earth Science Information Partners (ESIP) - Interagency Data Stewardship/Citations/provider guidelines ines ines DataCite – Thomson Reuters Data Citation Index (announced) Controlled vocabularies and taxonomies Global Change Master Directory - NetCDF Climate and Forecast (CF) Metadata Convention - Open Geospatial Consorium - Observations and Measurements Semantic Web for Earth and Environmental Terminology (SWEET)

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Other Relevant Modules Modules about data management plans Data Management Plans: Why Do a Data Management Plan Data Management Plans: Elements of a Plan Modules about documentation and metadata Local Data Management: Introduction to Metadata and Metadata standards Local Data Management: Recording Provenance and Context Modules about preservation strategies Preservation Strategies: Options for Archiving Your Data Preservation Strategies: Emerging Standards For Preservation Case for Data Stewardship – Preserving the Scientific Record Establishing Relationships with Archives Preserving a Record of Environmental Change Case Study 1 – NSIDC Glacier Photos Case Study 2 – Arctic Temperature Variability Data

The Case for Data Stewardship: Preserving the Scientific Record; Version 1.0, October 2012 Recommended Citations Copyright 2012 Matthew Mayernik. Mayernik, M “The Case for Data Stewardship: Preserving the Scientific Record.” In Data Management for Scientists Short Course, edited by Ruth Duerr and Nancy J. Hoebelheinrich, Federation of Earth Science Information Partners: ESIP Commons. doi: /P3PK0D3F