GRAD 521, Research Data Management Winter 2014 – Lecture 9 Amanda L. Whitmire, Asst. Professor.

Presentation transcript:


Lesson topics
1. Definition of metadata
2. Examine information included in a metadata record
3. Examples of metadata standards and how to choose
4. Illustrate the value of metadata to data users, data providers, and organizations
5. Describe the utility of metadata for a variety of scenarios beyond discovery

The data lifecycle

Data collection
CC images by Justin See, CIMMYT, acordova, kukkurovaca, SEDAC, and ISAS on Flickr

From field notes to datasets
Table: Average temperature of observation for each species
Columns: Species | Average Temperature | Temperature Standard Deviation | Number of Observations | Minimum Temperature | Maximum Temperature
Rows (species; numeric values not shown): Northern Red-legged Frog, Tailed Frog, Arizona Toad, Strecker's Chorus Frog, Oregon Spotted Frog, New Jersey Chorus Frog, Wood Frog, Spring Peeper, Red-legged Frog

From datasets to published papers CC image by Heather Kennedy on Flickr

Working with data
Provide: When you provide data to someone else, what types of information would you want to include with the data?
Receive: When you receive a dataset from an external source, what types of details do you want to know about the data?

Working with data
Providing data:
o Why were the data created?
o What limitations, if any, do the data have?
o What do the data mean?
o How should the data be cited if they are re-used in a new study?
Receiving data:
o What are the data gaps?
o What processes were used for creating the data?
o Are there any fees associated with the data?
o In what scale were the data created?
o What do the values in the tables mean?
o What software do I need in order to read the data?
o What projection are the data in?
o Can I give these data to someone else?

What is metadata? “Data about data” “Structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource.” NISO, Understanding Metadata

Metadata “The metadata accompanying your data should be written for a user 20 years into the future -- what does that person need to know to use your data properly? Prepare the metadata for a user who is unfamiliar with your project, methods, or observations.” Oak Ridge National Laboratory Distributed Active Archive Center for Biogeochemical Dynamics (ORNL DAAC)

What is metadata?
Metadata is: Data ‘reporting’
o WHO created the data?
o WHAT is the content of the data?
o WHEN were the data created?
o WHERE is it geographically?
o HOW were the data developed?
o WHY were the data developed?
Photo by Michelle Chang. All Rights Reserved
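The six "reporting" questions can be treated as a checklist. As a minimal sketch (the field names and sample values below are illustrative assumptions, not part of any metadata standard), a record could be held as a Python dictionary and checked for questions that are still unanswered:

```python
# Hypothetical field names, one per "reporting" question; not taken from any standard.
REQUIRED_QUESTIONS = ("who", "what", "when", "where", "how", "why")

def missing_answers(record):
    """Return the questions that the record omits or leaves blank."""
    return [q for q in REQUIRED_QUESTIONS if not str(record.get(q, "")).strip()]

# Illustrative record; every value is made up for this example.
example_record = {
    "who": "A. Researcher, Example University",
    "what": "Air temperature at time of frog observations",
    "when": "2013-04-01 to 2013-09-30",
    "where": "",  # location not yet documented
    "how": "Handheld thermometer, one reading per observation",
}

print(missing_answers(example_record))  # ['where', 'why']
```

Running a check like this before sharing a dataset is one inexpensive way to notice which of the questions a future user could not answer from the record alone.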

Levels of metadata
PROJECT LEVEL: Descriptive information
DATA LEVEL: Granular information

Metadata in real life You use it all the time…

Metadata standards
Dublin Core (DC), Darwin Core (DwC), EML, DDI, NBII, FGDC/CSDGM, ISO 19139, ISO 19115, DIF, LDIF, e-GMS, AGLS, METS, MODS, PREMIS, OAI-PMH, MARC, CDWA, CIDOC/CRM, DACS, DIG35, GILS, GML, ISBD, LCSH, KML, MARCXML, MEI, MIX, OAIS, ANSI/NISO Z39.88, PBCore, PRISM, QDC, RDF, SGML, VSO, XML, XMP

What is a metadata standard?
A standard provides a structure to describe data with:
o Common terms to allow consistency between records
o Common definitions for easier interpretation
o Common language for ease of communication
o Common structure to quickly locate information
In search and retrieval, standards provide:
o Documentation structure in a reliable and predictable format for computer interpretation
o A uniform summary description of the dataset
CC image by ccarlstead on Flickr
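To make "common terms" and "common structure" concrete, here is a small sketch that writes a record using element names from the Dublin Core element set. The wrapper element name `record` and all field values are placeholders chosen for illustration; real repositories usually prescribe their own wrapper and required elements.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace
ET.register_namespace("dc", DC_NS)

def to_dublin_core(fields):
    """Build an XML record with one dc:* element per (name, value) pair."""
    root = ET.Element("record")  # wrapper element name is an assumption
    for name, value in fields:
        child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        child.text = value
    return root

# Placeholder values, for illustration only.
record = to_dublin_core([
    ("title", "Example dataset title"),
    ("creator", "Example Creator"),
    ("date", "2014"),
    ("subject", "example keyword"),
])
print(ET.tostring(record, encoding="unicode"))
```

Because every record uses the same element names, a program can locate the title or creator of any record without knowing who wrote it, which is the "reliable and predictable format for computer interpretation" that the slide describes.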

What does a metadata record look like?
o Ocean Currents and Biogeochemistry: Nearshore Water Profiles (Monthly CTD and Chemistry; SBC-LTER) web link
o New York City Community Health Survey, 2009 (ICPSR) web link
o Mountain hemlock tree-ring width chronologies from the western Oregon Cascade Mountains (USFS Research Data Archive) web link

Muddiest point… What did you find unclear about the concept of metadata?

Concerns about creating metadata
Even if the value of data documentation is recognized, concerns remain as to the effort required to create metadata that effectively describe the data.

Concern | Solution
workload required to capture accurate, robust metadata | incorporate metadata creation into the data development process; distribute the effort
time and resources to create, manage, and maintain metadata | include in grant budget and schedule
readability / usability of metadata | use a standardized metadata format
discipline-specific information and ontologies | ‘profile’ the standard to require specific information and use specific values
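The idea of a ‘profile’ that requires specific information and specific values can be sketched as a small validation routine. The profile below (its required fields and its controlled list for `access`) is entirely hypothetical; an actual profile would be defined by a project, discipline, or repository.

```python
# Hypothetical profile: required fields plus a controlled list for one field.
PROFILE = {
    "required": ["title", "abstract", "keywords", "contact"],
    "allowed_values": {"access": {"public", "restricted", "embargoed"}},
}

def check_against_profile(record, profile=PROFILE):
    """Return a list of human-readable problems; an empty list means the record passes."""
    problems = []
    for field in profile["required"]:
        if not record.get(field):
            problems.append(f"missing required field: {field}")
    for field, allowed in profile["allowed_values"].items():
        value = record.get(field)
        if value is not None and value not in allowed:
            problems.append(f"{field}={value!r} is not in the controlled list")
    return problems

# Illustrative draft record with two deliberate problems.
draft = {"title": "T", "abstract": "A", "keywords": ["frogs"], "access": "open"}
for problem in check_against_profile(draft):
    print(problem)
```

Checks of this kind can run each time data are updated, which spreads the documentation effort across the project instead of leaving it for the end.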

The value of metadata
Metadata helps… data creators, data users, and organizations

What is the value to data creators? Metadata allows data creators to: o Avoid data duplication o Share reliable information o Publicize efforts – promote the work of a scientist and his/her contributions to a field of study CC image by US Embassy Guyana on Flickr

What is the value to data users? Metadata gives a user the ability to: o Search, retrieve, and evaluate data set information from both inside and outside an organization o Find data: Determine what data exists for a geographic location and/or topic o Determine applicability: Decide if a data set meets a particular need o Discover how to acquire the dataset you identified; process and use the dataset CC image by ASEE on Flickr

What is the value to organizations? Metadata helps ensure an organization’s investment in data o Documentation of data processing steps, quality control, definitions, data uses, and restrictions o Ability to use data after initial intended purpose Transcends people & time o Offers data permanence o Creates institutional memory Advertises an organization’s research o Creates possible new partnerships and collaborations through data sharing

Information Entropy (from Michener et al. 1997)
[Figure: DATA DETAILS (vertical axis) decline over TIME (horizontal axis), starting at the time of data development. Annotations along the curve:]
o Specific details about problems with individual items or specific dates are lost relatively rapidly
o General details about datasets are lost through time
o Accident or technology change may make data unusable
o Retirement or career change makes access to “mental storage” difficult or unlikely
o Loss of data developer leads to loss of remaining information

Information Entropy
[Figure axes: DATA DETAILS versus TIME]
Sound information management, including metadata development, can arrest the loss of dataset detail.

A closer look: the utility of metadata
Metadata can support:
o data distribution
o data management
o [project management]
If it is:
o considered a component of the data
o created during data development
o populated with rich content
[Figure labels: collect, classify, derive, planimetric, imagery, analysis, alternative, committee review, PLAN, charette, meta]

Data distribution via metadata: metadata publication, data portals, data discovery

Distribution: data discovery
The descriptive content of the metadata file can be used to identify, assess, and access available data resources:
online access, order process, contacts, use constraints, access constraints, data quality, availability/pricing, keywords, geographic location, time period, attributes
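As a rough illustration of discovery by keywords, geographic location, and time period, the sketch below filters a tiny in-memory catalog. The record layout (`keywords`, `bbox` as west/south/east/north in degrees, `years` as first/last year) and the two sample records are assumptions made for this example, not the structure of any particular portal.

```python
def matches(record, keyword=None, point=None, year=None):
    """Return True if the record satisfies every criterion that was supplied."""
    if keyword is not None:
        if keyword.lower() not in [k.lower() for k in record["keywords"]]:
            return False
    if point is not None:
        lon, lat = point
        west, south, east, north = record["bbox"]
        if not (west <= lon <= east and south <= lat <= north):
            return False
    if year is not None:
        first, last = record["years"]
        if not (first <= year <= last):
            return False
    return True

def discover(catalog, **criteria):
    """Return the titles of all catalog records that match the criteria."""
    return [r["title"] for r in catalog if matches(r, **criteria)]

# Made-up records for illustration.
catalog = [
    {"title": "Stream temperature survey", "keywords": ["temperature", "streams"],
     "bbox": (-124.0, 44.0, -122.0, 45.0), "years": (2005, 2010)},
    {"title": "Coastal frog observations", "keywords": ["amphibians", "temperature"],
     "bbox": (-124.5, 42.0, -123.5, 46.0), "years": (2011, 2013)},
]

print(discover(catalog, keyword="Temperature", point=(-123.8, 44.5)))
print(discover(catalog, year=2012))
```

None of these searches would be possible without the descriptive fields; the data files themselves are never opened.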

Distribution: metadata publication
A metadata collection can be published to the internet via:
o website catalog
o web accessible folder (WAF)
o Z39.50 metadata clearinghouse
o metadata service
o geospatial data portal
[Diagram labels: User Query, Internet, Metadata Collection, Internet / Intranet, Dataset]

Distribution: data portals
Examples of metadata search portals:
Data.gov | Federal e-gov geospatial data portal
Metacat | Repository for data and metadata
US Geological Survey | USGS Core Science Metadata Clearinghouse
ICPSR | Political and Social Science data portal

Data management via metadata: Data Accountability, Discovery & Re-use, Maintenance & Update, Data Liability

Management: maintenance & update
Metadata records can be used to track data provenance and accuracy.
Data Maintenance:
Are the data current?
o Do we have data older than ten years?
o Do we have data collected before some political or geophysical event that resulted in significant change?
Are the data valid?
o prior to most current source data
o prior to most current methodologies
Data Update:
o Contact information
o Distribution policies, availability, pricing, URLs
o New derivations of the dataset
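The "older than ten years" question can be answered directly from a date stored in each metadata record. This is only a sketch: the `collected` field name, the ten-year threshold as a default argument, and the sample holdings are assumptions for illustration.

```python
from datetime import date

def is_older_than(collected, years, today):
    """True if `collected` falls more than `years` years before `today`."""
    try:
        cutoff = today.replace(year=today.year - years)
    except ValueError:
        # `today` is 29 February and the target year is not a leap year.
        cutoff = today.replace(year=today.year - years, day=28)
    return collected < cutoff

def needs_review(records, today, years=10):
    """Return titles of records whose collection date is older than the threshold."""
    return [r["title"] for r in records if is_older_than(r["collected"], years, today)]

# Made-up holdings; `today` is passed in so the result is reproducible.
holdings = [
    {"title": "Land cover, first survey", "collected": date(2001, 6, 15)},
    {"title": "Land cover, resurvey", "collected": date(2010, 3, 1)},
]
print(needs_review(holdings, today=date(2014, 2, 1)))
```

The same pattern extends to the second question on the slide by comparing the collection date with the date of a known event instead of a rolling threshold.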

Discovery: data reuse
If you create metadata, other people can discover your data.
If you create metadata, you can find your own data.
CC image by Oceanit Daily Photo on Flickr

Management: data discovery & reuse
Find your data by:
o themes / attributes
o geographic location
o time ranges
o analytical methods used
o sources & contributors
o data quality
Discoverable data is usable data!
CC image by NASA Goddard Space Flight Center on Flickr

Management: data accountability
Metadata allows you to repeat a scientific process if:
o methodologies are defined
o variables are defined
o analytical parameters are defined
Metadata allows you to defend your scientific process:
o demonstrate process
o increasingly GIS-savvy public requires metadata for consumer information
[Diagram labels: INPUT, RESULTS]

Management: data accountability
Metadata is an exercise in data accountability. It requires you to assess:
o What do you know about the dataset?
o What don’t you know about the dataset?
o What should you know about the dataset?
Are you willing to associate yourself with the metadata record?

Management: data liability
Metadata is a declaration of:
Purpose
o the originator’s intended application of the data
Use Constraints
o inappropriate applications of the data
Completeness
o features or geographies excluded from the data
Distribution Liability
o explicit liability of the data producer and assumed liability of the consumer
What to do… What not to do…

Review: the utility of metadata
Metadata can support:
Data distribution
o discovery
o metadata publication
o data portals
Data management
o maintenance & update
o discovery & reuse
o data accountability
o data liability
[Project management]

Choosing Metadata Standards
Image courtesy of Viv Hutchison

Multiple standards exist
Browse by discipline:
Darwin Core | biological diversity, taxonomy
Dublin Core | general
DDI (Data Documentation Initiative) | social & behavioral sci.
DIF (Directory Interchange Format) | environmental sci.
EML (Ecological Metadata Language) | ecology, biology
ISO | geographic data

Comparing metadata standards
EML | FGDC
Title | Title
Abstract | Abstract
Entity Description | Entity Type Definition
Intellectual Rights | Use Constraints
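Because the two standards collect similar information under different names, a record can often be translated with a simple lookup table (a "crosswalk"). The sketch below reads the comparison above as four EML-to-FGDC pairs, assuming Title and Abstract carry the same name in both; the labels are the display names from the slide, not the exact element names of either schema, and the sample record is made up.

```python
# Crosswalk built from the slide's comparison; labels are display names,
# not the literal XML element names of EML or FGDC CSDGM.
EML_TO_FGDC = {
    "Title": "Title",
    "Abstract": "Abstract",
    "Entity Description": "Entity Type Definition",
    "Intellectual Rights": "Use Constraints",
}

def crosswalk(eml_record, mapping=EML_TO_FGDC):
    """Rename mapped fields; return (converted, unmapped) so nothing is silently dropped."""
    converted, unmapped = {}, {}
    for field, value in eml_record.items():
        if field in mapping:
            converted[mapping[field]] = value
        else:
            unmapped[field] = value
    return converted, unmapped

# Illustrative record; "Taxonomic Coverage" has no entry in this small crosswalk.
eml_record = {
    "Title": "Example dataset",
    "Intellectual Rights": "Cite the creators when re-using these data.",
    "Taxonomic Coverage": "Anura",
}
converted, unmapped = crosswalk(eml_record)
print(converted)
print(unmapped)
```

Returning the unmapped fields separately makes visible what would be lost in translation, which is one practical consideration when choosing between standards.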

Choosing a metadata standard
Many standards collect similar information.
Factors to consider:
1. Your data type: raster/vector GIS data, images, surveys/text, etc.
2. Organization [funder] policies
3. Future preservation/sharing location
4. Tools to support creation & distribution
5. Other factors: availability of human support; instructional materials; use of controlled vocabularies; output formats

Summary
o Metadata is documentation of data
o A metadata record captures critical information about the content of a dataset
o Metadata allows data to be discovered, accessed, and re-used
o A metadata standard provides structure and consistency to data documentation
o Standards and tools vary – select according to defined criteria such as data type, organizational guidance, and available resources
o Metadata is of critical importance to data developers, data users, and organizations
o Metadata can be effectively used for: data distribution, data management, project management
o Metadata completes a dataset.
Creating robust metadata is in your OWN best interest!

On Thursday: Barnard Classroom, 5th Floor