MGDS Project Overview and Sample Metadata (Arko)1 of 18SESAR–IGSN Workshop (February 26-27, 2007) PROJECT TEAM: Joyce Alsop * Robert Arko Suzanne Carbotte.

Slides:



Advertisements
Similar presentations
Activity Update Frank Nitsche Robert Arko Suzanne Carbotte
Advertisements

Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
Ocean Data Interoperability Platform EU-US-Australia collaborative project Grant Number: Call: FP7-INFRASTRUCTURES INFSO Activity: INFRA :
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini.
Marine Geoscience Seismic Data System Access for education and research Field Data Center (LDEO) Marine Seismic Data Center (UTIG)
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
Metadata Standards for Sample- Based Observations Kerstin Lehnert EGU General Assembly 2011.
DATA SYSTEMS FOR SAMPLE- BASED OBSERVATIONS 1 Kerstin Lehnert.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
State Geological Survey Contributions to the National Geothermal Data System.
2 nd Training Workshop 4 – 5 June 2007 Common Data Index - CDI By Dick M.A Schaap Technical Coordinator SeaDataNet.
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
Data Resources US Perspective Kerstin Lehnert Suzanne Carbotte Lamont-Doherty Earth Observatory of Columbia University.
© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE NEON APPROACH TO DATA INGEST, CURATION, AND SHARING Christine Laney (Data.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
The Digital Library for Earth System Education: A Community Resource
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
Helen Glaves (NERC- BGS), Dick Schaap (MARIS), Robert Arko (LDEO) and Roger Proctor (IMOS)
Australian Partnership for Sustainable Repositories University of Sydney practices and test-bed projects, sustainability in a distributed.
California’s Surface Water Ambient Monitoring Program Data Management Systems Cassandra Lamerdin SWAMP Data Management Team Marine Pollution Studies Laboratory.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Application of International GeoSample Number (IGSN) to Sample Collections Sri Vinay Geoinformatics for Geochemistry (GfG) Program Lamont Campus of Columbia.
World Data Center for Marine Environmental Sciences.
Enabling Access to High-Resolution LiDAR Topography through Cyberinfrastructure-Based Data Distribution and Processing Christopher J. Crosby, J Ramón Arrowsmith.
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
INTEGRATED OCEAN DRILLING PROGRAM MANAGEMENT INTERNATIONAL International Data Exchange Workshop – Kiel, Germany – May 9-11, 2007 SEDIS Scientific Earth.
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.
T43C-1647 The EarthChem Deep Lithosphere Dataset: Digital Access to Mantle Xenolith Petrological Data The EarthChem Deep Lithosphere Dataset: Digital Access.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University.
Two types of data requirements: 1. "time-series monitoring data" which requires real- time continuous collection (sea surface temperature, stream gauges,
EarthChem: Geochemistry Information Network Registry for Earth samples that administers unique identifiers Kerstin Lehnert Steve Goldstein.
NOAA National Geophysical Data Center & collocated World Data Centers, Boulder CO USA World Data Center for Marine Geology and Geophysics, Boulder, CO.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
“A Library outranks any other one thing a community can do to benefit its people.” --Andrew Carnegie.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
Serving Multidisciplinary Data For Ridge2000 and MARGINS Programs William B. F. Ryan, Suzanne Carbotte and MGDS Team.
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
U.S. Department of the Interior U.S. Geological Survey Decision Support Tools and USGS Data Management Best Practices Cassandra Ladino USGS Chesapeake.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
National Geophysical Data Center (NGDC) U.S. Department of Commerce National Oceanic & Atmospheric Administration National Geophysical Data Center (NGDC)
Semantic Concepts in Expedition Metadata Semantic Concepts in Expedition Metadata Bob Arko Lamont-Doherty Earth Observatory OOSSI Workshop Nov. 18, 2008.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
The launching of an expedition has its own brand of excitement, with the sound of the main engines firing up, and the lifting of the gangway in a foreign.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
SIOExplorer: Digital Library Projects R/V Alexander Agassiz November, 1907 UCSD Libraries Scripps Institution of Oceanography San Diego Supercomputer Center.
LEGACY MEETING – SUMMARY NOTES LEGACY MEETING SUMMARY NOTES September 3-5, 2008 Palisades, NY.
EarthCube Sustaining the Geosciences for 21 st Century Challenges Credits: from top to bottom: NOAA Okeanos Explorer Program (CC BY-SA 2.0), NASA/Kathryn.
A Fleet-wide Approach to Optimizing Data Quality Vicki Ferrini, Suzanne O’Hara (LDEO) Paul Johnson, Kevin Jerram (UNH)
Jens Klump | OCE Science Leader Earth Science Informatics
Linked Data for Field Deployments
An Overview of Data-PASS Shared Catalog
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
Bird of Feather Session
Presentation transcript:

MGDS Project Overview and Sample Metadata (Arko)1 of 18SESAR–IGSN Workshop (February 26-27, 2007) PROJECT TEAM: Joyce Alsop * Robert Arko Suzanne Carbotte (lead) Dale Chayes John Diebold Vicki Ferrini Andrew Goodwillie * Kerstin Lehnert Andrew Melkonian Suzanne O’Hara William Ryan R.A. Weissel MGDS PROJECT OVERVIEW AND SAMPLE METADATA

MGDS Project Overview and Sample Metadata (Arko)2 of 18SESAR–IGSN Workshop (February 26-27, 2007) OUTLINE 1.PROJECT OVERVIEW 2.CURRENT HOLDINGS 3.DATA MODEL 4.METADATA SUBMISSION 5.CHALLENGES

MGDS Project Overview and Sample Metadata (Arko)3 of 18SESAR–IGSN Workshop (February 26-27, 2007) OVERVIEW: MISSION STATEMENT Design and maintain an integrated data repository for MG&G communities: Ridge 2000 Program MARGINS Program U.S. Antarctic Program Legacy - Multibeam Synthesis Seismic Reflection Joint funding from NSF OCE + EAR + OPP

MGDS Project Overview and Sample Metadata (Arko)4 of 18SESAR–IGSN Workshop (February 26-27, 2007) OVERVIEW: SCOPE AND PARTNERS Data from marine and terrestrial realms Data from all disciplines - biological, physical, chemical, geological Project partners: WHOI (Ridge 2000 Program) TAMU (MARGINS Program) RPSC (U.S. Antarctic Program) NGDC, CCOM (Legacy - Multibeam Synthesis) UTIG (Seismic Reflection) Collaborative partners: DLESE (education modules) MMI (community/ontology development) SESAR (sample registration)

MGDS Project Overview and Sample Metadata (Arko)5 of 18SESAR–IGSN Workshop (February 26-27, 2007) OVERVIEW: SCIENTIFIC RATIONALE Ensure ability to verify research results Preserve expensive/unique/unrepeatable data Supplement traditional publication methods Facilitate cross-disciplinary research Increase data availability to non-specialists Enable automated analysis + synthesis

MGDS Project Overview and Sample Metadata (Arko)6 of 18SESAR–IGSN Workshop (February 26-27, 2007) OVERVIEW: SYSTEM COMPONENTS PRODUCTS Metadata catalog (1500+ collections) Data repository (210,000+ files total 5+ TB - partnership with SDSC) Global syntheses (e.g. multi-resolution DEM) SERVICES Web portals (search + download) GeoMapApp® (integrate + visualize data from multiple sources) Web services (OAI, OGC, etc.)

MGDS Project Overview and Sample Metadata (Arko)7 of 18SESAR–IGSN Workshop (February 26-27, 2007) CURRENT HOLDINGS: SOLID EARTH SAMPLES 50 NEW DATA SETS OVER 3500 SAMPLES (growing rapidly…)

MGDS Project Overview and Sample Metadata (Arko)8 of 18SESAR–IGSN Workshop (February 26-27, 2007) MULTIPLE WEB PORTALS TO SERVE DIFFERENT COMMUNITIES : SINGLE INTEGRATED DATABASE BACKEND DATA MODEL:

MGDS Project Overview and Sample Metadata (Arko)9 of 18SESAR–IGSN Workshop (February 26-27, 2007) DATA MODEL: COLLECTION (registration = ?) Field Observatory Expedition Derived SET (registration = STD-DOI) group of data objects having common provenance OBJECT (registration = IGSN) Data File Real-time Processed Sample COLLECTION SET OBJECT

MGDS Project Overview and Sample Metadata (Arko)10 of 18SESAR–IGSN Workshop (February 26-27, 2007) DATA MODEL: COLLECTION METADATA related collections collection aliases (at other repositories) platform/operator funding agency/awards project titles/urls science party (field + lab personnel) lat/lon bins location (physio features, place names) supporting documents (cruise reports etc.) references (citations)

MGDS Project Overview and Sample Metadata (Arko)11 of 18SESAR–IGSN Workshop (February 26-27, 2007) DATA MODEL: ACQUISITION EVENTS 1.LAUNCH (independent, navigated) daughter platforms e.g. Submersible, Drone, Small Boat 2.LINE (navigated) towed platforms e.g. Camera, MCS, TowYo 3.STATION (only start/stop) lowered platforms e.g. Core, Grab, CTD, BLISP towed platforms e.g. Dredge, Net deployed platforms e.g. OBS, Marker, Float, Probe Events can be nested (e.g. Dive > Station)

MGDS Project Overview and Sample Metadata (Arko)12 of 18SESAR–IGSN Workshop (February 26-27, 2007) DATA MODEL: SAMPLE METADATA collection_id sample_id sample_name (investigator’s pet name) parent_id data_type (e.g. “Rock Sample”) sample_type (e.g. “Igneous: Volcanic: Mafic”) launch_id line_id station_id ---> station_type (e.g. “Bottom: Towed”) + station_platform (e.g. “Dredge”) start_date start_longitude/latitude/elevation stop_date stop_longitude/latitude/elevation navfix_type local_origin/units (e.g. for dive programs) start_local_x/y stop_local_x/y location_id (physiographic feature) tectonic_setting (e.g. “Back-Arc Basin”) investigator_id contact_id contributor_id repository_id (holds authoritative metadata) facility_id (holds physical sample) other/details

MGDS Project Overview and Sample Metadata (Arko)13 of 18SESAR–IGSN Workshop (February 26-27, 2007) DATA MODEL: CONTROLLED VOCABULARIES (both types and identifiers) collection_id collection_type data_type device_type dive_type feature_id feature_type format_id initiative_id language_id launch_platform_type launch_type line_platform_type line_type location_id nav_type organization_id person_id platform_id platform_type role_id role_type station_platform_type station_type status_id (and still growing…)

MGDS Project Overview and Sample Metadata (Arko)14 of 18SESAR–IGSN Workshop (February 26-27, 2007) METADATA SUBMISSION: POLICY Records made public immediately: people/projects/awards primary navigation catalog of acquisition events catalog of data sets catalog of samples

MGDS Project Overview and Sample Metadata (Arko)15 of 18SESAR–IGSN Workshop (February 26-27, 2007) METADATA SUBMISSION: FORMS 1.Contact chief scientist in advance - designate science party liaison 2.Follow up with liaison (60 days) 3.Register/submit data sets to appropriate partner repositories

MGDS Project Overview and Sample Metadata (Arko)16 of 18SESAR–IGSN Workshop (February 26-27, 2007) METADATA SUBMISSION: FORMS Example: Sediment Cores (based on LDEO Repository log sheet)

MGDS Project Overview and Sample Metadata (Arko)17 of 18SESAR–IGSN Workshop (February 26-27, 2007) CHALLENGES: 1.Metadata form submission completeness consistent identifiers and formats 2.Globally unique identifiers 3.Evolving/shared vocabularies Physiographic Feature (gazetteer + local features e.g. Vents) Tectonic Setting Sample Type (domain specific) Station Platform/Type

MGDS Project Overview and Sample Metadata (Arko)18 of 18SESAR–IGSN Workshop (February 26-27, 2007) Questions? _____________ marine – geo.org