Enhancing Linkages Between Projects and Datasets: Examples from LBA-ECO for NACP

Lisa Wilcox, Amy L. Morrell, Peter C. Griffith
Organization: Science Systems & Applications, Inc. and the Carbon Cycle and Ecosystems Office, NASA Goddard Space Flight Center

Summary

The Carbon Cycle and Ecosystems Office is developing ideas to build a warehouse of metadata resulting from the North American Carbon Program (NACP). The resulting warehouse and applications would be very similar to those developed for the LBA-ECO Project [1]. The harvested metadata would be used to create dynamically generated reports, available at [2], which would facilitate access to NACP datasets.

Our primary goal is to associate harvested metadata, as far as possible, with its corresponding project group profile. This also addresses high-priority goal #4 of the NACP Data System Task Force, to "link the dataset metadata index with the project metadata index generated and maintained by the NACP Office" [3]. Achieving this goal would maximize data discovery by associating each dataset with its corresponding NACP project group profile, which provides a greater understanding of the scientific and social context of each dataset.

This will be challenging because the datasets exist in many different formats, reside in many thematic data centers, and are also distributed among hundreds of investigators. Among other things, this situation creates a lack of consistency in how the associated metadata is composed, limiting our ability to fully automate metadata harvesting and the dynamic generation of a wide variety of associated reports.

This presentation demonstrates what we can do for NACP by looking at what we have already done for LBA-ECO.

Figure 1: LBA-ECO Investigation Profile

The LBA-ECO website provides a profile (Figure 1) for each LBA-ECO investigation, including:

- Participants
- Abstract(s)
- Study sites
- Publications

These profiles are very similar to NACP project profiles.
Linked from each LBA-ECO profile is a list of associated registered dataset titles, each of which links to a dataset profile (Figure 2) that presents the metadata in a user-friendly way.

Figure 2: Dataset Profile

The LBA-ECO dataset profiles are dynamically generated from the harvested metadata stored in our MySQL database. Because the LBA-ECO metadata records are consistent, we have been able to cross-link controlled vocabulary terms (such as geospatial region) with the associated region and site profile reports. Each dataset profile also contains hyperlinks to each associated data file at its home data repository, along with links to other associated information, such as abstracts of related publications.

References

1. LBA-ECO, a component of the Large-Scale Biosphere Atmosphere Experiment in Amazonia.
2. North American Carbon Program (NACP).
3. Prioritized list of recommendations to CCIWG regarding NACP Data Central, NACP Data System Task Force, July 24.
4. Simple Web Indexing System for Humans – Enhanced (Swish-e).
5. MySQL.
6. Content Standard for Digital Geospatial Metadata (CSDGM).
7. Comprehensive Perl Archive Network.

Acknowledgements

Our CPTEC and ORNL colleagues: Luiz Horta (CPTEC), Merilyn Gentry (ORNL), and Bob Cook (ORNL).
Scott Dennis Miller, Humberto Ribeiro da Rocha, and all of the investigators of the LBA-ECO investigation CD-01.
The generous support given to us by NASA's Terrestrial Ecology Program.

Technical Overview of What We Are Currently Doing for LBA-ECO

Metadata for LBA-ECO datasets resides at two repositories: the Centro de Previsão de Tempo e Estudos Climáticos (CPTEC) in Brazil and the Oak Ridge National Laboratory (ORNL) Distributed Active Archive Center (DAAC) in the United States. Each of these metadata records is available online as an XML file. A list containing a file location for each record is provided to the system by a separate process.
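
As a rough illustration of this list-driven setup, the sketch below takes a list of record locations, retrieves each XML record, and appends it to one combined text output. It is written in Python and does not use the Swish-e spider; the function names, the `Path-Name:` separator line, and the failure marker are assumptions made for illustration, not the project's actual tooling.

```python
from typing import Callable, Iterable
from urllib.request import urlopen


def fetch_url(location: str) -> str:
    """Retrieve one metadata record over HTTP and return it as text."""
    with urlopen(location, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def harvest(locations: Iterable[str],
            fetch: Callable[[str], str] = fetch_url) -> str:
    """Concatenate every retrievable record into one block of text.

    Each record is preceded by a hypothetical 'Path-Name:' line so that a
    later parsing step can tell where one record ends and the next begins.
    Records that cannot be retrieved are skipped and noted in a '# FAILED' line.
    """
    chunks = []
    for location in locations:
        location = location.strip()
        if not location:
            continue  # ignore blank lines in the location list
        try:
            body = fetch(location)
        except OSError as error:  # urllib's URLError is a subclass of OSError
            chunks.append(f"# FAILED {location}: {error}\n")
            continue
        chunks.append(f"Path-Name: {location}\n{body.rstrip()}\n")
    return "\n".join(chunks)
```

Passing the retrieval function as a parameter keeps the sketch easy to exercise offline: a stand-in that reads from a dictionary or from local files can replace `fetch_url` without changing `harvest`.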
The harvesting process passes this list to the Swish-e [4] spider, which retrieves the files into a single ASCII output file. After harvesting is complete, our parsing script ingests the contents of the output file into the MySQL [5] database developed for the metadata warehouse. The database was designed to be compliant with the Content Standard for Digital Geospatial Metadata (CSDGM), a widely accepted metadata standard for geospatial data [6]. The parsing script is based on a crosswalk that maps the harvested metadata to this standard. It is written in Perl and makes extensive use of the XML::Simple CPAN module [7] to parse the metadata. All of the scripts that create our dynamically generated reports are also written in Perl, using Perl's DBI interface to access our MySQL database.

We currently harvest only metadata that is in an XML format. However, not all NACP datasets have corresponding metadata in XML, so we will need to expand our current capabilities by creating harvest and ingest scripts that can extract metadata in other formats.

[Diagram: XML files at ORNL and CPTEC are retrieved by the Swish-e spider into an ASCII output file; the parsing script (using XML::Simple) loads that file into the MySQL metadata warehouse; Perl scripts using DBI generate the Investigation Profile (Figure 1), the list of Registered Dataset Titles, the Dataset Profile (Figure 2), the Region Profile (Figure 3), and the Region/Datasets Profile (Figure 4), which are linked to one another by URL.]

Example Profiles from the LBA-ECO Project

We also use the harvested metadata from the LBA Project in administrative applications to assist quality assurance efforts.
These include processes that check for broken hyperlinks to data files, automated emails that inform our administrators when critical metadata fields are updated, dynamically generated reports of metadata records that link to datasets with questionable file formats, and dynamically generated quality assurance reports on region and site coordinates. These applications are as important as those that facilitate access to information, because they help ensure a high standard of quality for that information. Where possible, we hope to create similar reports for NACP.
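
The warehouse-and-report pattern described in the technical overview can be pictured with a small sketch: parse each XML record, map its fields through a crosswalk into warehouse columns, store the rows, and then query the warehouse for a quality assurance report. The sketch is in Python with the standard-library `sqlite3` and `xml.etree.ElementTree` modules standing in for MySQL, Perl DBI, and XML::Simple. The crosswalk entries, source element names, table layout, and list of accepted file extensions are all hypothetical.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical crosswalk: source XML element -> warehouse column.
# Column names loosely echo CSDGM concepts; the source element names are invented.
CROSSWALK = {
    "title": "title",
    "summary": "abstract",
    "region": "place_keyword",
    "data_url": "network_resource",
}

# File extensions treated as acceptable in this sketch (an assumption).
ACCEPTED_EXTENSIONS = (".csv", ".txt", ".tif", ".nc", ".zip")


def parse_record(xml_text):
    """Map one XML metadata record onto warehouse columns via the crosswalk."""
    root = ET.fromstring(xml_text)
    return {
        column: (root.findtext(source) or "").strip()
        for source, column in CROSSWALK.items()
    }


def ingest(connection, records):
    """Create the table if needed, insert all parsed records, return the count."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS dataset ("
        "title TEXT, abstract TEXT, place_keyword TEXT, network_resource TEXT)"
    )
    rows = [parse_record(text) for text in records]
    connection.executemany(
        "INSERT INTO dataset VALUES "
        "(:title, :abstract, :place_keyword, :network_resource)",
        rows,
    )
    connection.commit()
    return len(rows)


def questionable_formats(connection):
    """List (title, link) pairs whose data link is missing or has an unexpected extension."""
    cursor = connection.execute(
        "SELECT title, network_resource FROM dataset ORDER BY title"
    )
    return [
        (title, url)
        for title, url in cursor
        if not url.lower().endswith(ACCEPTED_EXTENSIONS)
    ]


if __name__ == "__main__":
    sample = [
        "<metadata><title>Tower flux</title><region>Para</region>"
        "<data_url>http://example.org/flux.csv</data_url></metadata>",
        "<metadata><title>Soil survey</title><region>Acre</region>"
        "<data_url>http://example.org/soil.xyz</data_url></metadata>",
    ]
    warehouse = sqlite3.connect(":memory:")
    ingest(warehouse, sample)
    for title, url in questionable_formats(warehouse):
        print(f"Check file format: {title} -> {url}")
```

Keeping the crosswalk as a plain mapping means that supporting a new source format would mostly involve a new parser and a new mapping, while the warehouse table and the report queries stay the same.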