A Multi-Discipline Metadata Registry for Science Interoperability J. Steven Hughes/JPL - Daniel J. Crichton/JPL -

Presentation transcript:

A Multi-Discipline Metadata Registry for Science Interoperability J. Steven Hughes/JPL - Daniel J. Crichton/JPL - Jason J. Hyon/JPL - Sean C. Kelly/UTA - Open Forum on Metadata Registries January 17-21, 2000 Santa Fe, New Mexico

A Multi-Discipline Metadata Registry for Science Interoperability
  – Background
  – Problem Statement
  – System Overview
  – Profile Development
  – Conclusion and Issues

Background
NASA’s Office of Space Science
Planetary Science
  – Planetary Data System (PDS)
  – 5 science discipline nodes, 2 support nodes, 1 central node
  – Heterogeneous domains, short-term missions
Astrophysics
  – Astrophysics Data System
  – 100s to 1000s of nodes
  – Homogeneous domains, long-term missions
Space Physics
  – Space Physics Data System*
  – Several identified nodes

Background
Planetary Data System (PDS)
  – Archives essentially all science data from solar system exploration missions
  – Prototype , Operational
  – Publishes archive-quality products
  – Well-defined standards architecture

Background Planetary Science Standards Architecture

Background
Planetary Science Data Dictionary
  – Data elements spanning Planetary Science disciplines
  – Nomenclature standard
  – Meaning, type, ranges, enumerated values
Planetary Science Data Model
  – Developed as Planetary Science enterprise E/R model
  – Planetary Science entities: spacecraft, instruments
  – Science data entities: data products, projections, ...
  – Data organization entities: volumes
  – Management entities: nodes, personnel
  – Implemented as the PDS Data Set Catalog in an RDBMS
  – Distributed in Object Description Language

Background
Challenge
  – Develop a single interface for locating space science data.
  – Provide data system interoperability.
  – Support correlative science.

Problem Statement
Space scientists cannot easily locate or use data across the hundreds, if not thousands, of autonomous, heterogeneous, and distributed data systems currently in the space science community.
Heterogeneous systems
  – Data management: RDBMS, ODBMS, home-grown DBMS, binary files
  – Platforms: UNIX, Linux, Win3.x/9x/NT, Mac, VMS, …
  – Interfaces: Web, Windows, command line
  – Data formats: HDF, CDF, NetCDF, PDS, FITS, VICAR, ASCII, ...
  – Data volume: kilobytes to terabytes
Heterogeneous disciplines
  – Moving targets and stationary targets
  – Multiple coordinate systems
  – Multiple data object types (images, cubes, time series, spectra, tables, binary, documents)
  – Multiple interpretations of single object types
  – Multiple software solutions to the same problem
  – Incompatible and/or missing metadata

Proposed Solution
Encapsulate individual data systems (hide uniqueness).
Communicate using metadata that describe resources
  – Data (e.g. data sets, images)
  – Non-data (e.g. catalogs, services)
Enable interoperability based on metadata compatibility.
Refocus the problem on metadata development.

Proposed Solution (cont)
Object Oriented Data Technology (OODT) Task
  – Domain-independent data management infrastructure
Domain-independent data structures
  – XML: standard interchange language
  – Metadata management: resource profile
  – Message passing
Domain-independent system infrastructure
  – CORBA for interoperability between computer systems and languages
  – Message passing to simplify interface design
  – Standardized reusable server components

System Overview
Object Oriented Data Technology Framework
[Architecture diagram. Component labels: SeaWinds Staging, OODT Server, PDS Staging, PTI Staging, Profile Server, Query Server, Archive Server, Product Server, Sybase, Oracle, PDS Systems, Prof, Scientist, Web Server.]

System Overview
Profile Service
Profile describes a resource
  – Available datasets and products
  – Types of resources and where they’re located
Optionally reference other profile servers
[Diagram labels: Profile Server, Prof, Data system 1, Data system 2.]

System Overview
Query Service
Knows how to “crawl” through servers to produce a result
  – Crawls through profiles to discover other profiles and product servers
  – Crawls through product servers to display available products
Accessible through a CORBA API or through a web browser
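
The crawl described on this slide can be pictured as a breadth-first walk over servers that reference one another. The following is only a minimal sketch under stated assumptions: `ProfileServer`, `ProductServer`, and `crawl` are hypothetical names invented for illustration, the servers are plain in-memory objects, and nothing here reflects the actual OODT or CORBA interfaces.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ProductServer:
    """Hypothetical stand-in for a product server: a name plus the products it holds."""
    name: str
    products: list = field(default_factory=list)


@dataclass
class ProfileServer:
    """Hypothetical stand-in for a profile server.

    It may reference other profile servers as well as product servers.
    """
    name: str
    profile_servers: list = field(default_factory=list)
    product_servers: list = field(default_factory=list)


def crawl(root):
    """Breadth-first walk from a root profile server; returns the product names found."""
    seen = {root.name}
    queue = deque([root])
    found = []
    while queue:
        server = queue.popleft()
        for product_server in server.product_servers:
            found.extend(product_server.products)
        for other in server.profile_servers:
            if other.name not in seen:  # guard against reference cycles
                seen.add(other.name)
                queue.append(other)
    return found


# Illustrative data only; the server and product names are made up.
pds = ProfileServer("pds", product_servers=[ProductServer("pds-products", ["viking-image-1"])])
seawinds = ProfileServer("seawinds", product_servers=[ProductServer("sw-products", ["wind-swath-1"])])
root = ProfileServer("oodt", profile_servers=[pds, seawinds])
pds.profile_servers.append(root)  # a cycle, to show that the crawl still terminates

print(crawl(root))
```

The `seen` set is the one design choice worth noting: because profile servers may reference each other, a crawl without it could loop forever.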

Profile Development
Objective
Design and develop a domain-generic structure that captures the metadata necessary for identifying and locating science data resources across distributed, heterogeneous data systems.
Result
Profile: a resource description (subset of the meta-model) sufficient to determine whether the resource might resolve a query.

Profile Development
Approach
  – Choose a common interchange format.
  – Develop a domain generic language.
  – Implement domain specific instances.
    – Model the domain.
    – Capture the meta-data.
  – Develop a system to manage the results.

Profile Development
Choose a common interchange format
XML (eXtensible Markup Language)
  – More expressive than HTML
  – Simpler than SGML
  – A meta-language used to define domain languages
    – XSIL: eXtensible Scientific Interchange Language
    – XIL: instrument control language
  – Wide acceptance as an interchange format
  – Electronic data interchange (EDI) standard

Profile Development
Develop a domain generic language
Define a generic structure (XML DTD) that can describe heterogeneous domain-specific resources.
Profile: a resource description with sufficient information to determine if the resource satisfies a query.
  – Profile elements: name, syntax, unit, value_instance, meaning, alias, …
    – encode selected domain attributes and their values specific to this resource
  – Resource attributes: id, title, discipline, location_id, …
  – Profile attributes: id, title, desc, type, data_dictionary_id, …
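
One way to read "sufficient information to determine if the resource satisfies a query" is as a cheap pre-filter over profile elements. The sketch below is an illustration, not the OODT matching logic: `ProfileElement`, `might_satisfy`, and the `LATITUDE` element are hypothetical, and only equality queries on a single element are handled.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ProfileElement:
    """One profile element: enumerated values, a numeric range, or neither."""
    name: str
    values: set = field(default_factory=set)
    minimum: Optional[float] = None
    maximum: Optional[float] = None


def might_satisfy(elements, name, value):
    """Return True if the profile cannot rule the resource out for `name == value`."""
    for element in elements:
        if element.name != name:
            continue
        if element.values:
            return value in element.values
        if element.minimum is not None and element.maximum is not None:
            return element.minimum <= value <= element.maximum
        return True  # the element is listed but carries no constraint
    return False  # assumption: an element the profile never mentions cannot match


# TARGET_NAME values come from the slides; LATITUDE is a made-up range element.
profile = [
    ProfileElement("TARGET_NAME", values={"MARS", "DEIMOS", "PHOBOS"}),
    ProfileElement("LATITUDE", minimum=-90.0, maximum=90.0),
]

print(might_satisfy(profile, "TARGET_NAME", "MARS"))     # True
print(might_satisfy(profile, "TARGET_NAME", "JUPITER"))  # False
print(might_satisfy(profile, "LATITUDE", 45.0))          # True
```

Treating an unmentioned element as a non-match is a choice made for this sketch; a real service could just as reasonably forward such queries to the resource.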

Profile Development
Develop a domain generic language
prof.dtd
<!ELEMENT PROFILES (PROFILE+)>
<!ELEMENT PROFILE (PROFILE_ATTRIBUTES, RESOURCE)>
<!ATTLIST PROFILE PROFILE_ID CDATA #REQUIRED>
<!ELEMENT PROFILE_ATTRIBUTES (ID, TITLE*, DESC*, TYPE*, STATUS_ID*,
    SECURITY_TYPE*, PARENT_ID*, CHILD_ID*, REVISION_NOTE*, DATA_DICTIONARY_ID*)>
<!ELEMENT RESOURCE (RESOURCE_ATTRIBUTES, PROFILE_ELEMENT*)>
<!ELEMENT RESOURCE_ATTRIBUTES (RESOURCE_ID, RESOURCE_TITLE, RESOURCE_DISCIPLINE,
    RESOURCE_AGGREGATION, RESOURCE_CLASS, RESOURCE_LOCATION_ID, RESULT_MIME_TYPE)>
<!ELEMENT PROFILE_ELEMENT (ELEMENT_NAME, ELEMENT_MEANING*, ELEMENT_ALIAS*,
    VALUE_SYNTAX*, VALUE_UNIT*, (VALUE_INSTANCE | (MINIMUM_VALUE, MAXIMUM_VALUE))*)>
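
To make the DTD concrete, the sketch below reads a small document laid out according to the declarations above, using Python's standard `xml.etree.ElementTree`. The sample document, its identifiers (`EXAMPLE_PROFILE_1`, `EXAMPLE_RESOURCE_1`, `example-location`), and the `read_profiles` helper are all hypothetical. Note that ElementTree does not validate against a DTD, so the code simply assumes the input conforms.

```python
import xml.etree.ElementTree as ET

# Hypothetical instance document following the element order in prof.dtd.
SAMPLE = """\
<PROFILES>
  <PROFILE PROFILE_ID="EXAMPLE_PROFILE_1">
    <PROFILE_ATTRIBUTES>
      <ID>EXAMPLE_PROFILE_1</ID>
      <TITLE>Example profile</TITLE>
    </PROFILE_ATTRIBUTES>
    <RESOURCE>
      <RESOURCE_ATTRIBUTES>
        <RESOURCE_ID>EXAMPLE_RESOURCE_1</RESOURCE_ID>
        <RESOURCE_TITLE>Example resource</RESOURCE_TITLE>
        <RESOURCE_DISCIPLINE>PDS</RESOURCE_DISCIPLINE>
        <RESOURCE_AGGREGATION>GRANULE+</RESOURCE_AGGREGATION>
        <RESOURCE_CLASS>DATA</RESOURCE_CLASS>
        <RESOURCE_LOCATION_ID>example-location</RESOURCE_LOCATION_ID>
        <RESULT_MIME_TYPE>text/html</RESULT_MIME_TYPE>
      </RESOURCE_ATTRIBUTES>
      <PROFILE_ELEMENT>
        <ELEMENT_NAME>TARGET_NAME</ELEMENT_NAME>
        <VALUE_SYNTAX>ENUMERATION</VALUE_SYNTAX>
        <VALUE_INSTANCE>MARS</VALUE_INSTANCE>
        <VALUE_INSTANCE>PHOBOS</VALUE_INSTANCE>
      </PROFILE_ELEMENT>
    </RESOURCE>
  </PROFILE>
</PROFILES>
"""


def read_profiles(xml_text):
    """Map each RESOURCE_ID to a dict of element name -> list of value instances."""
    root = ET.fromstring(xml_text)
    result = {}
    for profile in root.iter("PROFILE"):
        resource_id = profile.findtext("RESOURCE/RESOURCE_ATTRIBUTES/RESOURCE_ID")
        elements = {}
        for element in profile.iterfind("RESOURCE/PROFILE_ELEMENT"):
            name = element.findtext("ELEMENT_NAME")
            elements[name] = [v.text for v in element.findall("VALUE_INSTANCE")]
        result[resource_id] = elements
    return result


profiles = read_profiles(SAMPLE)
print(profiles)
```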

Profile Development
Profile Example - PDS Distributed Inventory System
PROFILE_PDS_DIS_V1.3.n
Planetary Data System - Distributed Inventory System - Profile V1.0
This profile describes the Planetary Data System (PDS) Distributed Inventory System (DIS)...
PROFILE
OODT_PDS_DATA_SET_DD_V1.0
PDS_DIS_V1.3.n
Planetary Data System - Distributed Inventory System
PDS
GRANULE+
INVENTORY
text/html
...

Profile Development
Profile Example (cont) - PDS Distributed Inventory System
…
DATA_OBJECT_TYPE
  The data_object_type element provides the type...
  ENUMERATION
  N/A
  IMAGE
  ...
DATA_SET_NAME
  The data_set_name element identifies a PDS data set. -- example...
  ENUMERATION
  N/A
  VO1/VO2 MARS VISUAL IMAGING SUBSYSTEM DIGITAL...
  VO2 MARS RADIO SCIENCE SUBSYSTEM RESAMPLED LOS …
  ...
TARGET_NAME
  The target_name element provides the names of the targets...
  ADS.OBJECT_ID
  ENUMERATION
  N/A
  IDA
  JUPITER
  ...

Profile Development
Develop a domain generic language
Specialize the profile class
  – Profile: one profile to one resource (e.g. inventory)
  – Inventory: one profile to many resources (e.g. data set, image)
    – Minimized profile element attributes
    – no meanings
    – subsets of preferred values
  – Dictionary: one profile to one discipline
    – Maximized profile element attributes
    – aliases, meanings
    – union of all preferred values

Profile Development
Develop a domain generic language
Profile element hierarchy
  – Dictionary: Planetary Science Data Dictionary
    – data elements: union of all data elements in all profiles
    – preferred values: union of all data element values
    – e.g. TARGET_NAME = {ADRASTEA, …, VENUS}
  – Profile: Planetary Image Atlas (Viking, Galileo, MPF, ...)
    – data elements: union of all data elements for all entities managed by the resource
    – preferred values: union of data element values
    – e.g. TARGET_NAME = {MARS, DEIMOS, PHOBOS, JUPITER, ...}
  – Inventory: Viking Orbiter Image Catalog
    – data elements: data elements associated with the inventory item
    – preferred values: data element values for the inventory item
    – e.g. TARGET_NAME = {MARS, DEIMOS, PHOBOS}
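
The hierarchy on this slide amounts to taking set unions as one moves up from inventory to profile to dictionary. A minimal sketch, assuming each level is represented as a plain dict from element name to a set of preferred values; `merge_values` and the Galileo inventory contents are hypothetical, while the Viking values are the ones given on the slide.

```python
def merge_values(profiles):
    """Union the preferred values of each data element across several profiles."""
    merged = {}
    for profile in profiles:
        for element_name, values in profile.items():
            merged.setdefault(element_name, set()).update(values)
    return merged


viking_inventory = {"TARGET_NAME": {"MARS", "DEIMOS", "PHOBOS"}}
galileo_inventory = {"TARGET_NAME": {"JUPITER"}}  # hypothetical second inventory

# The next level up holds the union of what the levels below it hold.
atlas_profile = merge_values([viking_inventory, galileo_inventory])
print(sorted(atlas_profile["TARGET_NAME"]))
```

Under this representation every lower level is a subset of the level above it, which matches the slide's description of inventories carrying subsets of preferred values and the dictionary carrying their union.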

Profile Development
Implement domain specific instances
Apply the domain generic language to a specific domain
  – E.g. Space/Earth Science data and other resources
Model the domain
  – Data Dictionary
  – Data Model
Capture the meta-data
  – Extracted from domain metadata repository

Profile Development
Implement domain specific instances
Inventory Example - PDS Data Set
VO1/VO2-M-VIS-5-DIM-V1.0
VO1/VO2 MARS VISUAL IMAGING SUBSYSTEM DIGITAL IMAGING MODEL...
PDS
GRANULE+
DATA
text/html
DATA_SET_NAME
  VO1/VO2 MARS VISUAL IMAGING SUBSYSTEM DIGITAL IMAGING MODEL...
DATA_OBJECT_TYPE
  IMAGE
TARGET_NAME
  MARS
VOLUME_ID
  VO_ VO_2014

Conclusion
Profile Development - Review
  – Choose a common interchange format. (XML)
  – Develop a domain generic language. (X2PL, XML eXtensible Profile Language)
  – Implement domain specific instances. (Resource Profiles)
  – Develop a system to manage the profiles. (Profile Servers)

Conclusion
Issues
  – Develop space science metadata registry
    – ~10 high level concepts - “Anchor Points”
    – Complete development of discipline registries
  – Determine management policy
    – Design meta-model and mandate conformance
    – Evolve meta-model through voluntary conformance
  – Determine space science metadata standards
    – NASA Data Entity Dictionary Specification Language (DEDSL - XML syntax) currently being used