Cyberinfrastructure and EarthScope Science goals: A GEON perspective


1 Cyberinfrastructure and EarthScope Science goals: A GEON perspective What is Cyberinfrastructure? What is GEON? How will GEON research facilitate discovery and integration of earth science data? What are the benefits of such a research initiative for EarthScope? How can earth scientists participate in Cyberinfrastructure research opportunities? CYBERINFRASTRUCTURE FOR THE GEOSCIENCES A.K.Sinha, Virginia Tech, 2005

2 Cyberinfrastructure
National Science Foundation’s Cyberinfrastructure: Cyberinfrastructure is the organized aggregate of technologies enabling access and coordination of information technology resources to facilitate science, engineering, and societal goals.
– Data access from distributed systems
– Data interoperability
– Computation: grid-based and workflows
– Visualization
– Tools
– Integration: highlighted today
The NSF Blue Ribbon Panel (Atkins) Report provided a compelling and comprehensive vision of an integrated cyberinfrastructure. Modified from Berman, SDSC, 2005

3 A KEY OBSERVATION IN SUPPORT OF CYBERINFRASTRUCTURE RESEARCH IN GEOSCIENCES “Large team** efforts are required to build a federation of data and tools; but smaller groups or individuals working independently and given access to these data and tools can (and likely will) make fundamental discoveries” MODIFIED FROM BLUE RIBBON ADVISORY PANEL ON CYBERINFRASTRUCTURE REPORT, NSF (** such as GEON and other projects)

4 Towards an integrated Earth Science data and knowledge base to achieve EarthScope science and education goals
[Diagram labels: Science Investigators; Educators and the Public; EarthScope Instrumentation and Data (USArray, PBO, SAFOD, InSAR); Cyberinfrastructure Resources; plus semantic integration of other earth science data]

5 Adapted from D. Seber, SDSC

6 Three-dimensional view of the lithosphere-asthenosphere boundary and surface topography of the northern Appalachians. Base of lithosphere interpolated from migrated Ps waveform images at 6 labeled stations (from Rychert et al. 2005). New knowledge about the evolution of continents requires complex integration of geophysical data with data on the age, composition, and physical properties (seismic, thermal, etc.) of the sub-crustal lithosphere, on surface geology, and on the chronology of associated events.

7 What is the geologic and geophysical record of Super-Continent assembly and dispersal?

8 EarthScope Science Targets: Examples from eastern North America
– What is the geologic and geophysical record of Super-Continent assembly and dispersal?
– What are the architectures of terrane boundaries at depth?
– How do composition, temperature and strain fabrics vary within the lithosphere and asthenosphere?
– Are lithospheric and asthenospheric strain coupled?
– How sharp is the lithosphere-asthenosphere boundary? What defines it?
DATA NEEDED TO ADDRESS THESE QUESTIONS ARE DISTRIBUTED ACROSS THE COUNTRY, IN DIFFERENT FORMATS, AND CANNOT BE INTEGRATED IN A WEB ENVIRONMENT WITH EXISTING TECHNOLOGIES. Overcoming heterogeneity is a priority cyberinfrastructure challenge.

9 Outline
– Data integration problem and solutions
– GEON data integration solution: ontology-enabled semantic mediation
– What is ontology?
– Registering data to ontologies
– Discovering data and using workflows in a web environment to go from queries to questions

10 Kinds of heterogeneity (modified from Baru, SDSC, 2005)
– Platform heterogeneity: different OS platforms
– DBMS heterogeneity: different database systems, e.g. SQL Server, MySQL, DB2
– Data type heterogeneity
– Schema heterogeneity
– Heterogeneity in units, accuracy, resolution
– Semantic heterogeneity
GEON Architecture addresses problems of: 1. Variety of data sources and types 2. Discovery and relevance 3. Addressing needs of different communities

11 What is GEON? How can GEON help integrate heterogeneous and distributed data?

12 GEON: The Geosciences Network www.geongrid.org
– GEON is an NSF-funded collaborative research project between IT and Earth Science researchers, with the goal of developing cyberinfrastructure to enable new integrative modes of geosciences research
– GEON is developing a pioneering system that uses knowledge-based techniques to discover, query, and integrate data in the geosciences
– Project participants include 14 PI institutions, as well as partners from other projects, agencies, and industry
– GEON has deployed a Web services-based, distributed computing infrastructure, called the GEONgrid, across the PI and partner sites
– GEONgrid provides access to distributed data collections, tools, and applications
Research and Education Products and Results:
– Technologies for “Smart Search”, On-the-fly Data Integration, GIS Map Integration, Distributed Portals, and 4D Visualization
– Earth Science Research within GEON on 3D Lithospheric Structure, Integrated Geoscience Modeling, Geologic Evolution of North America, and an Ontologic Framework for the Geo-sphere
– Cyberinfrastructure Summer Institute for Geoscientists and Graduate Courses in Geoinformatics

13 GEON and Cyberinfrastructure
– Develop cyberinfrastructure that enables interlinking and sharing of multidisciplinary Earth Science data resources, software and tools
– Create a scientist-friendly portal to access data and software for analysis, modeling, and visualization
– Create the GEONgrid to enable a seamless data integration and analysis environment

14 GEON: GEOsciences Network [architecture diagram labels]
– Portal (login, myGEON): Registration, GEONsearch, GEONworkbench
– Services: Registration Services, Data Integration Services, Indexing Services, Workflow Services, Visualization & Mapping Services
– Core Grid Services: GT3, OGSA-DAI, GSI, CAS, gridFTP, SRB, PostGIS, mySQL, DB2
– Physical Grid: RedHat Linux, ROCKS, Internet, I2, OptIPuter (planned)
– Modeling Environment: Physical model, Model results, HPCC, Data

15 Discovering, sharing and using data in a web environment: GEON style
– Discovery of data resources (e.g., gravity, geologic maps, etc.) requires registration through use of high-level index terms
– GEON has deployed an extension of the AGI Index terms; these will be cross-indexed to others such as GCMD and AGU
– Discovering item-level content of databases requires registration through data-level ontologies (e.g. the column in a geochemical database that represents the SiO2 measurement) and is a requirement for semantic integration
– Item-detail-level registration through ontologies reduces schema-based data heterogeneities
– Computation and modeling tools can be registered for use by the community
– Visualization capabilities
– Easy access to data through the GEON Portal
– Individual workbench built into the GEON Portal
– Scientific Workflow Systems provide computational and query capabilities in a web environment
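
The item-level registration idea on this slide can be illustrated with a small sketch. It is not GEON's registration service: the `Registry` class, the dataset names, and the column names are all hypothetical, chosen only to show how two databases that label silica differently become discoverable through one shared ontology concept.

```python
from collections import defaultdict


class Registry:
    """Hypothetical registry linking (dataset, column) pairs to ontology concepts."""

    def __init__(self):
        self._by_concept = defaultdict(list)

    def register(self, dataset, column, concept):
        # Record that this column of this dataset measures the given concept.
        self._by_concept[concept].append((dataset, column))

    def find(self, concept):
        # Return every registered column for the concept, in registration order.
        return list(self._by_concept.get(concept, []))


registry = Registry()
# Two invented databases that store silica under different column names.
registry.register("va_plutons", "sio2_wt", "SiO2")
registry.register("state_geochem", "SILICA_PCT", "SiO2")
registry.register("state_geochem", "ZR_PPM", "Zr")

for dataset, column in registry.find("SiO2"):
    print(f"{dataset}.{column} holds SiO2 measurements")
```

A user searching for "SiO2" never needs to know either schema; the schema-level differences are absorbed at registration time.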

16 Index terms from AGI used for identifying type of data [Figure labels: AGI Index Terms; GEON Index Ontology]

17 Integration: a buzzword, but with complex solutions
What is Integration?
– Relationships in information contained in heterogeneous and multi-disciplinary databases
What are our choices?
– Layering of data (commonly used)
– View-based techniques (create a virtual schema)
– Schema-based integration (merging of schemas, but the user must be knowledgeable about the organization, e.g. semantics of the schema)
– Ontology-based semantic integration utilizing workflows (favored by GEON)
Data Registration is important for integration!

18 What is Ontology? Why use Ontology?
Ontology: An explicit formal specification of the terms in the domain (e.g. Geology) and the relations among them (Gruber 1993)
Why use ontology?
– To share and reuse domain knowledge
– To make domain assumptions explicit
– To separate domain knowledge from operational knowledge
– To analyze domain knowledge
Ontology Languages:
– RDF and RDFS
– OIL
– DAML+OIL
– OWL: Web Ontology Language, from W3C
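
As a rough illustration of "terms in the domain and relations among them", an ontology fragment can be written as subject, predicate, object statements, which is the shape RDF uses. The sketch below uses plain Python tuples rather than any ontology language; the terms and the predicate names `is_a` and `composed_of` are illustrative assumptions, not taken from a GEON ontology.

```python
# Each statement is (subject, predicate, object), mirroring the RDF triple shape.
TRIPLES = [
    ("Granite", "is_a", "IgneousRock"),
    ("IgneousRock", "is_a", "Rock"),
    ("Rock", "composed_of", "Mineral"),
    ("Zircon", "is_a", "Mineral"),
]


def objects(subject, predicate, triples=TRIPLES):
    """Return every object related to `subject` by `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


print(objects("Granite", "is_a"))      # direct parent class of Granite
print(objects("Rock", "composed_of"))  # a non-hierarchical relation
```

Writing the relations down explicitly is what lets software, and not only a domain expert, tell a class hierarchy (`is_a`) apart from other kinds of relationship.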

19 Motivations for Using Ontologies in GEON
– A better way to discover and understand datasets: use the knowledge in ontologies to find datasets
– A better way to query datasets: query through ontologies without knowing the details of the schemas
– A better way to integrate multiple datasets: integrate multiple datasets on the fly if they are registered to ontologies
– A better way to segment large databases: transfer only the parts of databases required for integration
Modified from Kai Lin, SDSC, 2005. An emerging research frontier: Geo-Ontology

20 Class Diagrams - The Basic Building Block for Semantic Integration. Earth Scientists create disciplinary ontologies!

21 Earth Science research: stages in developing ontologies
– Napkin stage
– Concept map stage
– High-level ontology: integrated GEON, SWEET and NADM stage
– GEON formal ontology

22 High-Level Ontology Packages: representing relationships

23 [Diagram labels: Planetary Material; Minerals; Rocks; State of Matter; Element; Data Types]

24 GEON Cyberinfrastructure: more than just about the data, GEON is about going from simple Queries to complex Questions

25 A Home Buyer’s Information Integration Problem: What houses for sale under $500k have at least 2 bathrooms, 2 bedrooms, a nearby school ranking in the upper third, in a neighborhood with below-average crime rate and diverse population? [Diagram labels: Information Integration; sources: Realtor, Demographics, School Rankings, Crime Stats; “Multiple-Worlds” Mediation] Bertram Ludäscher, SDSC

26 A query example: Use SQL to ask a database to show you all white wines from California of 2003 vintage. A question: "Tell me what wines I should buy to serve with each course of the following menu. And, by the way, I don't like Sauternes." (from W3C) Answering the question requires two databases (e.g. food and wine) and prescribed relationships between them, defined for computers as ontologies. Bertram Ludäscher, SDSC
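
The "query" half of this slide is easy to make concrete. The sketch below runs the wine query against an in-memory SQLite database; the table layout and the three rows are invented for illustration. The "question" half cannot be answered this way, since nothing in a single wine table relates wines to menu courses.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE wines (name TEXT, color TEXT, region TEXT, vintage INTEGER)")
conn.executemany(
    "INSERT INTO wines VALUES (?, ?, ?, ?)",
    [
        # Hypothetical rows, not real products.
        ("Example Chardonnay", "white", "California", 2003),
        ("Example Zinfandel", "red", "California", 2003),
        ("Example Riesling", "white", "Washington", 2003),
    ],
)

# The query from the slide: white wines from California of the 2003 vintage.
rows = conn.execute(
    "SELECT name FROM wines WHERE color = ? AND region = ? AND vintage = ?",
    ("white", "California", 2003),
).fetchall()

print(rows)
```

The query works only because the user already knows the table and column names, which is exactly the knowledge that ontology-based mediation is meant to make unnecessary.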

27 The Problem: Scientific Data Integration or: … from Queries to Questions

28 Data Registration: key to integration
– Click on Submission to register a dataset
– Input a dataset name
– Select a zipped shapefile
– Choose an ontology class

29 Registration at the item detail level using data ontology: working with data

30 GEONsearch: building on data registration
– Choose subject (from a “base” ontology)
– Choose location (from a gazetteer Web service)
– Choose a time (numeric range or from a time ontology Web service)
– Choose concepts from ontologies
Kai Lin, SDSC, 2005

31 Ontology-Enabled Map Integration: A Case Study
Geologic data sets: Arizona, Idaho, Montana, Utah, Nevada, Colorado, Wyoming, New Mexico
Ontologies:
– Geologic Time Scale
– Multihierarchical Rock Classification from the Geological Survey of Canada
– British Rock Classification Scheme
[Figure: snapshot after querying “Paleozoic”]
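
A query such as “Paleozoic” can match map units from several states only if the system knows which finer-grained ages fall inside the Paleozoic. The sketch below shows that expansion step under simplifying assumptions: the time scale fragment lists only era, period, and Carboniferous subperiod names, and the map units are invented placeholders, not records from the state maps named on the slide.

```python
# Fragment of the geologic time scale: parent term -> child terms.
TIME_SCALE = {
    "Paleozoic": ["Cambrian", "Ordovician", "Silurian",
                  "Devonian", "Carboniferous", "Permian"],
    "Carboniferous": ["Mississippian", "Pennsylvanian"],
}


def expand(term, hierarchy=TIME_SCALE):
    """Return the term together with all of its descendants in the hierarchy."""
    terms = {term}
    for child in hierarchy.get(term, []):
        terms |= expand(child, hierarchy)
    return terms


# Hypothetical map units; each state map labels ages at a different level of detail.
MAP_UNITS = [
    {"state": "Arizona", "unit": "unit_a", "age": "Permian"},
    {"state": "Utah", "unit": "unit_b", "age": "Mississippian"},
    {"state": "Nevada", "unit": "unit_c", "age": "Jurassic"},
    {"state": "Idaho", "unit": "unit_d", "age": "Paleozoic"},
]


def query_by_age(term, units=MAP_UNITS):
    """Select units whose registered age is the term or any descendant of it."""
    wanted = expand(term)
    return [u["unit"] for u in units if u["age"] in wanted]


print(query_by_age("Paleozoic"))
```

The same pattern would apply to a rock classification hierarchy: the query term is expanded through the ontology first, and only then compared with the labels each map actually uses.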

32 Scientific Workflow Systems in GEON: adding computational capability in a web environment
– Promote “scientific discovery” by providing tools and methods to generate scientific workflows
– Support computational infrastructure for modeling, classification, computation
– Design frameworks which define efficient ways to connect to existing data and to integrate heterogeneous data from multiple resources

33 Workflow layout for rock classification, which can be used for any query that requires a classifier [Figure labels: PointInPolygon algorithm; Find data on the basis of ontologic registration]
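
The slide names a point-in-polygon step but does not say which algorithm the workflow uses. One common choice is ray casting, sketched here as an assumption: a sample plots inside a classification field if a horizontal ray from the sample crosses the field boundary an odd number of times. The square polygon is a stand-in for a real classification field.

```python
def point_in_polygon(x, y, polygon):
    """Ray-casting test; `polygon` is a list of (x, y) vertices in order."""
    inside = False
    j = len(polygon) - 1  # index of the previous vertex
    for i in range(len(polygon)):
        xi, yi = polygon[i]
        xj, yj = polygon[j]
        # Does the edge (j -> i) straddle the horizontal line through y?
        if (yi > y) != (yj > y):
            # x coordinate where the edge crosses that horizontal line.
            x_cross = xi + (y - yi) * (xj - xi) / (yj - yi)
            if x < x_cross:
                inside = not inside
        j = i
    return inside


# Stand-in classification field: a square with corners (0, 0) and (4, 4).
field = [(0, 0), (4, 0), (4, 4), (0, 4)]
print(point_in_polygon(2, 2, field))
print(point_in_polygon(5, 2, field))
```

Points that fall exactly on a boundary are not handled specially in this sketch; a production classifier would need an explicit rule for them.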

34 Integration Scenario: A-type pluton query
– Classifying A-types from an igneous rock database
– Integrating between relational and spatial (shapefile) databases to query and interactively display GIS results

35 Integration Scenario: Stages for access to data and tools in a workflow environment
The integration scenario: What are the distribution and U/Pb zircon ages of A-type plutons in Virginia?
[Workflow diagram labels: Ontology System; Location: States: Virginia; Classification System: Rock Classifiers: Igneous: Pluton: A-type; Mineral: Zircon; Geologic Time: Dating Methods: U-Pb Zircon Methods; classification plot of 10^4 Ga/Al versus Zr separating I & S type from A type; workflow stages numbered 1 to 6]
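
The scenario chains three steps: restrict by location, classify plutons as A-type from their geochemistry, then join the result with a separate geochronology source. The sketch below strings those steps together under loud assumptions: the pluton records and ages are invented, a simple pair of cutoffs stands in for the classification field on the 10^4 Ga/Al versus Zr plot, and the cutoff values themselves are placeholders to be replaced by whatever classifier the workflow registers.

```python
from dataclasses import dataclass


@dataclass
class Pluton:
    name: str
    state: str
    ga_al_x1e4: float  # 10^4 * Ga/Al
    zr_ppm: float


# Placeholder cutoffs standing in for the A-type field on the plot (assumption).
GA_AL_CUTOFF = 2.6
ZR_CUTOFF = 250.0


def is_a_type(p):
    """Classify a pluton as A-type if it plots above both placeholder cutoffs."""
    return p.ga_al_x1e4 > GA_AL_CUTOFF and p.zr_ppm > ZR_CUTOFF


def a_type_ages(plutons, ages, state):
    """Join classified plutons in one state with ages from a second source."""
    return sorted(
        (p.name, ages[p.name])
        for p in plutons
        if p.state == state and is_a_type(p) and p.name in ages
    )


# Hypothetical geochemistry database.
plutons = [
    Pluton("pluton_1", "Virginia", 3.4, 610.0),
    Pluton("pluton_2", "Virginia", 1.9, 140.0),
    Pluton("pluton_3", "Idaho", 3.1, 480.0),
]
# Hypothetical U-Pb zircon ages in Ma, held in a separate database.
ages = {"pluton_1": 730, "pluton_2": 440, "pluton_3": 50}

print(a_type_ages(plutons, ages, "Virginia"))
```

In the workflow on the slide, each of these steps would be a registered service discovered through the ontology, and the final list would be plotted on a map instead of printed.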

36 Distribution and ages of A-type plutons, based on integration of multiple databases

37 How do earth scientists participate in Cyberinfrastructure research?
– Know your data: its content and definitions
– Think more broadly: integration is between databases that are different from yours
– Learn more about how to use IT through the summer workshop at SDSC, as well as others sponsored by Societies
– Register your data using Index Terms through the GEON Portal to facilitate discovery of databases; use data ontology for discovery of data
– Build and share tools and services for use in a web environment
– Construct concept maps in your discipline; these lead to the formal ontologies required for semantic integration (remember Geo-Ontology)
EarthScope requires integrative capabilities

38 Some processes and objects typically involved in crustal melting. [Figure labels: melting; heating] From Cal Barnes, Texas Tech, 2005. From Objects to Processes: just the beginning of a new integrative world. Now where?

39 Two important events at GSA
– DIVISION OF GEOINFORMATICS (Data to Knowledge): FIRST Business Meeting will take place during the upcoming National GSA meeting, Salt Lake City, Tuesday, 18 October, Ballroom D, 5.45-7.45pm
– GEON and EarthScope Reception: Monday, 17 October, Hilton Salt Lake City Center, Alpine West Ballroom, 5.00-7.00pm