“curator” DB design Curator meeting, GFDL, Sep 20.

Slides:



Advertisements
Similar presentations
Data Base. Objective Become familiar with database terminology. Create a project to display data for a single database table. Use a DataGrid control.
Advertisements

Components of GIS.
Earth System Curator Spanning the Gap Between Models and Datasets.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Implementation of Web Service Technologies in GFDL's FMS Runtime Environment Y. Malysheva, S. Nikonov, V. Balaji GFDL The 7 th GO-ESSP Workshop September.
Two main requirements: 1. Implementation Inspection policies (scheduling algorithms) that will extand the current AutoSched software : Taking to account.
Requirements Specification
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
Geographic Information Systems
Interpret Application Specifications
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Chapter 2 Introduction to Database Development Database Processing David M. Kroenke © 2000 Prentice Hall.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
The Earth System Curator Metadata Representations Prototype Portal in Collaboration with ESMF and ESG Rocky Dunlap Spencer Rugaber Georgia Tech.
Information Systems Chapter 5 Building the database Part 1. Unsing Access.
10-1 aslkjdhfalskhjfgalsdkfhalskdhjfglaskdhjflaskdhjfglaksjdhflakshflaksdhjfglaksjhflaksjhf.
Software Engineering 2003 Jyrki Nummenmaa 1 CASE Tools CASE = Computer-Aided Software Engineering A set of tools to (optimally) assist in each.
Event Driven Programming
OracleAS Reports Services. Problem Statement To simplify the process of managing, creating and execution of Oracle Reports.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
Configuration Management (CM)
UWG 2013 Meeting PO.DAAC Web Services Demo. What are PO.DAAC Web Services?
Prototype system of the Japanese Virtual Observatory The Japanese Virtual Observatory (JVO) aims at providing easy access to federated astronomical databases.
JVO JVO Portal Japanese Virtual Observatory (JVO) Prototype 2 Masahiro Tanaka, Yuji Shirasaki, Satoshi Honda, Yoshihiko Mizumoto, Masatoshi Ohishi (NAOJ),
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
DEPICT: DiscovEring Patterns and InteraCTions in databases A tool for testing data-intensive systems.
Learningcomputer.com SQL Server 2008 – Administration, Maintenance and Job Automation.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
SQL Reporting Services From a Developers Perspective Adam Calderon Principal Engineer Interknowlogy LLC
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
ATLAS Detector Description Database Vakho Tsulaia University of Pittsburgh 3D workshop, CERN 14-Dec-2004.
Andrey Meeting 7 October 2003 General scheme: jobs are planned to go where data are and to less loaded clusters SUNY.
1 © 1999 Microsoft Corp.. Microsoft Repository Phil Bernstein Microsoft Corp.
Mark Kvamme Sequoia Capital Content Happens!. Remember These Guys?
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
FRErator – the Bridge between FRE and Curator DB.
1 GFDL Data Portal Current Status, Achievements and Future Development NOAATECH-2006 K.Dixon, V.Balaji, S.Nikonov GFDL, Princeton.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Curator: Gap Analysis (from a schema perspective) Rocky Dunlap Spencer Rugaber Georgia Tech.
Building Community and Capability through Common Infrastructure: ESMF and the Earth System Curator Cecelia DeLuca MAP Meeting College.
Module 1: Introduction to Microsoft SQL Server Reporting Services
Application of RDF-OWL in the ESG Ontology Sylvia Murphy: Julien Chastang: Luca Cinquini:
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
GFDL Data Portal Update: Curator DB Approach S.Nikonov, V.Balaji, K.Dixon GFDL The 5 th GO-ESSP Workshop June , LLNL.
Introduction to SQL Server 2000 Reporting Services Jeff Dumas Technical Specialist Microsoft Corporation
Andy Roberts Data Architect
Hydroinformatics Lecture 15: HydroServer and HydroServer Lite The CUAHSI HIS is Supported by NSF Grant# EAR CUAHSI HIS Sharing hydrologic data.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
Embedding Live Access Server into GFDL Data Portal Infrastructure K.O’Brien (PMEL), S.Nikonov (GFDL), R.Schweitzer (PMEL), S.Hankin (PMEL), V.Balaji (GFDL)
CPSC-310 Database Systems
Integrating ArcSight with Enterprise Ticketing Systems
PLM, Document and Workflow Management
Lecture 8 Database Implementation
CUAHSI HIS Sharing hydrologic data
Deploying and Configuring SSIS Packages
Phil Bernstein Microsoft Corp.
Database Management System (DBMS)
Client Access, Queries, Stored Procedures, JDBC
Introduction to D4Science
Knowledge Based Workflow Building Architecture
DATABASES WHAT IS A DATABASE?
Metadata Development in the Earth System Curator
Chapter 1 Introduction to Database Processing
CPSC-608 Database Systems
Presentation transcript:

“curator” DB design Curator meeting, GFDL, Sep 20

2 Why RDBMS  A lot of information:  Model metadata  Experiments metadata  Institution/user metadata  Data metadata  Mostly it’s in textual form  Information is internally linked tightly that can be easy to express by means of relational databases.  Relational databases have well developed means for searching and extracting procedures (SQL query language and program interfaces for any language) as for local as well as for remote user.  Very reliable, safety technology. Curator meeting, GFDL, Sep 20

3 Desirable Features of Model Data Factory  Relational Database storing metadata, containing description of  model components and model configuration  scenarios  postprocessing (model output and CMOR) directives  experiments  variables  formalized rules of Quality Control  data locations  task scheduler  users and groups accounts  XML as data exchange format  for compliance with FRE  working format of existing third party software  good fitted for hierarchical metadata description  prevalent in world, easy to exchange with others Data Portals  Model Builder (FMS Runtime Environment in GFDL)  checks out available model components from DB  chooses model datasets from DB  sets postprocessing directives  checks components and configurations compatibility  builds executable application and runs it  write metadata about experiment into DB (model configuration, scenario, project, organization/user, postprocessing) Curator meeting, GFDL, Sep 20

4 Desirable Features of Model Data Factory (continue)  Climate Model Output Rewriter (CMOR) subsystem  prepares data consistently with specific project requirements  Data Publisher  transfer data to Data Portal storage in accordance to settings from DB  Data Portal Software Package  Configuration Manager (configures Aggregation Server and Data Portal Interface)  Search Catalog Engine  Data Subsampling Engine  Data Computation Engine  Data Visualization  Data Delivery Manager Curator meeting, GFDL, Sep 20

5 Standard scenario of functioning Model Data Factory (ideal picture)  Scientist builds model in FRE using available model components, datasets and forcing scenario.  FRE puts metadata about built model, scenario, experiment into “curator” DB and runs experiment;  Postprocessing subsystem extracts metadata about postprocessing plan from “curator” DB and executes it, and on finish puts metadata about processed experiment back into DB.  Data Publisher (DP) regularly checks “curator” DB for new experiments marked as “public” and if finds any invokes CMOR.  CMOR goes to “curator” DB for metadata and processes needed data following metadata instructions.  DP calls QAC and then transfers data to Data Portal storage.  Configuration Manager configures Aggregation Server and Data Portal Interface and puts records about new public data in “curator” DB.  End of process, data is ready to go. Curator meeting, GFDL, Sep 20

6 Common functionality schema of ‘Model Data Factory’ Curator meeting, GFDL, Sep 20

7 Database Compartments:  Model Metadata Compartment contains models’ descriptions, allows to build coupled model of needed configuration contains models’ descriptions, allows to build coupled model of needed configuration  Variables Compartment List of all related physical variables List of all related physical variables  Workflow Compartment contains scenarios, experiments, institutions, projects and users info contains scenarios, experiments, institutions, projects and users info  Postprocessing Compartment defines postprocessing plan for conducting experiment defines postprocessing plan for conducting experiment  Data Portal Compartment contains info about experiments data contains info about experiments data Database ‘curator’ design Database ‘curator ’ design Curator meeting, GFDL, Sep 20

8 MySQL DB CURATOR Curator meeting, GFDL, Sep 20

9 Model Metadata Compartment (in development) Coupled_Models Model_List Component_Medias Models Experiments Workflow Compartment Variables Variables Compartment Curator meeting, GFDL, Sep 20

10 Data Samples from Model Compartment Components_Medias Coupled_Models Model_List Models Curator meeting, GFDL, Sep 20

11 Variables Compartment Projects Workflow Compartment Variables Variable_Bundles Variable_Lists Variable_List_Contents Proj_Var_Names Curator meeting, GFDL, Sep 20

12 Variable_Lists Variable_List_Contents Data Sample from Variables Compartment Proj_Var_Names Variables Variable_Bundles Curator meeting, GFDL, Sep 20

13 Workflow Compartment (in development) InstitutionsGFDL_USERS Experiment_Status Realization Projects Experiments Scenarios Curator meeting, GFDL, Sep 20

14 Data Samples from Workflow Compartment Experiments Scenarios Curator meeting, GFDL, Sep 20

15 Coupled_Models Postprocessing Compartment PP_Units Post_Proc PP_Content Data Samples from Postprocessing Compartment PP_Units PP_Content Variable_Lists Projects GFDL_USERS Average_Periods Curator meeting, GFDL, Sep 20

16 Data Portal Compartment MissedData_Descriptors Data_GridsData_Files Variables Experiments Variable_Bundles Coupled_Models Curator meeting, GFDL, Sep 20

17 Data Samples from Data Portal Compartments Data_Files Data_Grids MissedData_Descriptors Curator meeting, GFDL, Sep 20

18 “curator” DB is in use now: CM2.0 CM2.0 CM2.0 CM2.1 CM2.1 CM2.1 Curator meeting, GFDL, Sep 20

19 Future Development  Bring DB terms to conventional terminology.  Set up model metadata schema standards and create tables in “curator” DB following this schema.  Fill these tables with real metadata extracted from models of GFDL, CCSM, MIT and from ESMF Component Database.  Implement tables for observation data metadata.  Implement DODS aggregated data support.  Build XML bridge for XML transcoding DB input/output Curator meeting, GFDL, Sep 20

20 END ENDQuestions? Suggestions? Suggestions? Objections? Objections?Thanks! Curator meeting, GFDL, Sep 20