ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.

Presentation transcript:

ESP workshop, Sept 2003
The Earth System Grid data portal
Presented by Luca Cinquini (NCAR/SCD/VETS)
Acknowledgments:
- ESG collaboration: ANL, ISI, LBNL, LLNL, NCAR, ORNL
- Funding: DOE/SciDAC

Introduction
- ESG goal: build a grid for the geosciences that enables management, discovery, distributed access, processing, and analysis of distributed terascale climate research data
- Use Globus and Grid technologies
- The ESG challenges (with respect to CDP):
  - Federation of distinct DOE and NSF data repositories, each with different data access policies and procedures
  - Interoperation between diverse remote storage systems at distinct sites (HPSS at ORNL and NERSC, MSS at NCAR)
  - Explicit management of file replicas
  - Use of powerful but new technology
- The ESG advantages: data scope is more limited (initially only climate model data in netCDF format)

Overview of ESG web portal
- Currently under major revision:
  - Moving to MVC (Tomcat/Struts) architecture
  - New UI under development
  - Developing/plugging new applications
- Goal: easy-to-use, grid-enabled, production-level web portal by January 2004 to serve CCSM IPCC data to wide community of climate scientists
  - Will be instrumental in determining the success of ESG
- ESG web portal components:
  - Security
  - Metadata
  - Data access

ESG metadata architecture: ESG or climate metadata
- Collection-level description metadata
- Hierarchical information (project -> ensemble -> simulation -> dataset -> logical datafile)
- Stored in a set of relational tables in the OGSA-DAI MySQL database at ISI (RDBMS with OGSA-alpha GS interface)
- Input and output of the database is XML
- ESG web portal allows several power user queries of the db
- Base for simple free text search and discovery of data

ESG metadata architecture: location/replica metadata
- Indicates the physical locations of the many copies (replicas) of a single logical file: lfn -> [1..n] pfn
- Stored in a system of distributed RLS (Replica Location Services): cross-updating grid-enabled MySQL databases installed at each site
- Each RLS is composed of two parts:
  - LRC (Local Replica Catalog): stores lfn -> [1..n] pfn at site
  - RLI (Replica Location Index): stores all lfns at that site, notifies federated sites
- Any RLI in the system can be used as a starting point for obtaining all replicas (at any site) of a given lfn
- Data search and discovery example: “pcm”
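The lfn -> pfn lookup described on this slide can be illustrated with a small in-memory model. This is only a sketch under stated assumptions: the class names, site names, and storage URLs are hypothetical, plain Python dictionaries stand in for the MySQL databases, and nothing here uses the actual Globus RLS client API.

```python
from collections import defaultdict


class LocalReplicaCatalog:
    """Toy LRC: maps each logical file name (lfn) to the pfns held at one site."""

    def __init__(self, site):
        self.site = site
        self._pfns = defaultdict(list)

    def register(self, lfn, pfn):
        self._pfns[lfn].append(pfn)

    def lookup(self, lfn):
        # Return a copy so callers cannot mutate the catalog.
        return list(self._pfns.get(lfn, []))

    def lfns(self):
        return set(self._pfns)


class ReplicaLocationIndex:
    """Toy RLI: records which LRCs know about each lfn."""

    def __init__(self):
        self._lrcs = defaultdict(list)

    def update_from(self, lrc):
        # Hypothetical stand-in for the cross-update between federated sites.
        for lfn in lrc.lfns():
            if lrc not in self._lrcs[lfn]:
                self._lrcs[lfn].append(lrc)

    def replicas(self, lfn):
        # Find the LRCs that hold the lfn, then ask each one for its pfns.
        return {lrc.site: lrc.lookup(lfn) for lrc in self._lrcs.get(lfn, [])}


# Hypothetical sites and paths, for illustration only.
ncar = LocalReplicaCatalog("NCAR")
ornl = LocalReplicaCatalog("ORNL")
ncar.register("pcm.run1.nc", "mss://ncar.example/pcm/run1.nc")
ornl.register("pcm.run1.nc", "hpss://ornl.example/pcm/run1.nc")

index = ReplicaLocationIndex()
index.update_from(ncar)
index.update_from(ornl)
print(index.replicas("pcm.run1.nc"))
```

The point of the two-level design is that a client needs to contact only one index to learn where every replica of a file lives, while each site remains the authority for its own physical file names.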

ESG metadata topology
[Diagram. Components: ESG web portal (Tomcat/Struts); OGSA-DAI MySQL RDBMS; RLI and LRC; storage (MSS, HPSS, disk). Sites: LBNL, ISI, LLNL, NCAR, ORNL. Connections: query, cross-update.]

Activity diagram: data search and discovery
[Diagram with lanes "ESG user" and "ESG system", divided into step 1, step 2, step 3. Flow: query string -> OGSA-DAI DB at ISI -> [0-n] simulation ids -> OGSA-DAI DB at ISI -> [0-m] dataset ids -> OGSA-DAI DB at ISI -> [0-l] lfns -> RLI at NCAR -> [0-l] LRC -> LRC at NCAR, LBNL, LLNL, ORNL -> [0-k] pfn]
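The chain of lookups in this diagram (query string to simulation ids, to dataset ids, to lfns, to pfns) can be sketched as one function over in-memory tables. Everything below is an illustrative assumption: the function name, the dictionary layouts, and the sample records are hypothetical, and the real system queried the OGSA-DAI database and the RLS servers rather than Python dictionaries.

```python
def find_replicas(query, simulations, datasets, files, rli, lrcs):
    """Resolve a free-text query to physical file names (pfns) per lfn."""
    q = query.lower()

    # Metadata lookups: query -> simulation ids -> dataset ids -> lfns.
    sim_ids = [sid for sid, text in simulations.items() if q in text.lower()]
    dataset_ids = [ds for sid in sim_ids for ds in datasets.get(sid, [])]
    lfns = [lfn for ds in dataset_ids for lfn in files.get(ds, [])]

    # Replica lookups: the index names the sites, each site's catalog gives pfns.
    result = {}
    for lfn in lfns:
        pfns = []
        for site in rli.get(lfn, []):
            pfns.extend(lrcs.get(site, {}).get(lfn, []))
        result[lfn] = pfns
    return result


# Hypothetical sample records, for illustration only.
simulations = {"sim1": "PCM control run", "sim2": "CCSM test run"}
datasets = {"sim1": ["ds1"], "sim2": ["ds2"]}
files = {"ds1": ["pcm.atm.nc"], "ds2": ["ccsm.atm.nc"]}
rli = {"pcm.atm.nc": ["NCAR", "ORNL"]}
lrcs = {
    "NCAR": {"pcm.atm.nc": ["mss://ncar.example/pcm.atm.nc"]},
    "ORNL": {"pcm.atm.nc": ["hpss://ornl.example/pcm.atm.nc"]},
}

print(find_replicas("pcm", simulations, datasets, files, rli, lrcs))
```

Each stage may return zero results, which matches the [0-n], [0-m], [0-l], and [0-k] multiplicities in the diagram: a file with no registered replica simply maps to an empty list.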

ESG web portal authentication
- Security is at the centre of Globus/Grid software (GSI: Grid Security Infrastructure)
- A blessing and a curse at the same time: security can be an undesired burden on the user
- ESG graduated security model:
  - Data publishing operations (updating databases, moving data between sites) will be performed from the shell and require certificate-based authentication/authorization
  - Data access operations (metadata search and browsing, data download, data analysis) will be performed through the web portal and require only encrypted username/password; certificates will be used in the background but never exposed to the user
  - Note that some operations (retrieval from MSS, HPSS) are subject to local policies
- ESG currently developing “Mailman-like” grid-enabled web-based registration/authentication package to be deployed on ESG web portal
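The graduated security model amounts to a mapping from the kind of operation to the credential the user must present. A minimal sketch of that mapping follows; the operation names and the `required_credential` function are hypothetical labels chosen for illustration, not part of any ESG or Globus interface.

```python
from enum import Enum


class Credential(Enum):
    CERTIFICATE = "certificate"
    PASSWORD = "encrypted username/password"


# Hypothetical operation names for the two groups named on the slide.
PUBLISHING_OPERATIONS = {"update_database", "move_data_between_sites"}
ACCESS_OPERATIONS = {
    "search_metadata",
    "browse_metadata",
    "download_data",
    "analyze_data",
}


def required_credential(operation):
    """Return the credential a user must present for the given operation."""
    if operation in PUBLISHING_OPERATIONS:
        # Performed from the shell with certificate-based authentication.
        return Credential.CERTIFICATE
    if operation in ACCESS_OPERATIONS:
        # Performed through the portal; certificates stay in the background.
        return Credential.PASSWORD
    raise ValueError(f"unknown operation: {operation!r}")


print(required_credential("download_data").value)
```

Local site policies, such as those governing retrieval from MSS or HPSS, would sit on top of this check and are not modeled here.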

Activity diagram: new user registration
[Diagram. Participants: ESG user, ESG administrator, web browser, client, ESG web portal, ESG CA, MyProxy server. Labeled steps:]
1. username, password, role
2. confirmation request
3. confirmation response
4. processing request
5. accept, reject
6. certificate request, response
7. store certificate
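The seven labeled steps of the registration diagram can be read as a small state machine. The sketch below is one possible reading, not the ESG implementation: the state names, event names, and the assumption that each step strictly follows the previous one are all hypothetical.

```python
from enum import Enum, auto


class State(Enum):
    SUBMITTED = auto()              # step 1: username, password, role entered
    AWAITING_CONFIRMATION = auto()  # step 2: confirmation request sent
    CONFIRMED = auto()              # step 3: confirmation response received
    UNDER_REVIEW = auto()           # step 4: processing request sent to administrator
    ACCEPTED = auto()               # step 5: administrator accepts
    REJECTED = auto()               # step 5: administrator rejects
    CERTIFIED = auto()              # step 6: certificate issued by the CA
    STORED = auto()                 # step 7: certificate stored in MyProxy


# Allowed (state, event) pairs; anything else is an error.
TRANSITIONS = {
    (State.SUBMITTED, "confirmation_request"): State.AWAITING_CONFIRMATION,
    (State.AWAITING_CONFIRMATION, "confirmation_response"): State.CONFIRMED,
    (State.CONFIRMED, "processing_request"): State.UNDER_REVIEW,
    (State.UNDER_REVIEW, "accept"): State.ACCEPTED,
    (State.UNDER_REVIEW, "reject"): State.REJECTED,
    (State.ACCEPTED, "certificate_response"): State.CERTIFIED,
    (State.CERTIFIED, "store_certificate"): State.STORED,
}


def advance(state, event):
    """Return the next state, or raise if the event is not allowed."""
    try:
        return TRANSITIONS[(state, event)]
    except KeyError:
        raise ValueError(f"event {event!r} not allowed in state {state.name}") from None


def run(events, state=State.SUBMITTED):
    """Apply a sequence of events starting from the given state."""
    for event in events:
        state = advance(state, event)
    return state


happy_path = [
    "confirmation_request",
    "confirmation_response",
    "processing_request",
    "accept",
    "certificate_response",
    "store_certificate",
]
print(run(happy_path).name)
```

Modeling the flow this way makes the rejected branch explicit: once the administrator rejects a request, no certificate can be requested or stored for it.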

Activity diagram: new user login
[Diagram. Participants: ESG user, web browser, ESG web portal, MyProxy server, grid restricted application. Labeled steps:]
1. username, password
2. authentication request
3. proxy certificate
4. proxy certificate

Web-based data retrieval from online & remote storage
- HRM (Hierarchical Resource Manager):
  - Allows (massive) data movement between a site local disk cache and remote storage (HPSS, MSS and others are supported)
  - Allows transfer of data between disks at separate sites through gridFTP (fast, parallel, tunable transfer protocol)
  - Many features: reliability, cache management, monitoring etc.

ESG data retrieval
[Diagram. Components: ESG web portal (Tomcat/Struts); MyProxy; GRAM gatekeeper; gridFTP server; HRM in front of storage (MSS, HPSS, disk). Sites: LBNL, LLNL, NCAR, ORNL. Connections: authenticate, submit, execute, gridFTP.]

Future development of ESG web portal
- Goal: have friendly, easy, production-level web portal by January ’04 to serve IPCC data:
  - Iron out any remaining metadata details
  - Install new web-based registration system
  - Improve user experience to download the data
  - Shorten steps/time required to find the replicas
  - Allow more friendly hierarchical data browsing (may generate THREDDS catalogs from OGSA-DAI and RLS)
  - Possibly, develop more powerful query capabilities of ESG metadata database
  - Possibly, add visualization and analysis capabilities

ESG topology (Sept 2003)
[Diagram. Components: ESG web portal (Tomcat/Struts); OGSA-DAI MySQL RDBMS; RLI and LRC; HRM in front of storage (MSS, HPSS, disk); MyProxy; GRAM gatekeeper; gridFTP server; LAS server; CAS. Sites: LBNL, ISI, LLNL, NCAR, ORNL, ANL. Connections: query, cross-update, gridFTP, authenticate, submit, execute, visualize.]