Space Physics Interactive Data Resource – SPIDR :Dr. ZHI N, Mik hail Dr. ZHI ZHI N, Mik hail (Ge oph ysic al Cen ter Rus sian Aca d. Sci. ) Dr. KIH N,

Slides:



Advertisements
Similar presentations
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Advertisements

ESA Data Integration Application Open Grid Services for Earth Observation Luigi Fusco, Pedro Gonçalves.
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Internet GIS and Wireless Mobile GIS for Disaster Management by Dr. Ming-Hsiang (Ming) Tsou Phone:
Center for Environmental Studies Arizona State University Digital Research Records at Center for Environmental Studies Peter McCartney.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
The UDK: The Environmental Data Catalog of Germany and Austria Dr. Fred Kruse Coordination Center UDK/GEIN.
Information Technology for Ocean Observations and Climate Research TYKKI Workshop, December 9-11, 1998, Tokyo, Japan Nancy N. Soreide NOAA Pacific Marine.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth system science K. Ronneberger, DKRZ, Germany.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
AIRNow-International The future of the United States real-time air quality reporting and forecasting program and GEOSS participation John E. White U.S.
DISTRIBUTED DATA FLOW WEB-SERVICES FOR ACCESSING AND PROCESSING OF BIG DATA SETS IN EARTH SCIENCES A.A. Poyda 1, M.N. Zhizhin 1, D.P. Medvedev 2, D.Y.
ESSE Environmental Scenario Search Engine for the Data Services Grid Mikhail Zhizhin, Geophysical Center Russian Academy of Sciences Eric Kihn,
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NOAA/NGDC ESIP Meeting January 7 th, 2009 Simple NOAA Archive Access Portal.
[The Virtual Radiation Belt Observatory] Bob Weigel (George Mason University) Software: Eric Kihn (NOAA/NGDC, ViRBO Web and API) Mikhail Zhizhin (RFO,
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
Fundamentals of Database Chapter 7 Database Technologies.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Moving Large Amounts of Data Rob Schuler University of Southern California.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
High Data Volume Transfer Issues at NOAA Christopher D. Elvidge Earth Observation Group National Oceanic and Atmospheric Administration National Geophysical.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Policy Based Data Management Data-Intensive Computing Distributed Collections Grid-Enabled Storage iRODS Reagan W. Moore 1.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
Building the e-Minerals Minigrid Rik Tyer, Lisa Blanshard, Kerstin Kleese (Data Management Group) Rob Allan, Andrew Richards (Grid Technology Group)
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
A Brave NEtWork World Rob Willis, Ross & Associates Node Mentoring Workshop New Orleans, LA February 28, 2005.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
WGISS and GEO Activities Kathy Fontaine NASA March 13, 2007 eGY Boulder, CO.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth System Science S. Kindermann, DKRZ, Germany.
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
ASIAES: Update and Status Pakorn Apaphant GISTDA, Thailand May 20-27, 2007 WGISS 23, Hanoi, Vietnam.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
What is NCIA? National Cancer Imaging Archive Searchable repository of in vivo cancer images in DICOM format Publicly available at no cost over the Internet.
2. WP9 – Earth Observation Applications ESA DataGrid Review Frascati, 10 June Welcome and introduction (15m) 2.WP9 – Earth Observation Applications.
Intro to Web Services Dr. John P. Abraham UTPA. What are Web Services? Applications execute across multiple computers on a network.  The machine on which.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NGDC CLASS Developers Meeting July 14th, 2008 Simple NOAA Archive Access Portal.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Gaia An Infrastructure for Active Spaces Prof. Klara Nahrstedt Prof. David Kriegman Prof. Dennis Mickunas
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
ODP V2 Data Provider overview. 22 Scope Data Provider provides access to data and metadata of the local data systems. Data Provider is a wrapper, installed.
Servicing Seismic and Oil Reservoir Simulation Data through Grid Data Services Sivaramakrishnan Narayanan, Tahsin Kurc, Umit Catalyurek and Joel Saltz.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Russian Academy of Sciences
IP Publishing From IP Data Base to IP list to IP catalog
Google Sky.
Presentation transcript:

Space Physics Interactive Data Resource – SPIDR :Dr. ZHI N, Mik hail Dr. ZHI ZHI N, Mik hail (Ge oph ysic al Cen ter Rus sian Aca d. Sci. ) Dr. KIH N, Eric (Nat iona l Geo phy sica l Dat a Cen ter NO AA) Dr. KIH N, Eric Co-Authors:Mr. ME DV ED EV, Dmi try (Ge oph ysic al Cen ter Rus sian Aca d. Sci. ) Mr. RE DM ON, Rob (Nat iona l Geo phy sica l Dat a Cen ter NO AA) Mr. MIS HIN, Dmi try (Ins titut e of Phy sics of the Eart h Rus sian Aca d. Sci. ) Mikhail ZHIZHIN (Geophysical Center Russian Acad. Sci.) Eric KIHN (National Geophysical Data Center NOAA) Dmitry MEDVEDEV (Geophysical Center Russian Acad. Sci.) Rob REDMON (National Geophysical Data Center NOAA) Dmitry MISHIN (Institute of Physics of the Earth Russian Acad. Sci.)

50 years ago – International Geophysical Year – IGY1957 Total data volume ~ 1 Gb Exchange ~ 1 Mb/year

Yesterday – databases, Internet, web – Y2K Total data volume ~ 1 Tb Exchange ~ 1 Gb/year

Tomorrow – Electronic Geophysical Year – EGY2007 Total data volume ~ 1 Pb Exchange ~ 1 Tb/year

SPIDR mission SPIDR is a de facto standard data source on solar- terrestrial physics, functioning within the framework of the ICSU World Data Centers. It is a distributed database and application server network, built to select, visualize and model historical space weather data distributed across the Internet. SPIDR can work as a fully-functional web- application (portal) or as a grid of web-services, providing functions for other applications to access its data holdings.

SPIDR databases Currently SPIDR archives include solar activity and solar wind data, geomagnetic variations and indices, ionospheric, cosmic rays, radio-telescope ground observations, telemetry and images from NOAA, NASA, and DMSP satellites. SPIDR database clusters and portals are installed in the USA, Russia, China, Japan, Australia, South Africa, and India.

SPIDR components SPIDR portal combines the central XML metadata repository with a set of distributed data web services and data file collections. A user can search for data using metadata inventory, use persistent data basket to save the selection for the next session, and plot or download in parallel the selected data in different formats, including XML and NetCDF.

Metadata catalog of data services

Selections from different data services plotted in parallel

Satellite orbits navigator

FTP data file repository viewer

Data service: common data model serialization + URL All grid data services in SPIDR share the same Common Data Model and compatible metadata schema.

Local and/or remote data service: output data stream It is possible at the same time to use a local data source with JDBC protocol and a remote data service with SOAP protocol. The type of protocol is defined by the SPIDR configuration.

Data upload and synchronization: input data stream A database administrator can upload new files into the SPIDR databases using the web services directly or through the web portal. SPIDR databases are self-synchronizing via the web services.

SPIDR metadata “compromise” XML database (high level, low-granularity metadata) = Virtual Observatory (VxO) –Hierarchy of the data categories, key words, textual descriptions –Methods and credentials to access the data (web-service, ftp- directory) –User Forum for data quality and usability support SQL database (low level, high-granularity metadata) = Data Inventory –Parameters (name, physical meaning, units of measurement, virtual formula) or database schema –Availability and accreditation of the data (inventory) –Visualization details (type of the plot and coordinate system, scales, labels) –Input-output formats

High-level metadata search

Low-level database inventory

Simplistic for novice users to be driven by Guru Advanced user interface System administrator interface SPIDR usage tutorial Data description and help Different workflows and interfaces for different User groups SPIDR homepage

Real-time usage statisics for a given time interval User sessions per day Total ~ registered users Per database requests for plot (red) and export (blue)

Input: ground and satellite data from SPIDR data services Space weather numerical models Output: high-resolution rendering of the near-Earth space Numerical modeling on the Grid: Space Weather Reanalysis - SWR

SWR Computer Resources 768 Intel Pentium 4 Xeon Nodes (Dual 2.2 GHz Processors) Myricom Myrinet CLOS64 (2.4 Gbs) ADIC Fileserve MSS (100 Tbytes) NGDC was the #2 JET user for The SWR consumed 400,000 + CPU Hours The SWR has produced over 2.5 Tb data, this exceeds all of NGDC’s non-satellite holdings! JET Supercomputer FSL/NOAA, Boulder The SWR requires a tremendous array of computer support in order to meet its goals. Challenges include sufficient CPU power, integrating distributed model runs, and storage space for input and output data sets. The SWR project makes use of shared time on FSL’s JET supercomputer as well as RAID and Tivoli based storage systems at NGDC NOAA

SPIDR integration with VxO and Grid infrastructure Two reasons to move to the Grid middleware: 1.The digital certificates for security and authentication simplify inter-site communication 2. Processing large environmental archives requires asynchronous web-services call mechanism

Some conclusions Grid (web) data services accessible from SPIDR portal and a number of clients in Java, C#, Matlab, MS Excel Near-real time IMF, ionosphere and geomagnetic data input streams Data accreditation, FTP file depositary synchronous with the database Metadata service with high-level data description and low-level data inventory Virtual Observatory and User Community functionality: forum, bookmarks, i-mail, external metadata services Integration with Web Map Services “Fork” of the SPIDR-based data resource on solid Earth “Proprietary” SPIDR common data model becomes limiting, need generic like NetCDF SPIDR as a resource on the Space Physics Grid