Climate Analytics on Global Data Archives Aparna Radhakrishnan 1, Venkatramani Balaji 2 1 DRC/NOAA-GFDL, 2 Princeton University/NOAA-GFDL 2. Use-case 3.

Slides:



Advertisements
Similar presentations
Earth System Curator Spanning the Gap Between Models and Datasets.
Advertisements

Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
CLIMATE SCIENTISTS’ BIG CHALLENGE: REPRODUCIBILITY USING BIG DATA Kyo Lee, Chris Mattmann, and RCMES team Jet Propulsion Laboratory (JPL), Caltech.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
NSF NCAR | NASA GSFC | DOE LANL ANL | NOAA NCEP GFDL | MIT | U MICH First Field Tests of ESMF GMAO Seasonal Forecast NCAR/LANL CCSM NCEP.
The Earth System CoG Collaboration Environment Sylvia Murphy and Cecelia DeLuca (NOAA/CIRES), and Luca Cinquini (NASA/JPL) AGU Ocean Sciences February.
1 NOAA’s Environmental Modeling Plan Stephen Lord Ants Leetmaa November 2004.
IS-ENES [ees-enes] InfraStructure for the European Network for Earth System Modelling IS-ENES will develop a virtual Earth System Modelling Resource Centre.
Project Overview GMAO Seasonal Forecast NCAR/LANL CCSM NCEP Forecast GFDL FMS Suite MITgcm NASA GMAO Analysis Climate Data Assimilation.
CORDEX Scope, or What is CORDEX?  Provide a set of regional climate scenarios (including uncertainties) covering the period , for the majority.
Data Merge Examples, Toolsets for Airborne Data (TAD): Customized Data Merging Function ASDC Introduction The Atmospheric Science Data Center (ASDC) at.
Presented by The Earth System Grid: Turning Climate Datasets into Community Resources David E. Bernholdt, ORNL on behalf of the Earth System Grid team.
Managing Sustainability Solutions Initiative (SSI) data Kate Beard, Steve Cousins University of Maine NERACOOS/NECOSP Data Management Workshop, Sept. 26,
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
RDFS Rapid Deployment Forecast System Visit at: Registration required.
DOE BER Climate Modeling PI Meeting, Potomac, Maryland, May 12-14, 2014 Funding for this study was provided by the US Department of Energy, BER Program.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
A Global Agriculture Information System Zhong Liu 1,4, W. Teng 2,4, S. Kempler 4, H. Rui 3,4, G. Leptoukh 3 and E. Ocampo 3,4 1 George Mason University,
Mathematics and Computer Science & Environmental Research Divisions ARGONNE NATIONAL LABORATORY Regional Climate Simulation Analysis & Vizualization John.
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative External Observatory Integration Christopher Mueller, Matt Arrott, John Graybeal Life Cycle.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
The New Zealand Institute for Plant & Food Research Limited Use of Cloud computing in impact assessment of climate change Kwang Soo Kim and Doug MacKenzie.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
- EGU 2010 ESSI May Building on the CMIP5 effort to prepare next steps : integrate community related effort in the every day workflow to.
Climate Change Working Group (CCWG) July, 2004 Co-chairs: Gerald A. Meehl, Ben Santer, and Warren Washington.
The NOAA Operational Model Archive and Distribution System NOMADS The NOAA Operational Model Archive and Distribution System NOMADS Dave Clark for Glenn.
- Vendredi 27 mars PRODIGUER un nœud de distribution des données CMIP5 GIEC/IPCC Sébastien Denvil Pôle de Modélisation, IPSL.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
UAF/OSMC Presenters: Kevin O’Brien and Eugene Burger Abstract: Kevin O’Brien and Eugene Burger are from NOAA’s Pacific Marine Environmental Laboratory.
Regional Climate Model Evaluation System based on satellite and other observations for application to CMIP/AR downscaling Peter Lean 1, Jinwon Kim 1,3,
Improving Data Catalogs with Free and Open Source Software Kevin O’Brien University of Washington Joint Institute for the Study of the Atmosphere and Ocean.
The Unified Access Framework for Gridded Data … the 1 st year focus of NOAA’s Global Earth Observation Integrated Data Environment (GEO-IDE) Steve Hankin,
CTB computer resources / CFSRR project Hua-Lu Pan.
1 Adventures in Web Services for Large Geophysical Datasets Joe Sirott PMEL/NOAA.
1 Accomplishments. 2 Overview of Accomplishments  Sustaining the Production Earth System Grid Serving the current needs of the climate modeling community.
1 Overall Architectural Design of the Earth System Grid.
Product-Generation in ESG: some explorations of the user experience and discussion of implications for the design of ESG Steve Hankin & Roland Schweitzer.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Welcome to the PRECIS training workshop
LAS and THREDDS: Partners for Education Roland Schweitzer Steve Hankin Jonathan Callahan Joe Mclean Kevin O’Brien Ansley Manke Yonghua Wei.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Data Requirements for Climate and Carbon Research John Drake, Climate Dynamics Group Computer.
ESMF and the future of end-to-end modeling Sylvia Murphy National Center for Atmospheric Research
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech 5 th GO-ESSP Community Meeting.
GO-ESSP The Earth System Grid The Challenges of Building Web Client Geo-Spatial Applications Eric Nienhouse NCAR.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
The NOAA Operational Model Archive and Distribution System NOMADS CEOS-Grid Application Status Report Glenn K. Rutledge NOAA NCDC CEOS WGISS-19 Cordoba,
AOLI 2015 The NMME Experience: A Research Community Archive Lessons learned from Climate Model data archive and use AOLI Meeting 2015 Eric Nienhouse NCAR.
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
Center for Satellite Applications and Research (STAR) Review 09 – 11 March 2010 Image: MODIS Land Group, NASA GSFC March 2000 STAR Enterprise Synthesis.
Sea Ice Component Description Number of historical ensemble members
Data Browsing/Mining/Metadata
AP7/AP8: Long-Term Archival of CMIP6 Data
World Conference on Climate Change October 24-26, 2016 Valencia, Spain
Climate Data Analytics in a Big Data world
WORLD CLIMATE RESEARCH PROGRAMME
National Center for Atmospheric Research
Production and use of regional climate model projections at the Swedish Meteorological and Hydrological Institute Erik Kjellström Rossby Centre,
Task Team for Intercomparison of ReAnalysis
Precipitation variability over Arizona and
Metadata Development in the Earth System Curator
Presentation transcript:

Climate Analytics on Global Data Archives Aparna Radhakrishnan 1, Venkatramani Balaji 2 1 DRC/NOAA-GFDL, 2 Princeton University/NOAA-GFDL 2. Use-case 3. User Interface 4. Future Work Analysis products bundled with Live Access Server: The rapidly growing climate modeling enterprise challenges us in different facets such as the availability and utilization of computing resources, data storage, usability and maintenance, analysis capabilities etc. With the availability of high- resolution climate data, the data volume gears up significantly, making it a challenge to apply user- developed climate analytics. The CMIP5 project is a significant example for such a scenario. This scientific project was designed by a team spanning twenty or more modeling centers across the planet: many of the largest supercomputers in the world were given over many months to the running of the experiments; the data is now stored in a distributed archive of nodes governed by the Earth System Grid Federation (ESGF), with a core measuring more than 1 PB, and a total of about 20 PB. With the global availability of the data archives, there is an explosion of interest in climate analytics and research work. Often, there is a need for replicating a specific analysis suite to analyze the behavior of different climate datasets, compare inter-model- experiments to evaluate and address the specifics. In this presentation, we provide an innovative solution to deploy user-developed climate analytics on CMIP5 ESGF federated archive in the form of a web service. Originally, climate analytics were applied to GFDL datasets using the Curator database to locate the internal resources/variables, etc. Later, the approach significantly transformed to being able to apply climate analytics on ESGF’s global data archives. 1. Introduction “Climate analytics on global data archives” is an ongoing work under the auspices of the ExArch Project. This project is principally a framework for the scientific interpretation of multi-model ensembles at the peta-and exa-scale. It applies a strategy, a prototype infrastructure and demonstration usage examples in the context of the imminent CMIP5 archive. The work is sponsored by a coordinated effort among science agencies of the G8 countries, including NSF. Say, there is an innovative analysis script developed by a user, who has developed it using local analysis resources and some small subset of data downloaded from the CMIP5 federated archive. Her analysis is widely cited, and there is interest worldwide in replicating her study on other datasets from the archive. How is this to be achieved? Participating Institutions BCC, BNU, CCMA, CMCC CNRM-CERFACS, COLA- CFS, CSIRO-QCCCE, INM, IPSL, LASG-CESS, LASG- IAP, MIROC, MPI-M, MRI, NASA-GISS, NASA- GMAO, NCAR, NCC, NOAA-GFDL, NOAA- NCEP.. CMIP5 models ̴ = 52 climate models Experiments Long-term, near-term, atmosphere-only (Total ̴ = 116 experiments) Don’t forget the ensemble members! Frequency 3-hourly, 6-hourly, climatology monthly mean, daily, fixed, monthly, sub-hourly, yearly Realms Aerosol, atmosphere, land, land-ice, ocean, sea-ice, ocean Biogeochemistry Climate fields: Total ̴ = 550 CMIP5 Tree 2.1 Bring analysis to data: Input: Get dataset Identifiers (D1,D2) for the experiments E1,E2 in comparison. (This includes model-name, experiment, ensemble_member, frequency, realm, CMIP table) Eg: NOAA-GFDL.GFDL-ESM2G.historical.mon.atmos.Amon.r1i1p1 Input: Get start_time (t 0 E1,t 0 E2) and end_time (t 1 E1,t 1 E2) for experiments E1, E2 in comparison – as input Input: Get CMIP5 variable name (V) to be analyzed Input: Get climate analytics plot type to be applied to datasets (D1,D2) THREDDS CATALOG FEEDER (python-based) NetCDF files Analysis Products 1. Crawls through ESGF Root THREDDS catalogs and locates datasets D1,D2. 2. Fetches the OPeNDAP aggregation URL for variable “V “ in datasets D1,D2. 3. Prepares arguments to be passed to the analysis script templates along with the start_time(s) (t 0 E1,t 0 E2) and end_time(s) (t 1 E1,t 1 E2). 4. Runs the analysis scripts (any language. Currently, tested with Ferret) server-side. 5. Sends analysis products back to Thredds Catalog feeder. 6. Throws exceptions if the timer ranges are not available for specified experiments or if specified variables are not part of a given experiment d map comparisons for tropical oceanic fields 4.2. Statistical Downscaling 4.3 And many more.. Fig. 1. CMIP5 Tree Step 1: Select the data sets to be compared Step 2: Select the plot type Step 3: Select the variable and/or region Step 4: Select the year range to be compared Acknowledgement: This work was partly funded by the international ExArch project under the G8 initiative by National Science Foundation Award Many thanks to: ExArch, Andrew Wittenberg (GFDL), Roland Schweitzer (Weathertop Consulting/PMEL) Fig. 2. Bring analysis to data Fig. 4. precip map comparison Fig. 5. Statistical Downscaling overview Step 5: View output Fig. 3. Analysis output from LAS