Data for Climate and Energy Studies Steven Worley Computational and Information Systems Laboratory NCAR.

Slides:



Advertisements
Similar presentations
Data management in SCD Steven Worley General Categories –The Mass Storage System –NCAR user file services (home directories) –Computer attached storage.
Advertisements

New Resources in the Research Data Archive Doug Schuster.
Experiments with Monthly Satellite Ocean Color Fields in a NCEP Operational Ocean Forecast System PI: Eric Bayler, NESDIS/STAR Co-I: David Behringer, NWS/NCEP/EMC/GCWMB.
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
ICOADS Archive Practices at NCAR JCOMM ETMC-III 9-12 February 2010 Steven Worley.
The NCEP operational Climate Forecast System : configuration, products, and plan for the future Hua-Lu Pan Environmental Modeling Center NCEP.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
Scientific Investigations; Support from Research Data Archives for Computing in Atmospheric Sciences October, 2001 Steven Worley National Center.
Dr Mark Cresswell Model Assimilation 69EG6517 – Impacts & Models of Climate Change.
WRF-VIC: The Flux Coupling Approach L. Ruby Leung Pacific Northwest National Laboratory BioEarth Project Kickoff Meeting April 11-12, 2011 Pullman, WA.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
EGU 2011 TIGGE, TIGGE LAM and the GIFS T. Paccagnella (1), D. Richardson (2), D. Schuster(3), R. Swinbank (4), Z. Toth (3), S.
TIGGE Archive Highlights. First Service Date ECMWF – October 2006 NCAR – October 2006 CMA – June 2007.
Details for Today: DATE:18 th November 2004 BY:Mark Cresswell FOLLOWED BY:Literature exercise Model Assimilation 69EG3137 – Impacts & Models of Climate.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
The Eta Regional Climate Model: Model Development and Its Sensitivity in NAMAP Experiments to Gulf of California Sea Surface Temperature Treatment Rongqian.
RDFS Rapid Deployment Forecast System Visit at: Registration required.
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Data to Support Ocean-Atmosphere Research NCAR Research Data Archive (RDA), Zaihua Ji, NCAR Steven Worley, NCAR Scott Woodruff,
Dataset Development within the Surface Processes Group David I. Berry and Elizabeth C. Kent.
Archive and Access Practices that Support Data Reuse and Transparency Steven Worley Doug Schuster Bob Dattore National Center for Atmospheric Research.
Describe workflows used to maintain and provide the RDA to users – Both are 24x7 operations Transition to the NWSC with zero downtime NWSC is new environment.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
What Makes a Data Archive Tick: Marrying Content and User Support Steven Worley National Center for Atmospheric Research Computational and Information.
Reducing Canada's vulnerability to climate change - ESS J28 Earth Science for National Action on Climate Change Canada Water Accounts AET estimates for.
Automated Weather Observations from Ships and Buoys: A Future Resource for Climatologists Shawn R. Smith Center for Ocean-Atmospheric Prediction Studies.
ICOADS: Update Status and Data Distribution Steven J. Worley Scott D. Woodruff Sandra J. Lubker Ziahua Ji J. Eric Freeman NCAR, NOAA/ESRL, NOAA/NCDC CLIMAR-III,
Analyzed Data Products Available from NCAR that Support Marine Climate Research JCOMM ETMC-III 9-12 February 2010 Steven Worley Doug Schuster.
Modern Era Retrospective-analysis for Research and Applications: Introduction to NASA’s Modern Era Retrospective-analysis for Research and Applications:
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Content, Discovery, and Accessibility Enhancements to the NCAR Research Data Archive Doug Schuster and Steve Worley NCAR.
APEC Climate Center Data Service System Chi-Yung Francis Tam APCC.
RDA Data Support Section. Topics 1.What is it? 2.Who cares? 3.Why does the RDA need CISL? 4.What is on the horizon?
TIGGE Data Archive and Access at NCAR November 2008 November 2008 Steven Worley National Center for Atmospheric Research Boulder, Colorado, U.S.A.
Outcomes of CLIMAR-IV DAVID I. BERRY ETMC-V, 22 – 25 JUNE 2015.
TIGGE Data Archive at NCAR 8th GIFS-TIGGE Working Group World Meteorological Organization Geneva February, 2010 Doug Schuster Steven Worley Dave.
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Steven Worley National Center for.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
TIGGE Archive Status at NCAR THORPEX Workshop and 6th GIFS-TIGGE Working Group Meetings WMO Headquarters Geneva September 2008 Steven Worley Doug.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
A Brief Introduction to CRU, GHCN, NCEP2, CAM3.5 Yi-Chih Huang.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
TIGGE Archive Access at NCAR Steven Worley Doug Schuster Dave Stepaniak Hannah Wilcox.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
RDA Data Support Section. Topics 1.What is it? 2.Who cares? 3.Why does the RDA need CISL? 4.What is on the horizon?
SCD Research Data for Ocean Observatories Steering Committee June 18, 2001 Steven Worley Scientific Computing Division Data Support Section.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Michael Burek Eric Nienhouse Steven.
1. Gridded Data Sub-setting Services through the RDA at NCAR Doug Schuster, Steve Worley, Bob Dattore, Dave Stepaniak.
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
Introduction What purpose does a data archive center serve if users can’t find or access the holdings they might need to facilitate their research discoveries?
TIGGE Archives and Access
TIGGE Data Archive and Access System at NCAR
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Development and Futures of Research Data Archives
TIGGE Data Archive at NCAR
Research Data Archives at NCAR
Key new features in the research data archive
Steven Worley, NSF/NCAR/SCD
Steven Worley, Douglas Schuster,
CISL’s Research Data Archive (RDA) : Description and Methods
Comeaux and Worley, NSF/NCAR/SCD
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Data Curation in Climate and Weather
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

Data for Climate and Energy Studies Steven Worley Computational and Information Systems Laboratory NCAR

Topics Scope of the NCAR Research Data Archive (RDA) Discovery and Access Highlights User ranked popular datasets Examples Near-term service improvements 7 May NCAR-CSM Symposium on Climate and Energy

Scope of the NCAR Research Data Archive (RDA) Focus on atmospheric, oceanographic, and related geo-sciences observational data and derived analyses.  Some weather forecast data  Do not specialize in climate prediction datasets 7 May NCAR-CSM Symposium on Climate and Energy Active stewardship program to maintain and grow the RDA for 40+ years.  Large variety, 600+ datasets, ~ 400 TB, 4M files

Discovery and Access Highlights 7 May NCAR-CSM Symposium on Climate and Energy Primary design feature for web portal Data Discovery – Find Data!

Discovery and Access Highlights 7 May NCAR-CSM Symposium on Climate and Energy Multiple Methods - simple to interoperable 1.Find the files in our lists and download Through your browser – limit 2GB We create a ‘wget’ script for you – run in background on your machine – no limit 2.You select temporal, spatial, parameter domains We build a file list for you Download options as in 1 3.Data is not online to the web – but, is on archive storage We automatically stage data to online, then download 4.You select temporal, spatial, parameter domains - we build CURL commands - you get only the grids you select About CURL Client URL Library functions Readily available on Linux OS We use HTPPS protocols – others are available Applies well to WMO GRIB data format Users modify the CURL commands and script them to perform routine data extractions from RDA

User ranked popular datasets 7 May NCAR-CSM Symposium on Climate and Energy Unique users FY09datasetsTitles 2878ds082.0, ds083.2, ds083.0NCEP FNL Operational Model Global Tropospheric Analyses 924ds090.0NCEP/NCAR Global Reanalysis Products 510ds758.0, ds759.3, ds759.2NGDC Global 2' and 5' Elevations, USGS 30 ARC-second 477 ds461.0, ds351.0 ds337.0, ds464.0,ds353.4NCEP ADP/PREPBUFR Global Surface and Upper Air Observations 358ds608.0NCEP North American Regional Reanalysis (NARR) 264ds609.2GCIP NCEP ETA model output 262ds540.1, ds540.0International Comprehensive Ocean-Atmosphere Data Set (ICOADS) 190ds744.4QSCAT/NCEP Blended Ocean Winds 173ds277.0NCEP V2.0 OI Global SST, V3.0 Extended Reconstructed Analyses 153ds335.0, ds336.0Unidata (IDD) Observations and Model Data 106ds091.0NCEP/DOE Reanalysis II 106ds552.1, ds552.0, ds556.0River Discharge Data 91ds277.3Hadley Centre Global Sea Ice and Sea Surface Temperature (HadISST) 89ds824.1, ds330.3Global Tropical Cyclone "Best Track" Position and Intensity Data, TIGGE Cyclone Tracks 72ds570.0World Monthly Surface Station Climatology 69ds314.0Global Meteorological Forcing Dataset for Land Surface Modeling 68ds900.0U.S. AFGWC Station (Surface and Upper Air) Library 61ds260.3NOCS Surface Flux Dataset v2.0 58ds285.3Japanese Subsurface Temperature And Salinity Analyses V6.7 56ds512.0CPC Global Summary of Day/Month Observations 56ds625.0Japanese 25-year Reanalysis Project 55ds578.1, ds485.0China Monthly Station Precipitation and Temperature, Daily Precip. and Monthly Soil Temperature 53ds285.0World Ocean Database and World Ocean Atlas 47ds770.0GISS Soil and Surface Slope 45ds215.0 Global Monthly Surface Temperature Anomalies ( ), Precipitation ( ), and Sea Level Pressure ( ) from the University of East Anglia Climatic Research Unit 42ds277.7NOAA OI 1/4 Degree Daily SST Analysis 42ds330.2TIGGE Near Real-time 40ds472.0TDL U.S. and Canada Surface Hourly Observations 36ds232.2Scatterometer Climatology of Ocean Winds 32ds131.1, ds131.0NOAA-CIRES Twentieth Century Global Reanalysis Version I and II 30ds260.2CORE.2 Global Air-Sea Flux Dataset 27ds885.1NCDC TD9640 U.S. Palmer Drought Indices 25ds627.0ERA-Interim Project 25ds510.0NCDC TD3200 U.S. Cooperative Summary of Day 24ds564.0Global Historical Climatology Network (GHCN) Temperature, Precipitation, Pressure 5921All DatasetsAll DSS datasets Top 30 datasets/groups FY09 ~ 6000 Unique Users Annually

One example Final Global Analysis from NOAA/NCEP  4x Daily  Updated in the RDA 1x/day  1° horizontal resolution  26 vertical pressure levels, plus surface  Series starts in 1999  Over 55 parameter fields 7 May NCAR-CSM Symposium on Climate and Energy

One example 7 May NCAR-CSM Symposium on Climate and Energy

Re-analyses 7 May 2010 NCAR-CSM Symposium on Climate and Energy 9 Table 1: Global atmospheric and oceanographic re-analyses are one of many valuable data resources provided by external organizations that employ the expertise of RDA consultants and are the most recent major reanalyses available in the Research Data Archive. Most time periods are ongoing, that is, providers continue to produce the products gong forward in time. In general, all reanalyses also have lower temporal and horizontal resolutions than those shown above. Most reanalyses also have variables on vertical model coordinate levels, as well as large numbers of surface specific fields, and vertically integrate values.

Near-term service improvements  Current and soon-to-be workflow 7 May NCAR-CSM Symposium on Climate and Energy

Complete User Community Advantages: Fast access to online data – limited part of RDA Access to all RDA content metadata Access to RDA data processing services Complete User Community Disadvantages: Slow access to MSS data – delayed mode Have to create a separate RDA account and log in Data processing requests take a long time to finish Slow download speeds for some users HPC User Community Advantages: Access to full RDA Fast computing No login required HPC User Community Disadvantages: No access to online data Use MSS as a file server No direct access to RDA metadata No direct access to RDA data processing services Require separate account to access RDA web server

HPC User Community Improvements: Fast access to full RDA Access to all RDA content metadata Access to RDA data processing services Single CISL account Single “first point of contact” Complete User Community Improvements: Fast access to full RDA Expanded data processing services available Single CISL account - no separate RDA account Faster download speeds – grid-based tools, e.g. GRIDFTP Single “first point of contact” for user support Resolved all the disadvantages New Challenges: GPFS and HPSS don’t have generic file use logging Need for metrics & services HPSS doesn’t have sophisticated file access control Some RDA assets have limited access policies Abandon a functional RDA registration system – retool a 20K+ user DB Of course, there will be more! Big transition while maintaining RDA content building and services

End  Scope of the NCAR Research Data Archive (RDA)  Discovery and Access Highlights  User ranked popular datasets  Examples  Near-term service improvements 7 May NCAR-CSM Symposium on Climate and Energy