12.09.2002M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut.

Slides:



Advertisements
Similar presentations
Std-doi Publication of Climate Data at WDCC DataCite Summer Meeting 7./8. June 2010 Publication of climate data Heinke Höck World Data Center for Climate.
Advertisements

Data management in SCD Steven Worley General Categories –The Mass Storage System –NCAR user file services (home directories) –Computer attached storage.
WMO Core Profile of the ISO Metadata Standard Steve Foreman Chair IPET-Metadata Implementation.
BADC Workshop 2: BADC Services to Data Suppliers Royal Met. Soc. Conference – 14 September 2005 Ag Stephens et al.
1 CEOS/WGISS20 – Kyiv – September 13, 2005 Paul Kopp SIPAD New Generation: Dominique Heulet CNES 18, Avenue E.Belin Toulouse Cedex 9 France
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Data.
M.Lautenschlager (WDCC/MPI-M) / / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO.
CERA / WDCC Hannes Thiemann Max-Planck-Institut für Meteorologie Modelle und Daten zmaw.de NCAR, October 27th – 29th, 2008.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
M.Lautenschlager (WDCC / MPI-M) / / 1 WS Spatiotemporal Databases for Geosciences, Biomedical sciences and Physical sciences Edinburgh, November.
German Cluster of WDCs for Earth System Research - Entwurf - Michael Lautenschlager 1, Michael Diepenbroek 2, Hannes Grobe 2, Michael Bittner 3, Jens Klump.
M. Diepenbroek (MARUM), M. Lautenschlager (MPI-M), E. Paliouras (DLR), H. Grobe (AWI) CODATA General Assembly, Berlin World Data Center Cluster.
M.Lautenschlager (WDCC / MPI-M) / / 1 GO-ESSP at LLNL Livermore, June 19th – 21st, 2006 World Data Center Climate: Status and Portal Integration.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth system science K. Ronneberger, DKRZ, Germany.
M. Lautenschlager (M&D/MPIM)1 The CERA Database Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie Workshop "Definition.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
Inter-American Workshop on Environmental Data Access Panel discussion on scientific and technical issues Merilyn Gentry, LBA-ECO Data Coordinator NASA.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
Creating Documentation and Metadata: Metadata for Discovery Lola Olsen 1, Tyler Stevens 2, 1 National Aeronautics and Space Administration (NASA) 2 Wyle.
F. Toussaint (WDCC, Hamburg) / / 1 CERA : Data Structure and User Interface Frank Toussaint Michael Lautenschlager World Data Center for Climate.
Archival information system ARHiNET Croatian national archival information system Vlatka Lemić Croatian State Archives, Croatia.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
World Data Center for Marine Environmental Sciences.
Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute for Meteorology German Climate Computing Centre (DKRZ)
Managing the Impacts of Programmatic Scale and Enhancing Incentives for Data Archiving A Presentation for “International Workshop on Strategies for Preservation.
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Wolfgang.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
M.Lautenschlager (WDCC, Hamburg) / / 1 ICSU World Data Center For Climate Semantic Data Management for Organising Terabyte Data Archives Michael.
Opendap dev - meeting, Boulder, Feb 2007 OPeNDAP infrastructure in European Operational Oceanography T Loubrieu (IFREMER) T Jolibois (CLS)
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology Pasadena, California EDGE: The Multi-Metadata.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
Michael Lautenschlager, Hannes Thiemann, Frank Toussaint WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Joachim Biercamp, Ulf Garternicht,
H. Thiemann (M&D) / / 1 Hannes Thiemann M&D Statusseminar, 22. April 2004.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
INFSO-RI Enabling Grids for E-sciencE A service oriented framework to create, manage and update metadata for earth system science.
1-2-3 February 2006 –Page 1 Mersea Integrated System How to improve Access/Downloading services ? How far do we go in terms of standardization ?
WGISS and GEO Activities Kathy Fontaine NASA March 13, 2007 eGY Boulder, CO.
The Repository of the World Data Centre for Climate Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie Repositories in Research.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth System Science S. Kindermann, DKRZ, Germany.
PSI Meta Data meeting, Toulouse - 15 November The CERA C limate and E nvironment data R etrieval and A rchiving system at MPI-Met / M&D S. Legutke,
MarLIN - CSIRO Marine Laboratories Information Network.
H. Widmann (M&D) Data Discovery and Processing within C3Grid GO-ESSP/LLNL / June, 19 th 2006 / 1 Data Discovery and Basic Processing within the German.
Lautenschlager + Thiemann (M&D/MPI-M) / / 1 Introduction Course 2006 Services and Facilities of DKRZ and M&D Integrating Model and Data Infrastructure.
MarLIN: a research data metadatabase for CSIRO Marine Research Tony Rees Divisional Data Centre CSIRO Marine Research, Hobart contact:
Trials and Tribulations of a Small Archive Presented at the THIC Conference, NCAR, Boulder CO June 30, 2004 Presented at the THIC Meeting at the National.
Create XML from a template Browse available records WDCC Metadata Generation with GeoNetwork Hans Ramthun, Michael Lautenschlager, Hans-Hermann Winter.
The Proliferation of Metadata Standards and the Evolution of NASA’s Global Change Master Directory (GCMD) Standard for Uses in Earth Science Data Discovery.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
Center of Excellence for Oceans and Human Health at the Hollings Marine Laboratory Metadata Development in Support of the Oceans and Human Health Tidal.
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
AP7/AP8: Long-Term Archival of CMIP6 Data
World Conference on Climate Change October 24-26, 2016 Valencia, Spain
Flanders Marine Institute (VLIZ)
Design central EMODnet portal Objectives and Technical description Initial draft prepared by the Flanders Marine Institute.
School of Information Studies, Syracuse University, Syracuse, NY, USA
Data Management Components for a Research Data Archive
Presentation transcript:

M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie WDC Review Hamburg, ) Climate and Environmental data Retrieval and Archiving

M. Lautenschlager (M&D/MPIM)2 Content CERA Concept User Interface Data Content WDC Integration

M. Lautenschlager (M&D/MPIM)3 CERA Accessing Countries

M. Lautenschlager (M&D/MPIM)4 File Access Problems Missing Data Catalogue Directory structure of the Unix file system is not sufficient to organise millions of files Data are not stored application-oriented Raw data contain time series of 4D data blocks Access pattern is time series of 2D fields Lack of experience with climate model data Problems in extracting relevant information Year Moderate Increase210 TB650 TB1620 TB2670 TB3720 TB Linear Increase210 TB1270 TB4260 TB7580 TB10910 TB

M. Lautenschlager (M&D/MPIM)5 CERA Concept: Semantic Data Management (I) Data catalogue and pointer to Unix files Enable search and identification of data Allow for data access as they are (II) Application-oriented data storage –Time series of individual variables are stored as BLOB entries in DB Tables Allow for fast and selective data access –Storage in standard file-format (GRIB) Allow for application of standard data processing routines (PINGOs)

M. Lautenschlager (M&D/MPIM)6 CERA Database: 7.1 TB ( ) * Data Catalogue * Processed Climate Data * Pointer to Raw Data files Mass Storage Archive: 210 TB neglecting Security Copies ( ) CERA Database System Web-Based User Interface Catalogue Inspection Climate Data Retrieval DKRZ Mass Storage Archive InternetAccess Current database size is Terabyte Number of experiments: 279 Number of datasets: Number of blob within CERA at 06-SEP-02: Typical BLOB sizes: 17 kB and 100 kB Number of data retrievals: 1500 – 5500 / month

M. Lautenschlager (M&D/MPIM)7 CERA-2 Data Model Complete with respect to IEEE’s Reference Model for Metadata (Bretherton, 1994) –Browse, Search and Retrieval –Ingest, Quality Assurance, Reprocessing –Application to Application Transfer –Storage and Archive Supports interoperability due to inclusion of international standards –Directory Interchange Format (NASA, 1998) –FGDC Metadata Content Standard (FGDC, 1996) –ISO Metadata Standard for Geographic Information (ISO 19115) Reference –“The CERA-2 Data Model” (DKRZ-Report No. 15, 1998) –URL:

M. Lautenschlager (M&D/MPIM)8 CERA-2 Data Model Blocks Metadata Entry This is the central CERA Block, providing information on the entry's title type and relation to other entries the project the data belong to a summary of the entry a list of general keywords related to data creation and review dates of the metadata Additionally: Modules and Local Extensions Module DATA_ORGANIZATION (grid structure) Module DATA_ACCESS (physical storage) Local extension for specific information on (e.g.) data usage data access and data administration Coverage Information on the volume of space-time covered by the data Reference Any publication related to the data togehter with the publication form Status Status information like data quality, processing steps, etc. Distribution Distribution information including access restrictions, data format and fees if necessary Contact Data related to contact persons and institutes like distributor, investigator, and owner of copyright Parameter Block describes data topic, variable and unit Spatial Reference Information on the coordinate system used

M. Lautenschlager (M&D/MPIM)9 Data Model Functions The CERA2 data model … –allows for data search according to discipline, keyword, variable, project, author, geographical region and time interval and data retrieval. –allows for specification of data processing (aggregation and selection) without attaching the primary data. –is flexible with respect to local adaptations and storage of different types of geo-referenced data. –is open for cooperation and interchange with other database systems.

M. Lautenschlager (M&D/MPIM)10 Data Structure in CERA Level 1 Level 2 Experiment Description Pointer to Unix-Files Dataset 1 Description Dataset n Description BLOB Data Table BLOB Data Table

M. Lautenschlager (M&D/MPIM)11 User Interface Structure

M. Lautenschlager (M&D/MPIM)12 User Interface Signed Java Applet: Catalogue Inspection Climate Data Retrieval IPCC DDC

M. Lautenschlager (M&D/MPIM)13 CERA Data Content Climate Model Data (Continuous stream of new data) –Local climate model production experiments for present and future but also for past climates IPCC DDC (Data Distribution Centre) –Archive and dissemination of selected data from international climate scenario calculations (IS92a and SRES) –Will be continued for the Forth Assessment Report Project Support (encourage Good Scientific Practice) –Archive and dissemination of project data HOAPS (Hamburg Ocean Atmosphere Parameters and Fluxes from Satellite Data) CARIBIC (Civil Aircraft for Regular Investigation of the Atmosphere Based on an Instrumentation Container), MPI Mainz

M. Lautenschlager (M&D/MPIM)14

M. Lautenschlager (M&D/MPIM)15 CERA Data Content Observational Data –Model related observations ERA15 (ECMWF) NCEP/NCAR 40 Year Reanalysis ERA40  in preparation –Instrumental data WOCE (World Ocean Circulation Experiment): field measurements and products are transferred from BSH –Earth observations Access to SST's from NOAA AVHRR in cooperation with DFD/DLR (distributed archive)  preparation for WDC cooperation

M. Lautenschlager (M&D/MPIM)16 CERA Data: Jan. Temp.

M. Lautenschlager (M&D/MPIM)17 CERA Data: Jan. Wind (2 x 250 MB)

M. Lautenschlager (M&D/MPIM)18 Integration of WDC on Climate WDC will be part of the operational CERA DB system –Requires only little additional work –Consumes only little hardware resources –All freely available data will be part of the WDC on Climate CERA DB / WDC on Climate fit requirements from interdisciplinary data access, non experienced users and small network bandwidth –Assistance and training, visitor programme –Small data units, media copies on request Data import in cooperation with producers Data dissemination and long-term storage is maintained by the CERA DB system

M. Lautenschlager (M&D/MPIM)19 Future Distributed data archive –Cooperation with WDC for Earth Observation (Oberpfaffenhofen) and WDC for Marine Environmental Sciences (Bremen) WDC for Paleoclimatolology (Boulder) –Sharing data holdings and responsibilities for climate data International cooperation and opening data archives –Federation of WDC's (related to Climate Research) –Web-based WDC data portal