The Modeling Circle Courtesy M. Lautenschlager, DKRZ.

Slides:



Advertisements
Similar presentations
Creating Institutional Repositories Stephen Pinfield.
Advertisements

IUFRO International Union of Forest Research Organizations Eero Mikkola The Increasing Importance of Metadata in Forest Information Gathering NEFIS Symposium.
Develop an Information Strategy Plan
Office of the DVC (S&E) Students as Change Agents Dr Cassandra Saunders Student Evaluation, Review and Reporting Unit (SERRU) Students Matter Forum 2013.
Introduction on WP7/WP9 Dominique PORTE 29/05/2008 Menu What is WP7? What is WP9? Goal of the brainstorming Introduction on WP7/WP9.
Earth System Curator Spanning the Gap Between Models and Datasets.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech.
Why, what were the idea ? 1.Create a data infrastructure, 2.Data + the knowledge products that are produced on the basis of data a) Efficiant access to.
Episode 3 / CAATS II joint dissemination event Lessons Learnt Episode 3 - CAATS II Final Dissemination Event Philippe Leplae EUROCONTROL Episode 3 Brussels,
Alternate Software Development Methodologies
Policy recommendations for wider implementation of telemedicine Peeter Ross, MD, PhD e-Health expert, Estonian eHealth Foundation, Estonia.
Benchmarking as a management tool for continuous improvement in public services u Presentation to Ministry of Culture of the Russian Federation u Peter.
New DFG Information Infrastructure Projects Dr. Stefan Winkler-Nees; Birmingham, 28. March 2011 New DFG Information Infrastructure Projects.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Database System Development Lifecycle Transparencies
EC Review – 01/03/2002 – G. Zaquine – Quality Assurance – WP12 – CS-SI – n° 1 DataGrid Quality Assurance Gabriel Zaquine Quality Engineer - WP12 – CS-SI.
US NITRD LSN-MAGIC Coordinating Team – Organization and Goals Richard Carlson NGNS Program Manager, Research Division, Office of Advanced Scientific Computing.
OSSE School Improvement Data Workshop Workshop #4 June 30, 2015 Office of the State Superintendent of Education.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
Eric Guilyardi (LOCEAN/IPSL and Univ. Reading) and the Metafor team Common Metadata for Climate Modelling Digital Repositories IS-ENES kick-off meeting.
GEO Work Plan Symposium 2012 ID-05 Resource Mobilization for Capacity Building (individual, institutional & infrastructure)
“Filling the digital preservation gap” an update from the Jisc Research Data Spring project at York and Hull Jenny Mitcham Digital Archivist Borthwick.
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
The Preparatory Phase Proposal a first draft to be discussed.
The Digital Library for Earth System Education: A Community Resource
CORDEX Scope, or What is CORDEX?  Provide a set of regional climate scenarios (including uncertainties) covering the period , for the majority.
CIM – The Common Information Model in Climate Research
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
OPERATIONAL GUIDELINES Ensuring Ownership of PARSEL by Partners.
Luisa Franconi Integration, Quality, Research and Production Networks Development Department Unit on microdata access ISTAT Essnet on Common Tools and.
A new start for the Lisbon Strategy Knowledge and innovation for growth.
InWEnt | Qualified to shape the future1 Internet based Human Resource Development Management Platform Human Resource Development Programme in Natural Disaster.
SAON Data Management Workshop Report June 7-8, 2010, Norway Recommendations (Extracted by Jan René Larsen, 25 September 2012),
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Semantic Web services Interoperability for Geospatial decision.
Database System Development Lifecycle 1.  Main components of the Infn System  What is Database System Development Life Cycle (DSDLC)  Phases of the.
“ BIRD Project“ 1 Broadband Access, Innovation & Regional Development” Broadband Access, Innovation & Regional Development” Project Description Ulrich.
CLARIN work packages. Conference Place yyyy-mm-dd
Current Situation and CI Requirements OOI CyberInfrastructure Science User Requirements Workshop: San Diego January 23-24, 2008.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
The Future ENES Strategy – Toward a Foresight Document Jochem Marotzke Max Planck Institute for Meteorology (MPI-M) German Climate Computing Centre (DKRZ)
The ToolBox Product Management & Product Development Framework Welcome to the Product Management & Product Development “Good Practice” workshop Facilitated.
LAW&ICT Shared Virtual Campus, Zaragoza Meeting, October model for technical support to LAW&ICT Shared Virtual Campus: a proposal Selahattin Kuru.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
NA2 objectives The „v.E.R.C.“ (virtual Earth System Resource Center) v.E.R.C is: A (technical) infrastructure to improve the utilization of the ESM core-infrastructure,
1 Direction scientifique Networks of Excellence objectives  Reinforce or strengthen scientific and technological excellence on a given research topic.
ADP SUPPORT IN UGANDA BUILDING A NATIONAL DATA ARCHIVE Presented by Kizito Kasozi Director Information Technology Uganda Bureau of Statistics PARIS21.
BalticGrid-II Project The Second BalticGrid-II All-Hands Meeting, Riga, May, Joint Research Activity Enhanced Application Services on Sustainable.
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
PROmoting Local INNOVAtion in ecologically-oriented agriculture and NRM What can be done with farmers’ innovations?
DOE Data Management Plan Requirements
WP4 - Strengthening the European Network on Earth System Modelling IS-ENES kick-off meeting – March 30-31, Paris Partners and general objectives CERFACS.
Platform for African-European partnership in agricultural research for development (Phase II) Nairobi WP5 meeting - PAEPARD II 4YE – Thierry - February.
Summary of HEP SW workshop Ian Bird MB 15 th April 2014.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech 5 th GO-ESSP Community Meeting.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Grant Agreement n
Session 2: Developing a Comprehensive M&E Work Plan.
FISCO2 – Financial and Scientific Coordination Work Package dedicated to ENSAR2 management WP leader: Ketel Turzó WP deputy: Sandrine Dubromel ENSAR2 Management.
Capacity Building in: GEO Strategic Plan 2016 – 2025 and Work Programme 2016 Andiswa Mlisa GEO Secretariat Workshop on Capacity Building and Developing.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
Approaches and Challenges in Managing Persistent Identifiers
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Data Ingestion in ENES and collaboration with RDA
Institutional role in supporting open access, open science, open data
RECARE set-up Rudi Hessel on behalf of coordination team
ESS.VIP VALIDATION An ESS.VIP project for mutual benefits
Statistical Information Technology
Presentation transcript:

The Modeling Circle Courtesy M. Lautenschlager, DKRZ

Motivation for WP4 of ISENES2 „ The objective of this work package is to provide networking activities to increase the pace of climate science employing modelling by sharing best practice in software environments for Earth System Models and encouraging more sharing of selected codes within the climate community.“ task 1: workflow and post processing task 2: configuration management task 3: meta-data capture task 4: coupling

Tools Required Workflow and post processing tools –Increasing complexity –S2D –Data volumes –Downstream users Configuration management tools –Efficient Mgt of ✴ Scientific models ✴ Technical codes ✴ Experiment definitions Meta-data capture tools –Description of ✴ Experiments ✴ Simulations ✴ Data Coupling tools –For scientific codes to be coupled in new combinations for ESMs

ISENES2 Workshops Workflows 1 + 2; MPI-M / MO 1. A first workshop will identify issues and opportunities to be explored in more depth. June The second workshop will also include discussion on available post processing solutions in use in the community and how they are integrated into workflows. Configuration Management 1 + 2; MO / MPI-M 1.A first WS to start community evaluation of FCM by experienced (IPSL) and novice (MPI-M) users. Sep A second WS for the evaluation and dissemination of the findings MD-Generation 1 + 2; DKRZ 1.The aim will be to encourage investment in software and working processes that will allow more comprehensive meta-data to be collected more efficiently. Further, the development of workflow and diagnostic solutions will be influenced by the meta-data requirements. Jan 21/ To support the second workshop, the Met Office and DKRZ will develop documents that identify key interfaces between the meta-data and the experiment definition and modelling processes, and explore design solutions.

Why are you here? –Discuss MD generation „on the fly“ –Some centers don’t/can’t(?) do that –Those that can do it, do it differently –Should learn from each other!

Motivation from DoW Networking will lower the following barriers for the use of common software solutions for workflow and post processing (task 1), configuration management (task 2), meta- data capture (task 3) and coupling (task 4): Technical and human resources in most institutions are stretched and often applied to extend existing legacy solutions rather than to take the more risky approach of trying new solutions. Also, developments are targeted at local problems. Even when they have the potential to find more generic application, they are not advertised as such and are not used more widely. This work package will provide opportunity for advertising solutions to a wider community by sharing experience in software that deals with the modelling environments. The lack of understanding of longer-term benefits of available solutions. If some participants are able to quantify the benefits of apparently risky or large changes, other participants are more likely to invest thus spreading best practice. There are large overheads of software evaluation in this field. Any software needs to be adapted to meet specific, complex local needs. This work package supports evaluations of software for all the ESM environment tools listed above.

Motivation cont’d For all tasks 1 to 3, the objective of this work package is to facilitate best practices and software sharing by supporting software evaluations that will lead to well prepared, in depth workshops allowing partners to understand the opportunities for shared software solutions. It will fund teams to evaluate software and effort to support the evaluators. This bottom-up approach will be complemented by the management engagement proposed in NA1*. * Governance and strategy activities will be developed in strong link with other work packages such as NA2 on future HPC technologies and model developments, NA3 on possible common developments on software and development of governance methods, and NA4 on data archive governance.

Task Description from DoW IS-ENES2 Significant experience has been gained in CMIP5 and related exercises in providing meta-data to describe ESM experiment sets. A number of sites are recognising the need to build meta-data capture into the heart of the ESM experiment process and to drive data provision exercises; this needs to be supported by both software and processes. This networking activity will promote the sharing of experiences and designs in this emerging area through two workshops organised by DKRZ. The aim will be to encourage investment in software and working processes that will allow more comprehensive meta-data to be collected more efficiently. Further, the development of workflow and diagnostic solutions will be influenced by the meta-data requirements. To support the workshop, the Met Office and DKRZ will develop documents that identify key interfaces between the meta-data and the experiment definition and modelling processes, and explore design solutions.

Deliverable/Milestones from DoW Deliverable: –Meta-data capture final workshop report (DKRZ) Milestones: –MS42: Initial workshop on meta-data generation during experiments, mth 9 – MS46: Final workshop on meta-data generation during experiments, mth 37

Minutes etc. Issues –Compare schemas of WFs/FWs (IPSL, MOHC, GFDL) –Reasons to collect Provenance Data: ✴ Robustness (restart possibility) ✴ Good scientific practice ✴ Visibility of provenance data collection process needs discussion ✴ The numerical logbook ✴ Usage and development of CVs ✴ Usage and development of PIDs Time line –Next MD-WS needs to be planned for m37(ISENES2) = May 2016 –Meta-data capture final workshop report for m40(ISENES2) = Aug 2016 Next Steps –Produce Minutes of this meeting –Idea: Accompany CMIP6 prep work by a paper

Idea CMIP6: –Modeling Infrastructure Panel planned ✴ tasked with establishing and maintaining standards for model data sharing ✴ to create a document outlining the technologies necessary for operation of a global data infrastructure, and ✴ the standards necessary for maintaining these technologies. –The document will outline a protocol for creating and running a MIP. Produce a basis for this document –Possible/probably with the other WS in ISENES ~ “Best practices for MD generation and workflows in ESM experimentation“

PID PID to mark CIM documents! PID make versioning easier …can be on file or directory level  needs to be decided

Recommendations There should be a means of informing users and data centres of deviations from the recommendations on formats, headers, etc. Agreements on formats etc: - earlier to publish - better to be communicated (especially verification checks)! Checks on data should result in warnings, not in errors as right or wrong depends on the usage.

…to be put out clear: Data centres cannot guarantee to publish data that do not follow the technical requirements

MD capture One ore more DB aside – filled during the data production process Aside or not aside: Social problem, „cleaner“ social engeneering (access, responsability, …) when aside  2 or 3 DB per system Data into file headers – collected and filled in a DB later