M. Stockhause 1, G. Levavasseur 2, K. Berger 1 1 Deutsches Klimarechenzentrum (DKRZ) 2 Institute Pierre Simon Laplace (IPSL) ESGF-QCWT Quality Control.

Presentation transcript:

ESGF-QCWT: Quality Control WT Report
M. Stockhause (1), G. Levavasseur (2), K. Berger (1)
(1) Deutsches Klimarechenzentrum (DKRZ)
(2) Institute Pierre Simon Laplace (IPSL)

QCWT Overview
M. Stockhause, G. Levavasseur, K. Berger: ESGF-QCWT Report, ESGF Conference 2015

The QCWT aims to improve the quality of ESGF user services with regard to external documentation.

Integration of data-related information into the ESGF CoG portal:
- Referenced Metadata: completely independent developments, loosely connected
- Registered Metadata: core metadata registered in ESGF

Implementation of WIP concepts on:
- Data Citation: citation information is accessible in the ESGF portal based on the external citation service
- Errata Annotations: errata information is accessible in the ESGF Service based on information registered in the Handle Service (PIDs) and in an external "Issue Manager"

[Diagram: workflow with lanes DATA, REGISTERED METADATA (Errata) and REFERENCED METADATA (Data Citation). Steps shown: Data Preparation (CMOR); Metadata Ingest (external repository); Core Metadata Registration; Data Ingest (ESGF data node); ESGF Publication (ESGF index node); Metadata Publication; Long-Term Archival (stable data enriched with metadata).]

QCWT Members

Name                    Affiliation   Responsibility
Martina Stockhause      DKRZ          Data Citation Service
Guillaume Levavasseur   IPSL          Errata Service
Katharina Berger        DKRZ          ESGF Implementation
Jeff Painter            LLNL/DOE
Claire Trenham          ANU
Jingbo Wang             ANU
Dean N. Williams        LLNL/DOE

Working Packages

- Persistence of ESGF metadata (Katharina with ESGF-RVWT)
- Integration of the Versioning Approach (all)
- Errata Service (Guillaume)
- Data Citation Service (Martina)
- Display information from external repositories for ESGF users (Guillaume: frontend, Katharina: backend)
- Communication (Martina)

Report and Status for 2015 (1)

Persistence of Metadata and Version Support (Katharina with ESGF-RVWT)
- Concept development at the ESGF Publication Sprint in Paris (October 2015)
- The name of the version directory is used as the default version for ESGF publication
- Persistence of ESGF metadata is in progress
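The idea of taking the default version from the version directory name can be illustrated with a minimal sketch. The slides do not specify the directory naming, so the pattern below (a directory named "v" followed by digits) and the function name `default_version` are assumptions made for illustration only, not the ESGF publisher's implementation.

```python
import re
from pathlib import PurePosixPath
from typing import Optional

# Assumed convention (not stated in the slides): a version directory
# is named "v" followed by digits, e.g. "v20151012".
VERSION_DIR = re.compile(r"^v(\d+)$")


def default_version(dataset_path: str) -> Optional[str]:
    """Return the digits of the last version-like directory in the path.

    Returns None when no path component matches the assumed pattern.
    """
    for part in reversed(PurePosixPath(dataset_path).parts):
        match = VERSION_DIR.match(part)
        if match:
            return match.group(1)
    return None
```

With a hypothetical path such as `/data/model/experiment/v20151012/tas.nc`, the function would pick `20151012` as the default version; a path without a version directory yields `None`, so the caller has to supply a version explicitly.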

Report and Status for 2015 (2)

Errata Service (Guillaume)

During CMIP5 it was difficult for scientists:
- to know easily whether the data in hand is deprecated and/or has been replaced and corrected by a newer version,
- to access a description of the issue.

The Quality Control Working Team aims to define and establish a stable and coordinated procedure to collect and provide access to errata information related to datasets hosted by ESGF.

[Diagram labels: PID(s); Python/Django module]

Report and Status for 2015 (3)

Errata Service (Guillaume)
- Concept development at the ESGF Publication Sprint in Paris (October 2015)
- WIP paper review to fix the Errata Service architecture and ESGF dependencies
- By exploiting the PID (un)publication chains and standardized issue information, the Errata Service records and tracks the reasons for dataset version changes.
- For more details, see the poster "CMIP6 errata as a new ESGF service".
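How a chain of PIDs combined with issue records could answer "is my dataset deprecated, and why?" can be sketched as follows. This is only an illustration under stated assumptions: the `VersionRecord` structure, its field names and the `trace` function are hypothetical and do not reflect the actual Handle Service or Issue Manager interfaces.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class VersionRecord:
    """Hypothetical record kept for one dataset version."""
    pid: str                         # persistent identifier of this version
    successor: Optional[str] = None  # PID of the version that replaced it
    issue: Optional[str] = None      # reason why this version was replaced


def trace(records: Dict[str, VersionRecord], pid: str) -> Tuple[str, List[str]]:
    """Follow successor links starting at `pid`.

    Returns the PID of the newest version reached and the list of issues
    recorded along the way (empty if `pid` is already the newest version).
    """
    issues: List[str] = []
    seen = set()
    current = records[pid]
    # The `seen` set guards against an accidental cycle in the chain.
    while current.successor is not None and current.pid not in seen:
        seen.add(current.pid)
        if current.issue:
            issues.append(current.issue)
        current = records[current.successor]
    return current.pid, issues
```

A dataset is deprecated in this sketch exactly when the returned PID differs from the one that was queried; the returned issue list is the description a user would want to see.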

Report and Status for 2015 (4)

Data Citation Service (Martina)

Data citations are provided at model and simulation granularities.

[Figure labels: Credit use case; Evidence use case]

Report and Status for 2015 (5)

Data Citation Service (Martina)
1. Concept development and description in a WIP paper
2. Repository set-up:
   - Definition of use cases for insert/update and export/dissemination of citation information
   - Development of the database schema to store citation information
   - Testing of use cases
3. Start of implementation:
   - Development of the GUI for data creators started
   - Export and display of citation information ("landing page") started
   - Integration of the "Data Citation" link template in the ESGF CoG portal

On schedule.
Poster on "Data Citation Service" tomorrow.

Plans

Versioning
- Continue implementation (together with ESGF-RVWT)
- Version history service planned

Errata Service
- Concept development and description in a WIP paper (in review)
- Discuss and finalize the issue registration process/form
- Development of:
  - an Issue Manager
  - the errata module for the ESGF CoG front-end
  - APIs to query the Handle Service and the Issue Manager
- Deployment on ESGF index nodes

Data Citation Service
- Discuss ESGF version support with CoG / Search API
- Discuss/define interfaces with ESGF and ES-DOC
- Release the citation GUI for data creators
- Provide citation XML for CMIP6 on the OAI server
- Discuss and finalize landing page design and content
- Start the integration of DataCite DOI and LTA services

Timeline

/2015    Errata Service: Fix errata concept
         Citation Service: ESGF test implementation + discuss version support at ESGF F2F
01/2016  Errata Service: Fix issue information design
         Citation Service: Beta release of GUI for testing + consolidation of ESGF test implementation
02/2016  Errata Service developments
03/2016  Citation Service: Release of GUI for data creators + discuss landing page content
06/2016  Errata Service: Operable issue registration
         Citation Service: Operable early citation part
         Draft of recommendations for the integration of external information into ESGF
09/2016  Errata Service: Full operability
         Citation Service: Integration of early citations into LTA/IPCC-DDC and DataCite DOI processes
         Final version of recommendations for the integration of external information into ESGF

(source:

Risks and Planned Co-operations (1)

Data Citation Service

Data citations are provided at model and simulation granularities.

[Figure labels: Credit use case; Evidence use case]

Risks and Planned Co-operations (2)

Planned Co-operations
- ES-DOC and QC teams: add applications as use cases for the integration of external information into ESGF; develop recommendations
- ESGF:
  - CoG and RVWT on version support
  - PWT on the integration of the errata service

Risks
- Data Citation: critical requirement on version support (evidence use case): operable CoG/Search for datasets ≤ version/access date → display metadata for unpublished datasets
- Fallback: display information on creators, funders, title etc. for use in acknowledgements. Data is then not formally citable (not in the reference list) because of non-compliance with Force 11's 'Joint Declaration of Data Citation Principles'.
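The evidence use case requires finding the dataset version that was current at a given access date, even if that version has since been unpublished. A minimal sketch of such a lookup, assuming a simple mapping from version label to publication date (the mapping, the labels and the function name `version_at` are hypothetical and not part of the CoG/Search API):

```python
from datetime import date
from typing import Dict, Optional


def version_at(versions: Dict[str, date], access: date) -> Optional[str]:
    """Return the newest version published on or before `access`.

    `versions` maps a version label to its publication date and must
    also contain versions that were unpublished later. Returns None if
    nothing had been published by the access date.
    """
    candidates = [
        (published, label)
        for label, published in versions.items()
        if published <= access
    ]
    return max(candidates)[1] if candidates else None
```

The sketch only works if metadata for unpublished versions stays available, which is exactly the dependency named as a risk above.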

Summary

Status
- Errata Service concept finalization: WIP review + ESGF F2F feedback
- Data Citation Service on schedule; open issues to be discussed at ESGF F2F

Future Plans and Risks
- IPSL hires Atef Bennasser to develop missing components and the API for Errata Service deployment/operability (first half of 2016).
- Operable Data Citation Service with basic functionality planned for 06/2016. Version support for the Data Citation Service is currently unclear.
- Prepare recommendations for the integration of external information into ESGF in cooperation with ES-DOC.

References

ESGF-QCWT:
Errata Service IPSL PoC:
Data Citation Service:
WIP Papers:
- G. Levavasseur, S. Denvil (2015): Errata system for CMIP6.
- M. Stockhause, F. Toussaint, M. Lautenschlager (2015): CMIP6 Data Citation and LTA.