3/30/04 16:14 1 Lessons Learned CERES Data Management Presented to GIST 21 “If the 3 laws of climate are calibrate, calibrate, calibrate, then the 3 laws.

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

Software change management
Configuration management
Configuration Management
Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
Evaluation and Reporting ENTC 4060 Project Scheduling.
Chapter 10 Schedule Your Schedule. Copyright 2004 by Pearson Education, Inc. Identifying And Scheduling Tasks The schedule from the Software Development.
W5HH Principle As applied to Software Projects
Systems Analysis and Design 9th Edition
IS 214 Needs Assessment and Evaluation of Information Systems Managing Usability © Copyright 2001 Kevin McBride.
Architecture is More Than Just Meeting Requirements Ron Olaski SE510 Fall 2003.
Pertemuan 25 Matakuliah: A0214/Audit Sistem Informasi Tahun: 2007.
Fundamentals of Information Systems, Second Edition
Systems Engineering Management
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Effort in hours Duration Over Weeks Or Months Inception Launch Web Lifecycle Methodology Maintenance Phases Copyright Wonderlane Studios.
Configuration Management
Chapter 25 – Configuration Management 1Chapter 25 Configuration management.
Software Documentation Written By: Ian Sommerville Presentation By: Stephen Lopez-Couto.
Systematic learning from mistakes: achievements and challenges Andy Sutherland, NHS Information Centre for health and social care.
System Implementation
Release & Deployment ITIL Version 3
This chapter is extracted from Sommerville’s slides. Text book chapter
Effective Methods for Software and Systems Integration
8/28/ Case Study Who – capstone prepared students What – opportunity to work and learn in a team environment Why – to interactively experience.
S/W Project Management
IT Project Management Cheng Li, Ph.D. August 2003.
BTS730 Communications Management Chapter 10, Information Technology Management, 5ed.
Information Systems Security Computer System Life Cycle Security.
1 Quality Assurance In moving information from statistical programs into the hands of users we have to guard against the introduction of error. Quality.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Future support of EGI services Tiziana Ferrari/EGI.eu Future support of EGI.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
3.08 b Determine venture’s information technology.
Configuration Management (CM)
Creator: ACSession No: 16 Slide No: 1Reviewer: SS CSE300Advanced Software EngineeringFebruary 2006 (Software Quality) Configuration Management CSE300 Advanced.
Project Scope Management Project management Digital Media Department Unit Credit Value : 4 Essential Learning time : 120 hours.
Managing the Impacts of Programmatic Scale and Enhancing Incentives for Data Archiving A Presentation for “International Workshop on Strategies for Preservation.
Software Engineering Quality What is Quality? Quality software is software that satisfies a user’s requirements, whether that is explicit or implicit.
Chapter 101 The Design Process Chapter 10 Achieving Quality Through Continual Improvement Claude W. Burrill / Johannes Ledolter Published by John Wiley.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
Software Quality Assurance
Creating documentation and metadata: Recording provenance and context Jeff Arnfield National Climatic Data Center Version a1.0 Review Date.
Managing the Impacts of Change on Archiving Research Data A Presentation for “International Workshop on Strategies for Preservation of and Open Access.
Fundamentals of Information Systems, Second Edition 1 Systems Development.
Business Analysis. Business Analysis Concepts Enterprise Analysis ► Identify business opportunities ► Understand the business strategy ► Identify Business.
Requirements Validation
Configuration Management CSCI 5801: Software Engineering.
1 Project Communications Management Lecture 11. Learning Objectives Describe the importance of good communication on projects and major components of.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
State of Georgia Release Management Training
Requirement engineering & Requirement tasks/Management. 1Prepared By:Jay A.Dave.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Advances In Software Inspection
CS223: Software Engineering Lecture 16: The Agile Methodology.
Publishing Services Bureau Web Communications Services Tips for managing the publication process Communications Workshop October 23, 2003.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Serving IT up with ITIL By Thane Price. IT is the laboratory’s pit crew  Goal : Make technology transparent while accomplishing valuable internal customer.
Chapter 25 – Configuration Management 1Chapter 25 Configuration management.
NASA Earth Science Data Stewardship
Configuration Management
Information Systems Development
Configuration Management
Software Configuration Management
Software Documentation
Chapter 11: Software Configuration Management
Configuration management
Software Reviews.
Presentation transcript:

3/30/04 16:14 1 Lessons Learned CERES Data Management Presented to GIST 21 “If the 3 laws of climate are calibrate, calibrate, calibrate, then the 3 laws of data management are communicate, communicate, communicate.” Anon. April 1, 2004

3/30/04 16:14 2 Presentation Objectives Document experiences in CERES Data Management (DM) that might be useful in releasing GERB data products –Data Release –Metadata –End user support Foment discussion –CERES wants to learn from your experiences, as well

3/30/04 16:14 3 CERES DM Lessons Learned Data Products Design Affects Release Data products are designed by the instrument team, not the end users –Inherent flaws from the end user point of view –Involve end users as early as possible –Ask yourself what you liked/disliked about products you have used Good Software Engineering is critical to code success over lifetime –Compartmentalize dependencies to permit maintainability Minimize hardware and compiler dependencies –Satellite and specific instrument dependencies in configuration files –Separate scientific algorithms and data management functions Naming Conventions for Data Sets –Make it intuitive –Maturity/scientific quality as well as contents Data Product format should be well documented –Make it easy for the consumer to use the data –CERES sample read-packages provide working code, samples

3/30/04 16:14 4 CERES DM Lessons Learned Data Production Ensure development and production platforms are identical to avoid surprises Re-processing is never considered early enough –Processing and re-processing are different production considerations –Long duration observations evolve over their lifetime Inputs change, algorithms improve Hardware, compilers and operating systems evolve –Climate data records necessitate highly stable data products Need consistent processing conditions to avoid introducing biases –PI’s always want re-processing done quickly (10x) Target for CERES is to fully re-process the entire collection in 1 year Monitor ingest for missing data elements –Prepare production control to test for missing data and avoid running without it –Late delivery of data products –Look for opportunities to substitute for missing data in production stream –Prepare for discovery of bad data after months of ingest

3/30/04 16:14 5 CERES DM Lessons Learned Data Release Process Two release processes targeting different audiences Science Team “release”: ValRx to evaluate Production data –QC to avoid processing and distributing bad data or algorithms Public Release requires careful consideration –Make Public Release a formal process PI & Science Team approval –Any public release should be a complete package Assessment of maturity and quality (Quality Summary) Tools to make it easy to use (sample read package) End user data collection, agreements and notices –What is the value of data being published? Early peer review often detects systematic problems that developers do not see Use in assimilation for modeling? Cross-calibration? Stakeholder value to claim credit for accomplishment and a basis for future funding? Final data set for long term use by unexpected users?

3/30/04 16:14 6 CERES DM Lessons Learned Data Product Distribution Process CERES distribution is constrained by size of data products Make data products easy to obtain and use by the consumers Do not withdraw data products if used in publication –Even if there are errors –If they should not be used in publication, make that clear Make delivery as flexible as possible to make data useful –Subsetting, based on time and geography Pre-formatted data products are often huge –Users often don’t have means of locally subsetting the data –Size impacts customer response time and cost of distribution Networks and physical media can impact customer satisfaction Require registration to obtain data products –Request notification if data products are re-distributed

3/30/04 16:14 7 CERES DM Lessons Learned Post-delivery User Support Communicate, communicate, communicate –Make customers interested in your future –Be proactive about communicating problems when they are discovered –Ask for feedback about their use of your data –Obtain acknowledgement of use in publication Citation searches by a library, etc Request users to provide a copy of papers in which your data is used Stakeholders always ask for some proof of use Good documentation can reduce the interactive user support –Both format and science quality –Separate technical support for accessing the data products from the science –Sample Read Packages to help Users use it Permit them to get to data quickly