DISCUSSION DRAFT ONLY Data Management METRICS for NNDC and CLASS David Hermreck.

Slides:



Advertisements
Similar presentations
Requirements Engineering Process
Advertisements

Pulling it all together… with thanks to Sheila Anderson.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Preservation Strategies: What do long-term archives do with my data? Jeff Arnfield NOAA’s National Climatic Data Center Version 1.0 Review Date.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Live Bank Case Study.  Applying CBA Executive Banking School learning objectives to a Live Case Study on a current relevant topic  Applying Retail Bank.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Transaction Processing System  Business Transactions are certain events that occur routinely in a business firm.  A transaction is a set of activities.
Reliability Focus Area Project L13 SHRP 2 Technical Coordinating Committee for Reliability Research Meeting Irvine, California April 08, 2010 Zongwei Tao,
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
1 Next Generation of Operational Earth Observations From the National Polar-Orbiting Operational Environmental Satellite System (NPOESS): Program Overview.
Natural Resource Program Center Data Manager’s Conference Data Store and NatureBib April 3, 2008 Brent Frakes.
Case 2: Emerson and Sanofi Data stewards seek data conformity
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
The DPubS Development Project: Building an Open Source Electronic Publishing System David Ruddy Cornell University Library.
Mr. Gopi Nair Defense Technical Information Center Briefing at Board on Research Data and Information (BRDI) Meeting September 24, 2009 Approved for Public.
Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical.
“Guidance on the Selection and Appraisal of Geospatial Content of Enduring Value, April 2014 Draft” groups-subcommittees/hdwg/index_html.
1 NOAA Use of the Open Archival Information System Reference Model (OAIS-RM) Ken McDonald NOAA NESDIS ESIP Federation Meeting July 9, 2009.
Chapter 1 Introduction to Databases. 1-2 Chapter Outline   Common uses of database systems   Meaning of basic terms   Database Applications  
INFORMATION MANAGEMENT Unit 2 SO 4 Explain the advantages of using a database approach compared to using traditional file processing; Advantages including.
Diagnostics Clinical Information Management (CIM) Services Field Report: Implementation of CDISC ODM Michael Walter.
Small steps and lasting impact: making a start with preservation or It’s not all NASA Patricia Sleeman Digital Archives and Repositories University of.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Climate Archives in NOAA: Challenges and Opportunities March 4, 2014.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
1 Accomplishments. 2 Overview of Accomplishments  Sustaining the Production Earth System Grid Serving the current needs of the climate modeling community.
1 Overall Architectural Design of the Earth System Grid.
EO Dataset Preservation Workflow Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
Ed Kearns National Climatic Data Center Asheville, NC.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
OAIS (archive) Producer Management Consumer. Representation Information Data Object Information Object Interpreted using its Yields.
OAIS (archive) OAIS (archive) Producer Management Consumer.
Data Stewardship and Maintenance for the National Hydrography Dataset Gladys Conaway USGS National Geospatial Technical Operations Center III.
Introduction To DBMS.
Ingest and Dissemination with DAITSS
User Characterization in Search Personalization
Requirements Engineering Process
OneStop Project Update for WGISS
OAIS Producer (archive) Consumer Management
Adapted from Presentations by:
DAITSS: Dark Archive in the Sunshine State
Data Centres in the Virtual Observatory Age
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Implementing an Institutional Repository: Part II
Research data preservation in Canada
Digital Preservation and Trusted Digital Repositories
Robin Dale RLG OAIS Functionality Robin Dale RLG
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

DISCUSSION DRAFT ONLY Data Management METRICS for NNDC and CLASS David Hermreck

DISCUSSION DRAFT ONLY Metrics - Context Revisit appropriate operational performance metrics in an environment with an operational CLASS. CLASS and NNDC metrics are currently overlapping. Metrics should focus on core functions. Need both Development and Operations Metrics Will use a “bank” analogy to better understand CLASS and NNDC Operations roles

DISCUSSION DRAFT ONLY CLASS Development Metrics CLASS development requires Capability Maturity Model Integration (CMMI) level 3 – this model provides many potential metrics Potential metrics could include: Number of Change Requests implemented on time and budget Major software releases delivered to operations on time and budget Other CMMI metrics

DISCUSSION DRAFT ONLY Operations Metrics - The Bank Analogy Safe and Secure Preservation just storage CLASS is…is NOT…

DISCUSSION DRAFT ONLY The Bank Analogy Safe and Secure Preservation Stewardship CLASS is…is NOT…

DISCUSSION DRAFT ONLY The Bank Analogy Wholesale Access Retail Access CLASS is… is NOT…

DISCUSSION DRAFT ONLY The Bank Analogy Retail Access: Checks Debit Cards Branch Banking ATM access Public usability CLASS is…NNDC is… “Interbank” Transactions “Owner” Deposits “Owner” Withdrawal Commercial Access “Read Only” Expert Users Primary

DISCUSSION DRAFT ONLY CLASS Service CLASS Services – Ingest Archive Storage “One NOAA” Access Coordinated OAIS with NNDC NODCNCDCNGDC Climate Perspective Ocean Perspective Geophysical Perspective Raw IngestRaw Access XML? Per submission agreements IT Services Layer Domain Expert Layer Preservation Layer Stewardship Layer

DISCUSSION DRAFT ONLY Service Metrics CLASS Metrics – Data Stored (PB, # sets) Data Accessed (# inquiry, volume) Data Ingested Latency Preservation Activity NODCNCDCNGDC Climate Perspective Ocean Perspective Geophysical Perspective Raw IngestRaw Access XML? computer to computer Per submission agreements IT Services Layer Domain Expert Layer Preservation Layer Stewardship Layer Customers Served Tailored portals Stewardship actions Metadata enhancement Note that NNDC stewarded (“owned”) data can be delivered to end-users without passing through the NNDC owner’s site. Domain Portals & Expert User Support

DISCUSSION DRAFT ONLY NNDC Metrics: Questions re. CLASS Does CLASS support: subsetting capabilities? (Does this require an inappropriate understanding of content?) data mining? Regeneration (e.g., producing an intermediate data form in response to a query?) Can CLASS change (obsolete) the external “look and feel” of data access (e.g., no command line access)? Can CLASS obsolete “old” access methods (e.g., dial-up modems, 8” diskettes, 2” tape, etc.?) How do these impact CLASS metrics?

DISCUSSION DRAFT ONLY NNDC Metrics: Questions re. CLASS Is CLASS independently responsible for reversible transformations (perhaps as part of media migration)? Is this an “operations” question? Can CLASS independently do irreversible transformations, if required for data preservation? Can CLASS move obsolete editions to lower care levels? How does CLASS measure closely coupled data transfers (e.g., for reprocessing)? How do these impact NNDC metrics?

DISCUSSION DRAFT ONLY Metrics CLASS metrics would move toward: infrastructure/wholesale storage and preservation NNDC metrics would move toward: Value-added/retail Stewardship focused CLASS metrics and NNDC metrics should (eventually) be distinguishable. However, CLASS should report nonintermediated access statistics for NNDC owned/stewarded datasets. Metrics need further development.

DISCUSSION DRAFT ONLY CLASS Metrics ?? Volume in and out by time, by NNDC Volume stored Collection Inventory Inventory changes Preservation activity Data flows & latency Storage and bandwidth reserves

DISCUSSION DRAFT ONLY NNDC Metrics ?? Number & quality of Value- added interfaces (usage?) Datasets reprocessed or enhanced # datasets at highest level of maturity Volume served/total of data at highest level of maturity Customer liaison contacts Metadata enhancements

DISCUSSION DRAFT ONLY Conclusion Eventually, NNDC and CLASS metrics should be distinct. More work is needed to identify the “right” metrics to measure effectiveness. Coordinate with other data centers in NASA and USGS on metrics Good metrics are HARD! – multiple measures are required