Proposed Post-processing and Referral Metrics Robert Wolfe (GSFC – MODAPS) and Tom Sohre (EDC – LP DAAC) Presented by Jason Werpy (EDC – LP DAAC) Earth.

Slides:



Advertisements
Similar presentations
Closing the User Feedback Loop: Effective and Swift Interface Development at ASF J.I. Garron 1, D.A. Simmons 1 and B.E. Crevensten 2 1 Alaska Satellite.
Advertisements

U.S. Department of the Interior U.S. Geological Survey MRTWeb: Enhanced MODIS data discovery and delivery services from the LP DAAC January 18, 2007 Tom.
MODIS Data at NSIDC MODIS Collection 5/Long Term Data Record Workshop Molly McAllister & Terry Haran January
Metrics Planning Group (MPG) Report to Plenary Clyde Brown ESDSWG Nov 3, 2011.
LP DAAC Status MODIS Science Team Meeting April , 2014 Chris Doescher, PMP LP DAAC Project Manager Dr. David Meyer.
Other Distribution of MODIS Data MODIS Land Collection 5 Workshop Jan. 17, 2007 Robert Wolfe NASA GSFC Code
LP DAAC Status Dave Meyer Bhaskar Ramanchandran MODIS Science Team Meeting University of Maryland, College Park May 18-20, 2011.
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
NASA Agency Report Ken McDonald NASA May 2002
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
Indrani Kommareddy, David Roy & Junchang Ju Geographic Information Science Center of Excellence South Dakota State University, USA Web-enabled Landsat.
Best Practices Using Enterprise Search Technology Aurelien Dubot Consultant – Media and Entertainment, Fast Search & Transfer (FAST) British Computer Society.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
User Services Experience GSFC DAAC MODIS Science Team Meeting Greenbelt, MD July 2002
Methods for Data Discovery – Portals Portal facilitates access to and also assimilation of data Portal is not simply a web site: it offers services such.
05 December, 2002HDF & HDF-EOS Workshop VI1 SEEDS Standards Process Richard Ullman SEEDS Standards Formulation Team Lead
U.S. Department of the Interior U.S. Geological Survey Land Processes DAAC Update MODIS-VIIRS Science Team Meeting January 26-28, 2010 Dave Meyer LP DAAC.
Earth Observing System Data and Information System (EOSDIS) provides access to more than 3,000 types of Earth science data products and specialized services.
December 5, 2002HDF-EOS Workshop VI1 The Goddard Earth Sciences Distributed Active Archive Center The Goddard Earth Sciences Distributed Active Archive.
Global Land Cover Facility The Global Land Cover Facility (GLCF) is a member of the Earth Science Information Partnership (ESIP) Federation providing data,
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Web services at TRFIC TRFIC has developed the Access Technologies to achieve its goals of interoperability and provide access to data and information on.
U.S. Department of the Interior U.S. Geological Survey Access to MODIS Land Data Products Through the Land Processes DAAC John Dwyer and Carolyn Gacke,
MODIS Land Product Subsets Suresh K. Santhana Vannan, Robert B. Cook, Bruce E. Wilson, Lisa M. Olsen HDF and HDF-EOS Workshop XII October 15 – October.
EOSDIS Status 9/29/2010 Dan Marinelli, NASA GSFC
MPARWG Business & Disposition of Action Items from MPARWG October 2009 H. K. (Rama) Ramapriyan NASA/GSFC Metrics Planning and Reporting (MPAR) WG 9 th.
September 4, 2003MODIS Ocean Data Products Workshop, Oregon State University1 Goddard Earth Sciences (GES) Distributed Active Archive Center (DAAC) MODIS.
EOSDIS Status 10/16/2008 Dan Marinelli, Science Systems Development Office.
ESDSWG meeting – 10/21-23/2008 Metrics Planning and Reporting (MPAR) WG Breakout Summary H. K. (Rama) Ramapriyan NASA GSFC Clyde Brown NASA LaRC Co-Chairs,
MODLAND Volumes and Loads Status MODIS Land Science Team Workshop July 15, 2003 Robert Wolfe MODIS Land Team Support Group NASA GSFC Code 922, Raytheon.
Health eDecisions Use Case 2: CDS Guidance Service Strawman of Core Concepts Use Case 2 1.
National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology Pasadena, California EDGE: The Multi-Metadata.
NetCDF file generated from ASDC CERES SSF Subsetter ATMOSPHERIC SCIENCE DATA CENTER Conversion of Archived HDF Satellite Level 2 Swath Data Products to.
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
EOSDIS FY2009 Annual Metrics Report Prepared By: Hyo Duck Chang Adnet, Inc. Brian Krupp Adnet, Inc. Lalit Wanchoo Adnet, Inc. March 2010.
Evolving toward a Coherent, Collaborative Framework for Earth Science Data, Tools and Services Christopher Lynnes, Kwo-Sen Kuo and Kevin Murphy Earth Science.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
1 ECHO and EDG Status May 9, 2006 Beth Weinstein, Yonsook Enloe,
Cyberinfrastructure to promote Model - Data Integration Robert Cook, Yaxing Wei, and Suresh S. Vannan Oak Ridge National Laboratory Presented at the Model-Data.
MDPHnet & ESP Data Partner Participation Overview The following slides describe the necessary steps for a data partner to participate in the MDPHnet Network.
A Menagerie of ESIP OpenSearch Clients C. Lynnes, NASA/GSFC K. Keiser, U. Alabama--Huntsville.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
NASA REASoN Project SHAirED: S ervices for H elping the Air -quality Community use E SE D ata Stefan Falke, Kari Höijärvi and Rudolf Husar, Washington.
NASA REASoN Project SHAirED: S ervices for H elping the Air -quality Community use E SE D ata Stefan Falke, Kari Höijärvi and Rudolf Husar, Washington.
2011 ACSI Survey Summary HDF/HDF-EOS Workshop Riverdale, MD April 18, 2012.
EOSDIS FY2008 Annual Metrics Report Prepared By: Ed Sofinowski SGT, Inc. Donna Rahmani SGT, Inc. March 2009 ESDIS Project GSFC Code 423.
An Enterprise Clinical Data Search Solution. is Designed for: Informatics professionals, clinicians, statisticians, data managers and process/quality.
MODIS SDST, STTG and SDDT MODIS Science Team Meeting (Land Discipline Breakout Session) July 13, 2004 Robert Wolfe Raytheon NASA GSFC Code 922.
Science Review Panel Meeting Biosphere 2, Tucson, AZ - January 4-5, 2011 Vegetation Phenology and Vegetation Index Products from Multiple Long Term Satellite.
MODIS Data at NSIDC MODIS Science Team Meeting - Nov. 2, 2006.
ORNL DAAC SPATIAL DATA ACCESS TOOL Open Geospatial Consortium (OGC) Services Bruce E. Wilson Suresh K. Santhana Vannan Yaxing Wei Tammy W. Beaty National.
NPP DataVisualization using McIDAS-V NPP DataVisualization using McIDAS-V Tommy Jasmin, Tom Rink, and Tom Achtor
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
U.S. Department of the Interior U.S. Geological Survey LP DAAC Big Earth Data Initiative (BEDI) Developed Web Services 1 Jason Werpy LP DAACEnterprise.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
LP DAAC Overview – Land Processes Distributed Active Archive Center Chris Doescher LP DAAC Project Manager (605) Chris Torbert.
U.S. Department of the Interior U.S. Geological Survey July 2014 OPeNDAP Services – Present and Future at LP DAAC Brian Davis 1, Rob Quenzer 1, Jason Werpy.
AIRS Meeting GSFC, February 1, 2002 ECS Data Pool Gregory Leptoukh.
What’s New for the MES Product Suite Tom Hechtman & Jason Coope.
ESDRs Distribution and User Support – Miscellaneous Topics
Persistent Identifiers Implementation in EOSDIS
Improving Data Access, Discovery, and Usability
LP DAAC AppEEARS Data Access
Earth Data Search Tool Demo
LP DAAC OPeNDAP Services
Presentation transcript:

Proposed Post-processing and Referral Metrics Robert Wolfe (GSFC – MODAPS) and Tom Sohre (EDC – LP DAAC) Presented by Jason Werpy (EDC – LP DAAC) Earth Science Data System Working Group Meeting – Metrics Breakout October 22, 2010

Metrics Challenge #1: Post-processing Data Archive (i.e. LPDAAC) user Value-Add Processing Service (i.e. MRTWeb) Value-added services are available to users to provide customized products, rather than just the standard HDF granule Distribution metrics have historically focused on how much data went out the door (granules or volume) … And, performance measurement baselines have been developed around “distribution volumes”. Challenges: – Services perform “processing” on the user’s behalf … But this “processing effort” isn’t currently reported. – Services such as mosaicing and subsetting change “what” is being distributed (could be a portion of the original data … or a “new piece” of data that is really a combination of multiple “archived” granules)... What should be reported? The volume/number of “final” data pieces delivered to the user? And/or … the volume/number of “input” data necessary to create the value-added product? Subset request 1 granule returned from the archive to support request; 1MB 1 granule returned from the Service ;100k (10x reduced volume) Data Archive (i.e. LPDAAC) user Value-Add Processing Service (i.e. MRTWeb) Mosaic request 10 granule returned from the archive to support request 1 granule returned from the Service 2

Why Post-processing Metrics? Post-processing metrics would help NASA management understand the value added from post-processing at the NASA data centers Questions that could be answered are: – How much does post-processing reduce data volume (and/or number of granules) to users? – How much processing resources were used to perform the post-processing? ESDS WG - Oct. 22, 20103

Post-processing Metrics Post-processing activities include: reformatting, spatial sub-setting, variable subsetting, mosaicing, reprojection, aggregation, filtering Some of these activities have minimal computational requirements (e.g. reformatting, sub-setting) and others have are significant (e.g. reprojection) Most of the activities are volume reducers, but some increase the volume (e.g. reprojection to a smaller pixel size) Many of the activities reduce the number of granules, but for some the number of granules may increase (e.g. reformatting) In most cases, the post-processing occurs at the data center where the data is archived, but in some cases it may be done at a separate data center ESDS WG - Oct. 22, 20104

Additional Post-processing Metrics Data Archive Post Processing Staging Area User DU - Data to user DP2 – Post- processed data Simplified Post- processing flow PP – Post processing computation DP1 – Data to post-processing Data Center Direct delivery data flow Post-processing data flow ESDS WG - Oct. 22, 20105

Post-processing Metrics Standard data transfer metric: – DU – Data to user: volume and number of granules Proposed additional post-processing metrics: – DP1 – Data to post-processing: volume and number of granules – DP2 – Post-processed data: volume and number of granules – PP – Post-processing computation: standard computational metrics (standardized CPU seconds?), number of post-processing orders and type(s) of post- processing ESDS WG - Oct. 22, 20106

Additional Post-processing Metrics – Two Data Centers Case Simplified Post- processing flow Data Archive Post Processing Staging Area DP2’ – Post- processed data PP’ – Post processing computation DP1’ – Data to post-processing Data Center 1 (archive) Post Processing Staging Area User DU - Data to user DP2 – Post- processed data PP – Post processing computation DP1 – Data to post-processing Data Center 2 (distribution) DD - Data center transfer (for post processing) Post-processing at archive data center Post-processing at distribution data center ESDS WG - Oct. 22, 20107

PP Metrics – Two Data Center Case Standard data transfer metric: – DU – Data to user: volume and number of granules Proposed additional post-processing metrics: – DP1, DP2 and PP – Same as single data center case – DD – Data center transfer (for post-processing): volume and number of granules – DP1’, DP2’ and PP’ – (optional) additional post- processing metrics from archive data center ESDS WG - Oct. 22, 20108

Metrics Challenge #2: Referral Data Archive (i.e. LPDAAC) user Data Discovery Interface (i.e. WIST) Data Discovery Interface (i.e. Mercury) Data Discovery Interface (i.e. LAADS) User(s) utilize various interfaces to search/access data from an archive (DAAC)… The archive currently reports “distribution” metrics (i.e. how much data went out the door…) – But the mechanism to “find” the data isn’t generally reported? Challenges: – How to recognize the referral mechanism? (what metrics?) – Don’t want to “double count” distribution (once from the archive, and again from the “client”) – The referral mechanism may or may not be another ESDIS partner. (could be a 3 rd party using a web service developed by the archive) data distribution ESDS WG - Oct. 22, 20109

Referral Metrics for Cross-Data Center Ordering Metrics are needed in the case for cross-data center ordering Scenario: A search and order tool at one data center (or WIST/ECHO) is used to find a set of data (granules) at a different center. Why?: It would be useful to know how much (and what) data is being cross-searched. Metric: The number of times a user searches (and finds) a data set stored at a different data center. ESDS WG - Oct. 22,

Questions/Comments? ESDS WG - Oct. 22,