The HDF Group HDF4 Mapping Project Update Apr. 17-19, 2012HDF/HDF-EOS Workshop XV1 Ruth Aydt

Slides:



Advertisements
Similar presentations
The HDF Group Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC Mike.
Advertisements

Database Planning, Design, and Administration
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Remote Access via OPeNDAP Kent Yang and Joe Lee The HDF Group The 14 th HDF/HDF-EOS Workshop.
The HDF Group HDF Group Support for NPP/JPSS Mike Folk, Elena Pourmal, Larry Knox, Albert Cheng The HDF Group The 15 th HDF and HDF-EOS.
The HDF Group ESIP Summer Meeting HDF-Java Overview Joel Plutchak The HDF Group 1 July 8 – 11, 2014.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)
The HDF Group HDF4 Mapping Status and Proposal Mike Folk, Ruth Aydt The HDF Group Oct. 17, 2012 Apr , 2012HDF/HDF-EOS Workshop XV.
The HDF Group Apr , 2012HDF/HDF-EOS Workshop XV1 Interoperability with netCDF-4 Kent Yang, Larry Knox, Elena Pourmal The HDF Group.
Database Planning, Design, and Administration Transparencies
Improving long-term preservation of EOS data by independently mapping HDF4 data objects The HDF Group.
Lecture Nine Database Planning, Design, and Administration
The HDF Group Improving long-term preservation of EOS data by independently mapping HDF4 data objects Mike Folk, Ruth Aydt, Joe Lee, Binh-Minh.
University of Illinois at Urbana-ChampaignHDF 1McGrath/Yang 2/27/02 Transitioning from HDF4 to HDF5 Robert E. McGrath Kent Yang.
Support EOS: Review and Discussions Kent Yang and Joe Lee The HDF Group October 16, 2012 Oct. 16, 2012Annual HDF Briefing to ESDIS1.
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Access of NASA HDF data via OPeNDAP Kent Yang and Joe Lee The HDF Group September 28,2010.
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Mike Folks, The HDF Group Ruth Duerr, NSIDC 1.
HDF5 Tools Update Peter Cao - The HDF Group November 6, 2007 This report is based upon work supported in part by a Cooperative Agreement.
EOSDIS User survey follow-up Mike Folk, Kent Yang, Elena Pourmal The HDF Group Oct. 17, 2012 Annual HDF Briefing to ESDIS1.
ITEC224 Database Programming
Important ESDIS 2009 tasks review Kent Yang, Mike Folk The HDF Group April 1st, /1/20151Annual briefing to ESDIS.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
The HDF Group September 28, 2010HDF/HDF-EOS Workshop XIV1 Easy Access of HDF data via NCL/IDL/MATLAB Kent Yang, Tong Qi, Ziying Li, Yi.
Page 1 HDF-EOS Tools Abe Taaheri, Raytheon IIS ESIP Meeting Chapel Hill, NC July 9, 2013.
1 Minggu 9, Pertemuan 17 Database Planning, Design, and Administration Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
HDF Converting between HDF4 and HDF5 MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,
Tools for Interoperability between HDF and NetCDF Mike Folk and MuQun Yang The HDF Group The HDF Group provides the following tools for the NASA HDF and.
1/14/200925th IIPS Conference 1 Challenges to Archive and Access NASA HDF-EOS Data in the long Term MuQun Yang (The HDF Group) Choonghwan Lee (The HDF.
The HDF Group HDF5 Tools Updates Peter Cao, The HDF Group September 28-30, 20101HDF and HDF-EOS Workshop XIV.
Database Planning, Design, and Administration Transparencies
Support for NPP/NPOESS by The HDF Group Mike Folk The HDF Group HDF and HDF-EOS Workshop XII October 17, 2008 Oct HDF and HDF-EOS Workshop XII1.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
11/7/2007HDF and HDF-EOS Workshop XI, Landover, MD1 HDF5 Software Process MuQun Yang, Quincey Koziol, Elena Pourmal The HDF Group.
The HDF Group November 3-5, 2009 HDF-OPeNDAP Project Update HDF/HDF-EOS Workshop XIII1 Joe Lee and Kent Yang The HDF Group James Gallagher.
Towards Long-Term Archiving of NASA HDF-EOS and HDF Data Data Maps and the Use of Mark-Up Language Ruth Duerr, Mike Folk, Muqun Yang, Chris Lynnes, Peter.
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC The HDF Group Oct HDF and.
Creating Archive Information Packages for Data Sets: Early Experiments with Digital Library Standards Ruth Duerr, NSIDC MiQun Yang, THG Azhar Sikander,
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII10/17/2008.
1 HDF-EOS Status, Related Tools and Issues. 2 Overview.
Sharing Metadata Recommendations Ted Habermann, John Kozimor Earth Science The HDF Group 1 John Farley Raytheon.
HDF OPeNDAP Project Update MuQun Yang and Hyo-Kyung Lee The HDF Group March 31, Annual briefing to ESDIS10/31/2015.
EOSDIS Status 10/16/2008 Dan Marinelli, Science Systems Development Office.
The HDF Group HDF/HDF-EOS Workshop XV1 Tools to Improve the Usability of NASA HDF Data Kent Yang and Joe Lee The HDF Group April 17, 2012.
- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.
The HDF Group Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal, Peter Cao The HDF Group November 5, 2009 November 3-5,
NetCDF file generated from ASDC CERES SSF Subsetter ATMOSPHERIC SCIENCE DATA CENTER Conversion of Archived HDF Satellite Level 2 Swath Data Products to.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1 HDF and HDF-EOS Workshop XII10/17/2008.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
July 20, Update on the HDF5 standardization effort Elena Pourmal, Mike Folk The HDF Group July 20, 2006 SPG meeting, Palisades, NY.
10/16/2012Annual HDF briefing1 HDF OPeNDAP support Kent Yang, Joe Lee, Mike Folk The HDF Group Oct. 16, 2012.
HDF-EOS Workshop IV September 19-21, 2000 Richard E. Ullman ESDIS Information Architect NASA/ GSFC, Code 423.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
Improving long-term preservation EOS data by independently mapping HDF4 data objects Mike Folk, Ruth Aydt, Peter Cao, Kent Yang Ruth Duerr Christopher.
HDF and HDF-EOS: Implications for Long-Term Archiving and Data Access.
The HDF Group November 3-5, 2009HDF/HDF-EOS Workshop XIII1 The New HDF-EOS Web Site - How it can help you Kent Yang, Joe Lee The HDF Group.
Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal The HDF Group Annual HDF Briefing to ESDIS March 31, 2009 March Annual HDF Briefing.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
HDF and HDF-EOS Workshop XII
Moving from HDF4 to HDF5/netCDF-4
Mike Folk, Peter Cao, Kent Yang Ruth Duerr Christopher Lynnes
Persistent Identifiers Implementation in EOSDIS
Kent Yang, Mike Folk The HDF Group March 31, 2009
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
HDF Support for NASA Data Producers
HDF5 Tools Updates and Discussions
Presentation transcript:

The HDF Group HDF4 Mapping Project Update Apr , 2012HDF/HDF-EOS Workshop XV1 Ruth Aydt The HDF Group The 15 th HDF and HDF-EOS Workshop April 17-19, 2012

Project Motivation Apr , 2012HDF/HDF-EOS Workshop XV2 DVD HDF4 file HDF4 LibraryHDFView

Project Purpose Ensure long-term access to EOS data stored in HDF4 files. Apr , 2012HDF/HDF-EOS Workshop XV3

Project Scope Apr , 2012HDF/HDF-EOS Workshop XV4 HDF4 Library HDF4 Files with EOS Data produced HDF4 Files with EOS Data valuable to community HDF4 Mapping Project Scope HDF4 File Content Maps Concern Idea Proof of Concept Prototype ProductDevelopSupport ? ? Verification Requirements Study Verification Implementation Time April 2012

Concern – Workshop VIII (2004) “HDF and HDF EOS: Implications for Long-Term Archiving and Data Access” - Ruth Duerr, NSIDC Apr , 2012HDF/HDF-EOS Workshop XV5 Slide Notes: “Without human readability you are locked into having to maintain the read software forever!”

Idea – Workshop X (2006) “Leveraging HDF Utilities” - Chris Lynnes, GES-DISC Apr , 2012HDF/HDF-EOS Workshop XV6

HDF4 File Contents – User View Apr , 2012HDF/HDF-EOS Workshop XV7 Objects & Relationships User Metadata Object Data

HDF4 File Contents – Format View Apr , 2012HDF/HDF-EOS Workshop XV8 Vgroup name = variable_name class = Var0.0 NDG SDD SD NT variable name = variable_name rank type storagetype data Vdata name = attribute_name class = Attr …* 1 1 attribute name = attribute_name byte order, chunked storage, compression, … Object Data

Proof of Concept (8/07- 7/08) Categorize HDF4 data held by NASA Build a prototype Apr , 2012HDF/HDF-EOS Workshop XV9 Map Writer linked with HDF4 library Map Writer linked with HDF4 library bytestreams Objects & Relationships; User Metadata; Object Data retrieval & reconstruction information HDF4 File Object Data Reader 2 independent readers in C and Perl HDF4 File Content Map (XML) request

Develop Product (11/09 - 7/11) Apr , 2012HDF/HDF-EOS Workshop XV10 Tasks: A.Investigate integration of mapping schema with existing standards B.Determine HDF-EOS 2 requirements C.Redesign and expand the XML schema D.Implement production quality map writer E.Develop demo map reader F.Deploy tools at select NASA data centers For preservation, we must get it right while the HDF4 library, tools, documentation, and expertise are around.

Develop Product (Tasks C & D) C: HDF4 File Content Maps  Have enough information to stand alone Described by schema D: Production Quality Map Writer Read HDF4 file and create Map Command-line options fine-tune behavior HDF4 Library New functions added to facilitate map creation Apr , 2012HDF/HDF-EOS Workshop XV11

Surprise! Expected hardest part to be support for retrieval and reconstruction of object data. In fact, making sure all user-created HDF4 objects were found and represented correctly was a bigger challenge. Existing tools didn’t always report same user-level information. “Correctness” can be subject to interpretation – not always able to know intent of file creator. Apr , 2012HDF/HDF-EOS Workshop XV12 Image from publications.usa.gov

Map from top down and bottom up Watch for extra parts “Over include” in map if any doubt (e.g., 2 palettes for 1 raster) Improve HDF4 library, tools, and documentation to address ambiguities Project Actions in Response Apr , 2012HDF/HDF-EOS Workshop XV13 User View Format View

HDF4 File Content Map Apr , 2012HDF/HDF-EOS Workshop XV14 Represents HDF4 Objects and Relationships Information needed to access and interpret object data in HDF4 file Select object data values included to help reader program verify binary data handled properly

E: Develop Demo Reader Apr , 2012HDF/HDF-EOS Workshop XV15 Developed by student at NSIDC  Only given Content Maps Written in Python Reader extracts object data from HDF4 file Output in ASCII (csv) or binary (numpy) Compares extracted data to values for verification in Content Map

, 2012HDF/HDF-EOS Workshop XV16 $ python hdfmr.py -f MOD29.A hdf.map.xml -e ALL Directory created :MOD29.A hdf.map.xml Processing : MOD29.A hdf.map.xml ----Array: Latitude Valid values: True ----Array: Longitude Valid values: True ----Array: Sea_Ice_by_Reflectance Valid values: True... Dumping complete $ ls -s MOD29.A hdf.map.xml_dump total Root-- G-ID_G1 G-ID_G2 G-ID_G3 Ice_Surface_Temperature-ID_A Root-- G-ID_G1 G-ID_G2 G-ID_G3 Ice_Surface_Temperature_Pixel_QA-ID_A Root-- G-ID_G1 G-ID_G2 G-ID_G3 Sea_Ice_by_Reflectance-ID_A Root-- G-ID_G1 G-ID_G2 G-ID_G3 Sea_Ice_by_Reflectance_Pixel_QA-ID_A Root-- G-ID_G1 G-ID_G2 Latitude-ID_A Root-- G-ID_G1 G-ID_G2 Longitude-ID_A2 $ cat *dump/*Latitude* # Array shape: (406, 271) Datum: >f , , , , , , , , , , , , , , , ,... Demo Reader Example

Releases & Support Apr , 2012HDF/HDF-EOS Workshop XV17 DateVersionComments July schema writer First official release Sept writerMinor bug fixes Nov schema writer Robustly handle empty SDS March 2012ECS Release 8.1 May 2012 (planned) writerMinor bug fixes ? Support 2 palettes with same reference number

HDF4 File Content Maps Apr , 2012HDF/HDF-EOS Workshop XV18 Content Map generation at GES-DISC Datasets mapped TOVS Pathfinder For example: ftp://disc1.gsfc.nasa.gov/data/s4pa/tovs/TOVSADNG/1986/330/ MERRA Model Output In progress TRMM AIRS

ECS Release 8.1 – March 2012 “Raytheon EED deployed the HDF4 File Content Maps capability as part of ECS Release 8.1. This capability wraps the Content Map Writer in the ECS Map Generation Server. ECS DAACs can choose whether or not to enable map generation in operations. With workload spec testing, seeing 2-3 maps/second under load and on unloaded system” -- Evelyn Nakamura, Raytheon “We installed our new big ECS software release which included the code for creating maps. The installers set it up to create maps (not in operations mode) for MOD10A1 and it produced 20 or 30 thousand. We haven't had a chance to look at them yet.” -- Doug Fowler, NSIDC Apr , 2012HDF/HDF-EOS Workshop XV19

Verification* Study (1/12 - 4/12) “Work with DAAC personnel to identify requirements that would produce appropriate and efficient methods of verifying, concurrent with operation activities, correctness of the HDF4 maps that are produced with the ECS 8.1 capability.” * The terms Verification and Validation are used interchangeably. Apr , 2012HDF/HDF-EOS Workshop XV20

Verification Study Activities Webinars with ASDC, LPDAAC, NSIDC, Raytheon Provide background on Mapping Project Gather input on requirements and concerns Collect sample datasets and generate Content Maps  Exposed 3 bugs: 1 in HDF4 library & 2 in Map Writer; Fixed. Discuss possible approaches Seek guidance from NASA on expectations regarding Map creation timeline and verification responsibilities Prototype possible approaches Demonstrate functionality and assess feasibility Apr , 2012HDF/HDF-EOS Workshop XV21

Verification Study Findings (1) Automate verification as much as possible. Focus verification at the ESDT version level. No definitive specification for user-level objects expected in a given HDF4 file. Scientists look at visualizations, not directly at data. Apr , 2012HDF/HDF-EOS Workshop XV22

Verification Study Findings (2) Every DAAC is different Flexibility in deciding when to generate Maps May need involvement of science teams to confirm correctness Content Maps should be produced near end of mission, or sooner if users want them. AMSR-E identified NSIDC involved with Mapping project from the start and comfortable with verification using demo reader Apr , 2012HDF/HDF-EOS Workshop XV23

Verification Study Findings (3) Interest in web-based tools is growing. XSLT stylesheets DAAC representatives are very concerned about long-term access to data. This is beyond the scope of the study But, something to keep in mind when considering different approaches Apr , 2012HDF/HDF-EOS Workshop XV24

, 2012HDF/HDF-EOS Workshop XV25 Verification Dilemma Translator to Reader DVD

, 2012HDF/HDF-EOS Workshop XV26 Possible Approach DVD Creator DVD

Applied to Content Maps Apr , 2012HDF/HDF-EOS Workshop XV27 bytestreams Objects & Relationships; User Metadata; Object Data retrieval & reconstruction information HDF4 File Object Data Reader HDF4 File Content Map (XML) request Replace this… HDF4 Retranslator Objects & Relationships; User Metadata; Object Data retrieval & reconstruction information HDF4 File with this…

Verification Recommendations (1) Check h4mapwriter errors Run xmllint Check for well-formed XML Validate Map conforms to schema These checks are possible now Apr , 2012HDF/HDF-EOS Workshop XV28

Verification Recommendations (2) Develop content map checker to check Filesize and checksum Object data values Values for verification Attribute values in Map Apr , 2012HDF/HDF-EOS Workshop XV29 What people expect to be enough

Verification Recommendations (3) Develop retranslator to create new HDF4 file Allows use of familiar tools (GrADS, IDL, HDFview, hdiff, …) If new file is not equivalent to original (from user perspective), investigate ASAP. Apr , 2012HDF/HDF-EOS Workshop XV30 Needed since no definitive source of correctness for original HDF4 files.

Verification Recommendations (4) Build content map checker and retranslator on common modular infrastructure. Apr , 2012HDF/HDF-EOS Workshop XV31

Not just for Preservation! “I find the HDF Map writer and reader very useful when I am in the discovery phase of new projects using HDF4 datasets. They enable me to analyze the full structure of CERES hdf4 datasets and ensure HDF Attributes from the archived HDF4 files are preserved in subsetted files. I am building a capability to subset MOPITT HDF4 data and am using them to help validate SDS data arrays over 4 dimensions. A team of consultants is working with ASDC on an experimental semantic database implemented on a 'grand challenge' scale. They are interested in using CERES datasets, but are unfamiliar with HDF. They are using the HDF4 map application to analyze the structure of proposed CERES datasets and to help extract metadata and data from target files.” --- Walt Baskin, ASDC Apr , 2012HDF/HDF-EOS Workshop XV32

Presentation “Take Away” HDF4 Content Maps are the best thing since sliced bread! More seriously … Content Maps can be created now and you may find them useful Ask questions and report problems We want to know about issues ASAP Feedback regarding proposed Verification approach very welcome Project report / recommendations due next week Apr , 2012HDF/HDF-EOS Workshop XV33

Project Contributors The HDF Group Ruth Aydt, Peter Cao, Jo Eads, Mike Folk, Joe Lee, Elena Pourmal, Binh-Minh Ribler, Kent Yang, and others NASA / DAACs Jeanne Behnke, Dan Marinelli, H. K. "Rama" Ramapriyan ASDC: Walt Baskin, Greg Cates, Gerald Lemay, Lindsay Parker, Steve Protack GES-DISC: Guang-Dih Lei, Chris Lynnes LP DAAC: Matt Martens, Bhaskar Ramachandran, Jody Rundell, Jim Vermeer NSIDC: Jonathan Crider, Ruth Duerr, Doug Fowler, Luis Lopez Raytheon Evelyn Nakamura, Lou Swentek, Abe Taaheri Apr , 2012HDF/HDF-EOS Workshop XV34

Acknowledgements This work was supported by Subcontract number under Raytheon Contract number NNG10HP02C, funded by the National Aeronautics and Space Administration (NASA) and by cooperative agreement number NNX08AO77A from the NASA. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of Raytheon or the National Aeronautics and Space Administration. Apr , 2012HDF/HDF-EOS Workshop XV35

The HDF Group Questions/comments? Apr , 2012HDF/HDF-EOS Workshop XV36