Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Data Quality Screening Service for Remote Sensing Data Christopher Lynnes, NASA/GSFC Edward Olsen, NASA/JPL Peter Fox, RPI Bruce Vollmer, NASA/GSFC Robert.

Similar presentations


Presentation on theme: "A Data Quality Screening Service for Remote Sensing Data Christopher Lynnes, NASA/GSFC Edward Olsen, NASA/JPL Peter Fox, RPI Bruce Vollmer, NASA/GSFC Robert."— Presentation transcript:

1 A Data Quality Screening Service for Remote Sensing Data Christopher Lynnes, NASA/GSFC Edward Olsen, NASA/JPL Peter Fox, RPI Bruce Vollmer, NASA/GSFC Robert Wolfe, NASA/GSFC Shahin Samadi, NASA/GSFC Contributions from : N. Most, Y.-I. Won, T. Hearty, R. Strub, S. Ahmad, S. Zednik, M. Hegde and A. Rezaiyan-Nojani Funded by : NASA’s Advancing Collaborative Connections for Earth System Science (ACCESS) Program

2 Outline  Satellite Data Quality: Why So Difficult?  Making Quality Information Usable via the Data Quality Screening Service (DQSS)  How does DQSS Scale?  Future Directions for DQSS

3 Satellite Data Quality: Why So Difficult?

4 Earth Observing System Satellite sensors collect data for for climate science research

5 EOS satellite data is archived and distributed by a network of data centers

6 Satellite-borne collection breeds a tendency to keep low-quality data around  Satellites are very expensive to launch and operate.  It takes years to understand the data characteristics.  Data are used for many different purposes. AIRS – Atmospheric Infrared Sounder atmospheric temperature atmospheric moisture trace gas composition

7 Sometimes there are a lot of low-quality pixels. AIRS ParameterBest (%) Good (%) Do Not Use (%) Total Precipitable Water38 24 Carbon Monoxide64729 Surface Temperature54451

8 Quality schemes can be relatively simple… Total Column Precipitable WaterQuality Level BestGoodDo Not Use kg/m 2

9 …or they can be fairly complicated…or they can be fairly complicated Hurricane Ike, viewed by the Atmospheric Infrared Sounder (AIRS) PBest : Maximum pressure of “Best” quality values in temperature profiles Air Temperature at 300 mbar

10 Quality flags are also sometimes packed together into bytes Cloud Mask Status Flag 0=Undetermined 1=Determined Cloud Mask Cloudiness Flag 0=Confident cloudy 1=Probably cloudy 2=Probably clear 3=Confident clear Day / Night Flag 0=Night 1=Day Sunglint Flag 0=Yes 1=No Snow/ Ice Flag 0=Yes 1=No Surface Type Flag 0=Ocean, deep lake/river 1=Coast, shallow lake/river 2=Desert 3=Land Bitfield arrangement for the Cloud_Mask_SDS variable in atmospheric products from Moderate Resolution Imaging Spectroradiometer (MODIS)

11 Current user scenarios...Current user scenarios...  Nominal scenario  Search for and download data  Locate documentation on handling quality  Read & understand documentation on quality  Write custom routine to filter out bad pixels  Equally likely scenario (especially in certain user communities)  Search for and download data  Assume that quality has a negligible effect Repeat for each user

12 Using bad quality data is in general not negligible Total Column Precipitable WaterQuality BestGoodDo Not Use kg/m 2

13 Heavy cloud impairs retrievalsHeavy cloud impairs retrievals AIRS True Color, 2008-09-10

14 Neglecting quality may introduce bias (a more subtle effect) AIRS Relative Humidity Comparison against Dropsonde with and without Applying PBest Quality Flag Filtering

15 Making Quality Information Usable via the Data Quality Screening Service (DQSS).

16 The DQSS filters out bad pixels for the user  Default user scenario  Search for data  Select science team recommendation for quality screening (filtering)  Download screened data  More advanced scenario  Search for data  Select custom quality screening parameters  Download screened data

17 DQSS segregates bad-quality pixels into a separate array Original data array (Total column precipitable water) Mask based on user criteria (Quality level < 2) Good quality data pixels retained Rejected data points Output file has the same format and structure as the input file (except for the extra rejected-data fields)

18 Visualizations help users see the effect of different quality filters Best quality only Best + Good quality

19 DQSS encodes the science team recommendations on quality screening  Straightforward : “Use Best-only for data assimilation uses; Use Best+Good for climatic studies”  More complicated : “Use only VeryGood over land; Use Marginal+Good+VeryGood over ocean”

20 How Does DQSS Scale?How Does DQSS Scale?

21 Ontology-driven design helps scale with diversity of quality schemes Variable Types & Properties Quality View

22 The Quality View links data variables with quality variables.

23 The code queries the ontology to determine the type of quality variable needed and selects the corresponding module to generate the quality mask.

24 Execution at distributed data centers helps to scale with demand The jury is out as to whether Java performance will scale to meet demand. Screening takes place at the data centers where the data reside to avoid excessive data transfers

25 Future directionsFuture directions

26 Operationalize DQSSOperationalize DQSS  Promotion to operations for AIRS data is scheduled for Aug 2010  DQSS will be offered through the main data search interface at the data center  Usage Metrics will be collected:  Basic usage  What criteria are used  Screening invocation is a simple URL GET  What the “ yield” was for the screening

27 Extend DQSS to Other DatasetsExtend DQSS to Other Datasets  Moderate Resolution Imaging Spectroradiometer (MODIS)  Microwave Limb Sounder (MLS)  Ozone Monitoring Instrument (OMI)  Cloudsat  Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observation (CALIPSO)

28 Collaborative Screening Use Case?Collaborative Screening Use Case? Define custom criteria Tag custom criteria Browse tagged criteria Modify tagged criteria Dr. Alice Dr. Bob

29 DQSS RecapDQSS Recap  Screening satellite data currently is troublesome and time consuming for users  The Data Quality Screening System will provide an easy-to-use service  The result should be:  More attention to quality on users’ part  More accurate handling of quality information…  …With less user effort


Download ppt "A Data Quality Screening Service for Remote Sensing Data Christopher Lynnes, NASA/GSFC Edward Olsen, NASA/JPL Peter Fox, RPI Bruce Vollmer, NASA/GSFC Robert."

Similar presentations


Ads by Google