Reference Data Collection & Accuracy Assessment Dr. Russ Congalton & Kamini Yadav GFSAD30 Meeting, Wisconsin, 14 th -16 th July, 2015 Reference Map 1.

Slides:



Advertisements
Similar presentations
Selected results of FoodSat research … Food: what’s where and how much is there? 2 Topics: Exploring a New Approach to Prepare Small-Scale Land Use Maps.
Advertisements

Scaling Biomass Measurements for Examining MODIS Derived Vegetation Products Matthew C. Reeves and Maosheng Zhao Numerical Terradynamic Simulation Group.
Incorporating regional knowledge into global data sets: Some ideas for rice in Asia Andy Nelson Thursday11 th Sep, 2014.
Accuracy Assessment of Thematic Maps
North American Croplands Richard Massey & Dr. Teki Sankey.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Princeton University Global Evaluation of a MODIS based Evapotranspiration Product Eric Wood Hongbo Su Matthew McCabe.
Accuracy Assessment and Reference data Collection Kamini Yadav Dr. Russ Congalton.
Has EO found its customers? 1 Space Applications Institute Directorate General Joint Research Centre European Commission Ispra (VA), Italy
U.S. Geological Survey U.S. Department of Interior GFSAD30m Global Cropland Extent Products of Nominal 250 m (GCE V2.0) Updates Pardhasaradhi.
Module 2.1 Monitoring activity data for forests using remote sensing REDD+ training materials by GOFC-GOLD, Wageningen University, World Bank FCPF 1 Module.
1 SWALIM Workshop June, Nairobi Monitoring Land Cover Dynamics in sub-Saharan Africa H.D. Eva, A. Brink and D. Simonetti.
James C. Tilton Code Computational & Information Sciences and Technology Office NASA Goddard Space Flight Center January 30, 2015 update National.
U.S. Geological Survey U.S. Department of Interior GFSAD30m Global Cropland Extent Products of Nominal 250 m (GCE V2.0) Updates Pardhasaradhi.
Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.
Ramesh Gautam, Jean Woods, Simon Eching, Mohammad Mostafavi, Scott Hayes, tom Hawkins, Jeff milliken Division of Statewide Integrated Water Management.
North American Croplands: US Updates 21 May 2015 Richard Massey; Dr. Teki Sankey.
Centre for Geo-information Fieldwork: the role of validation in geo- information science RS&GIS Integration Course (GRS ) Lammert Kooistra Contact:
Assessment of Different Quantification Approaches and Application of Multiple Practices for a Single Farm Unit Dennis Haak, Agriculture & Agri-Food Canada.
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Operational Agriculture Monitoring System Using Remote Sensing Pei Zhiyuan Center for Remote Sensing Applacation, Ministry of Agriculture, China.
Seto, K.C., Woodcock, C.E., Song, C. Huang, X., Lu, J. and Kaufmann, R.K. (2002). Monitoring Land-Use change in the Pearl River Delta using Landsat TM.
National Mapping Division EROS Data Center U. S. Geological Survey U.S. Geological Survey Earth Resources Operation Systems (EROS) Data Center World Data.
Lu Liang, Peng Gong Department of Environmental Science, Policy and Management, University of California, Berkeley And Center for Earth System Science,
May 16-18, 2005MultTemp 2005, Biloxi, MS1 Monitoring Change Through Hierarchical Segmentation of Remotely Sensed Image Data James C. Tilton Mail Code 606*
1 Improving Statistics for Food Security, Sustainable Agriculture and Rural Development – Action Plan for Africa THE RESEARCH COMPONENT OF THE IMPLEMENTATION.
Remote Sensing for agricultural statistics Main uses and cost-effectiveness in developing countries Insert own member logo here Pietro Gennari, Food and.
Accuracy Assessment Having produced a map with classification is only 50% of the work, we need to quantify how good the map is. This step is called the.
Crop area estimation in Geoland 2 Ispra, 14-15/05/2012 I. Ukraine region II. North China Plain.
Selection of Multi-Temporal Scenes for the Mississippi Cropland Data Layer, 2004 Rick Mueller Research and Development Division National Agricultural Statistics.
North American Croplands Teki Sankey and Richard Massey Update: 11/20/2014.
U.S. Department of the Interior U.S. Geological Survey GFSAD30 Field Work Planning: Progress in Australia Pardha, Prasad, and Jun GFSAD30 monthly meeting,
North American Croplands Teki Sankey and Richard Massey Northern Arizona University Flagstaff, AZ.
North American Carbon Program Sub-pixel Analysis of a 1-km Resolution Land-Water Mask Source of Data: The North American sub-pixel water mask product is.
Ramesh Gautam, Jean Woods, Simon Eching, Mohammad Mostafavi Land Use Section, Division of Statewide Integrated Water Management California Department of.
Crop Mapping in Stanislaus County using GIS and Remote Sensing Ramesh Gautam, Jean Woods, Simon Eching, Mohammad Mostafavi Land Use Section, Division of.
North American Croplands: Updates Richard Massey; Teki Sankey.
Remotely sensed land cover heterogeneity
Chuvieco and Huete (2009): Fundamentals of Satellite Remote Sensing, Taylor and Francis Emilio Chuvieco and Alfredo Huete Fundamentals of Satellite Remote.
Reference Data Standard Protocols Kamini Yadav Dr. Russ Congalton.
14 ARM Science Team Meeting, Albuquerque, NM, March 21-26, 2004 Canada Centre for Remote Sensing - Centre canadien de télédétection Geomatics Canada Natural.
1 U.S. Department of the Interior U.S. Geological Survey LP DAAC Stacie Doman Bennett, LP DAAC Scientist Dave Meyer, LP DAAC Project Scientist.
Reference Data Standard Protocols Kamini Yadav Dr. Russ Congalton.
Croplands.org Aiming to release to public in next couple weeks. There will likely be a separation of work space from public space.
Accuracy Assessment: Building Global Cropland Reference Data Updates for March 2015 Kamini Yadav and Russ Congalton.
U.S. Department of the Interior U.S. Geological Survey GFSAD30 Field Work Planning: Progress in Australia Pardha, Prasad, and Jun GFSAD30 monthly meeting,
BOT / GEOG / GEOL 4111 / Field data collection Visiting and characterizing representative sites Used for classification (training data), information.
Cropland using Google Earth Engine
MODIS-based Cropland Classification in North America Teki Sankey and Richard Massey Northern Arizona University.
1 Global Mapping, Approaches, Issues, and Accuracies Russ Congalton & Kamini Yadav University of New Hampshire January 16, 2014 Menlo Park, CA.
U.S. Geological Survey U.S. Department of Interior GFSAD 30 Cropland Products of Nominal 250 m Using MODIS Data and Cropland Mapping Algorithms:
Object-Based Crop Identification for Collecting Reference Data from VHRI RHSeg and Ecognition Software Kamini Yadav and Russ Congalton.
North American Croplands Teki Sankey and Richard Massey.
Mapping Canada’s Rangeland and Forage Resources using Earth Observation Emily Lindsay MSc Candidate – Carleton University Supervisors: Doug J. King & Andrew.
Reference data & Accuracy Assessment Dr. Russ Congalton Kamini Yadav.
Cropland mapping in South America
Cropland Extent Mapping in South America Global Food Security - Support Analysis m Chandra Giri, Ying Zhong January 19 th, 2016.
U.S. Department of the Interior U.S. Geological Survey Chandra Giri Ying Zhong 7/15/2015 Cropland Extent Mapping in South America.
Dr. Russ Congalton & Kamini Yadav GFSAD30 Meeting, Menlo Park 19 th -21 st January, 2016 Reference Data Collection & Accuracy Assessment: Some Results.
Developing the Vegetation Drought Response Index (VegDRI): Monitoring Vegetation Stress from a Local to National Scale Brian Wardlow National Drought Mitigation.
1 Crop area estimation in Geoland 2 Ispra, 15/12/2011 H. Kerdiles, J. Gallego, O. Léo, MARS Unit, JRC Ispra Q. Dong, I. Piccard, R. Van Hoolst, VITO, BE.
US Croplands Richard Massey Dr Teki Sankey. Objectives 1.Classify annual cropland extent, Rainfed-Irrigated, and crop types for the US at 250m resolution.
ACCURACY ASSESSMENT SOUTH AMERICA, NORTH AMERICA KAMINI YADAV DR. RUSSELL CONGALTON.
26. Classification Accuracy Assessment
HIERARCHICAL CLASSIFICATION OF DIFFERENT CROPS USING
Accuracy Assessment of Thematic Maps
Prof. Nataliia Kussul, Space Research Institute NASU-SSAU
Konstantin Ivushkin1, Harm Bartholomeus1, Arnold K
The GISCO task force “Remote Sensing for Statistics”
Pan-European Assessment of Riparian Zones
Presentation transcript:

Reference Data Collection & Accuracy Assessment Dr. Russ Congalton & Kamini Yadav GFSAD30 Meeting, Wisconsin, 14 th -16 th July, 2015 Reference Map 1

Outline  Reference Data  Africa  Australia  North America  Europe & South America  VHRI data collection  RHSeg Work  eCognition/R Program  Accuracy Assessment (Australia)  Future work 2

Discussion with each Group on Reference data  Africa-Jun  Received reference data form  Had first conference call on April 16, 2015  Australia-Pardha  Received reference data form  Conference call on May 28, 2015, follow up call on 30 th June, 2015  North America-Richard/Teki  Received reference data form  Conference call on June 3, 2015  Europe-Aparna/Mutlu  Received reference data form  Unavailable to schedule the call  South America-Ying/Chandra  Not Approached/Received 3

Outcome of Reference Data Calls  These calls proved to be excellent to coordinate efforts between the mapping team and the accuracy team to determine mutually integrated needs and analysis.  Resulted in detailed knowledge about each mapping team and how they are generating their product. Will help us to perform validation for the respective products.  Determined actions items specific to each team.  Complied all the possible sources of reference data information. Discussed how to collect or build the necessary independent reference data for each continent. 4

Ground Data Sources  Ground data (collected by our team including Murali)  Received shape files for Ethiopia, Tanzania, Malawi, Rawanda, Burundi  India (South India, Rajasthan)  Ground data sourced from other projects (e.g., CORINE)  Curt Reynolds's field data from USDA/FAS  2015 corn map for South Africa and 2014 cotton / rice map for Australia GDA Corp  Ground data from global collection (e.g., Mutlu's)  To be purely use for accuracy assessment  Ground data from literature  Authors contacted to obtain the reference data they used or the map they produced for their project (if willing to share)  Reference data from other work (e.g., USDA CDL, Agriculture and Agri-Food Canada) Statistical Data (FAOSTAT)Ground DataExisting Cropland DataVery High Resolution Imagery 5

Africa Reference data Africa Australia North America 6 Source: With the help of Jim Tilton, received VHRI from NGA

Australia Reference Data  The statistical data from FAOSTAT is only used to evaluate the sub pixel/actual crop area for the year 2014  Before using the ground data, there is a need to define the crosswalk used to compare its classification scheme with the classified map  The Dynamic Land cover map of Australia (DLDC) is used for crop area comparison only. It is available for the years  Create random samples from DLDC map for accuracy assessment  Decide the strategy and perform accuracy assessment for the GCE v m  Select and perform reference classification of VHRI to generate ground data Africa Australia North America Agriculture Ecological Zones: Zone

North America Reference Data Type Number of Pixels (250m) Total Crop (Original)26,444,974 Total Crop (Buffered)6,415,797 Total Unused Pixels20,029,177 Africa Australia North America  CDL – 56m cropland layer from USDA-NASS has been resampled to 250m and used to build pure pixels (6,415,797 out of total 26, 444, 974 crop pixels). The remaining 20, 029, 177 unused pixels are potential ones where homogeneous samples can be chosen for creating validation dataset  NLCD 30m cropland layer can be a possible source to create a crop/No-crop mask  Also, ground data used for NLCD validation and accuracy assessment is possible source?? 8

Canada Reference Data  Generate reference data from AAFC cropland layer as homogeneous samples for 250m mapping (the same way as USDA NASS CDL layer)  Based on the field size, decide the homogeneous pixel criteria to label 3x3 30m or 250m pixel  Process VHRI in some areas and use to test the accuracy of the cropland layer Agriculture Ecological Zones: Zone 3-6, 12 & 13 9

AAFC Cropland Map Labels LabelCodeDefinition Agriculture (undifferentiated) 120 Agricultural land, including annual and perennial crops; and would exclude grassland. This class is mapped only if the distinction of sub-agricultural covers (classes ) is not possible. Pasture / Forages122 Periodically cultivated. Includes tame grasses and other perennial crops such as alfalfa and clover grown alone or as mixtures for hay, pasture or seed. Too Wet to be Seeded130 Agricultural fields that are normally seeded that remain unseeded due to excess spring moisture. Fallow131 Plowed and harrowed fields that are left unsown for the growing season Cereals132 This class is mapped only if the distinction of sub-cereal covers (classes ) is not possible. Barley133 Other Grains134 Millet135 Oats136 Rye137 Spelt138 Triticale139 Wheat140 This sub-cereal class is mapped only if the distinction of sub- wheat covers (classes ) is not possible Switch grass141 Winter Wheat145 Spring Wheat146 Corn147 Tobacco148 Ginseng149 Oilseeds150 This class is mapped only if the distinction of sub-oilseed covers (classes ) is not possible. Borage151 Camelina152 Canola / Rapeseed153 Flaxseed 154 LabelCodeDefinition Mustard155 Safflower156 Sunflower157 Soybeans158 Pulses160 This class is mapped only if the distinction of sub-pulse covers (classes ) is not possible. Peas162 Beans167 Lentils174 Vegetables175 This class is mapped only if the distinction of sub-vegetable covers (classes ) is not possible. Tomatoes176 Potatoes177 Sugar beets178 Other Vegetables179 Fruits180 This class is mapped only if the distinction of sub-fruit covers (classes ) is not possible. Berries181 Cranberry183 Orchards188 Other Fruits189 Vineyards190 Hops191 Sod192 Herbs193 Nursery194 Buckwheat195 Canary Seed196 Hemp197 Vetch198 Other Crops199 Issues in CrossWalking from AAFC to Map Classification: Standard Criteria?? AAFC: Agriculture & Agri Food Canada 10

Reference data from Literature Paper TitleJournalContactData 1 Crop area mapping in West Africa using landscape stratification of MODIS time series and comparison with existing global land products International Journal of Applied Earth Observation and Geoinformation, Volume 14, Issue 1, February 2012, Pages 83–93 rad.fr, mail.com A ground data set collected during the 2009 and 2010 cropping seasons (744 GPS waypoints at the validation sites) 2 Generating plausible crop distribution maps for Sub- Saharan Africa using a spatially disaggregated data fusion and optimization approach Agricultural Systems, Volume 99, Issues 2–3, February 2009, Pages 126– 140 ORG Crop distribution map of sub Saharan Africa 3 Generating global crop distribution maps: From census to grid Agricultural Systems, Volume 127, May 2014, Pages 53–60 ORG Global Rainfed/Irrigated crop map 4 Disaggregating and mapping crop statistics using hyper temporal remote sensing International Journal of Applied Earth Observation and Geoinformation, Volume 12, Issue 1, February 2010, Pages 36–46 Wheat, sunflower, Barley crop maps of southern Spain 5Global rain-fed, irrigated, and paddy croplands (GRIPC) J. Meghan Salmon, Mark A. Friedl, Steve Frolking, Dominik Wisser, Ellen M. Douglas usercontent.com/ u/ /GRIP Cmap.zip. Irrigated/Rainfed Map 6 Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data International Journal of Remote Sensing Volume 34, Issue 7, 2013 ey.edu Landsat/MODIS Mapping 91,000 Training samples; 38,000 Test samples 11

Reference data from Literature Paper TitleJournalContactData 7 Data Mining, A Promising Tool for Large-Area Cropland Mapping IEEE Journal of selected topics in applied earth observations and remote sensing, vol. 6, no. 5, October 2013 rad.fr The field surveys were conducted in Mali during the 2009 and 2010 crop seasons (980 Way points) 8 GlobeLand30 ( ex.aspx) ISPRS Journal of Photogrammetry and Remote Sensing 103 (2015) 7–27 154,587 pixel samples 2010 year 9 Mapping and discrimination of soybean and corn crops using spectrotemporal profiles of vegetation indices International Journal of Remote Sensing, 2015, Vol. 36, No. 7, 1809–1824, mail.com Field data from 19 different croplands (state of Paraná, located in the South of Brazil, between) 10 Improving Crop Area Estimation in West Africa Using Multiresolution Satellite Data Proceedings of Global Geospatial Conference 2013 ni-wuerzburg.de field survey conducted between May and July Impact of feature selection on the accuracy and spatial uncertainty of per-field crop classification using Support Vector Machines ISPRS Journal of Photogrammetry and Remote Sensing 85 (2013) 102–119 wuerzburg.de Extensive field survey conducted in Four test sites in Middle Asia. 12 MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets Remote Sensing of Environment 114 (2010) 168– Training sites globally 13 Cropland for sub-Saharan Africa: A synergistic approach using five land cover data sets Calibrated synergy map for Africa ( 2010GL046213/abstract) 2553 samples distributed over Africa 12

RHSeg Algorithm: Segmentation 13  The conversion of NTF format to Tiff using GDAL does not match with the one converted using Erdas/ArcMap (for e.g. FCC (GDAL) & NCC (Erdas)satellite imagery on Left have different extent)  The output from RHSeg does not overlay over the Image in eCognition  The issue of converting the RHSeg results from raster to vector format because segmentation results efficiently represented in vector format RHSEG result overlay on World-View-1 (FCC, NCC)

Adding Thematic Layers to RHSeg Result 14  The object labels from RHSeg segmentation need to be appended to a number of parameters  Jim Tilton has worked on NDVI layer which can be computed either prior to processing with RHSeg (pre-processing) – or the NDVI can be computed on the RHSeg output (post-processing). Border Index, Homogeneity Contrast, Band Values, Rectangular Fit…etc. eCognition segmentation output Example

Classification with Random Forest  R Program  RHSeg segmentation result is used for Random forest classification in R  Either the variables can be used in raster thematic form or attached to the region objects  Allows integration of the RHSeg output into the Random forest classification  Useful variables will be selected based on accuracy metrics predicted in R  eCognition Software 15  The random selection of objects created from multi-resolution segmentation are used as training data to perform random forest classification

North America Reference Data Speltz Misc. Vegetables & fruits Water Melon Hops Sod Switch Grass Wildflowers Other tree Crops Pistachio Carrot Garlic Cantaloupes Pumpkin Brocolli Caneberries Cranberries 16 The Study Area showing World View-1 imagery and Crop Database Layer, Yolo, California CDL Class Labels Crosswalk between CDL Labels and classification Methodology Development

North America Reference Data 17 Random distribution of Samples: Vegetation samples appeared more due larger proportion of the area is vegetation in this particular scene The Error Matrix

Accuracy Assessment of GCE v.2 Australia Map  1/3rd Validation set Received: 1,118 Ground Reference Data Locations  The ground data were collected from Zone 3 to Zone 9. Some of the locations are not on the GCE map  Both scale 1 (90x90m) and scale 2 (250x250 m) labels are part of the ground reference data observations  Ground data label: Land prepared for season 2 – Fallow or Rainfed cropland?? 18

Crosswalk between Ground and Map Labels GCE v.2 Classification SchemeGround data Classification 1.Rainfed Single Crop, all crops 2.Rainfed Single Crop, Pastures Rainfed Grazing Rainfed Continuous Orchard Rainfed Single flowers Rainfed Plantation 3. Irrigated Single Crop, Double Crop, all crops 4.Irrigated Single Crop Pastures Irrigated single vegetables 5.Irrigated Continuous, Orchards 6.Fallow Land prepared for season 2 Season 2 Crop Rainfed/Irrigated Single Land prepared for season 2 No-Crop 1. Rf, croplands 2. Rf, Pastures 3. Irrigated croplands 4. Irrigated pastures 5. Cropland, Irrigated, Continuous, Orchard?? 6. Fallow Africa Australia North America 19 Group11 Class Name Category 1AlfafaPasture 2BarleyCrops 3BeansCrops 4CanolaCrops 5LentilsCrops 6LupinCrops 7OatsCrops 8PeasCrops 9WheatCrops 10 Cropland, Irrigated, Continuous, Orchardorchard /Continuous crops 11Cropland single, sown-pasturePasture 12 Cropland, single, land prepared for S2Crops 13Crop harvestedCrops 14Rainfed vegetablesCrops 15PlantationCrops 16Cropland, RF, single, CropCrops 20RF, GrazingPasture 30no cropNon crop Final 6 classes to generate Error Matrix

Accuracy Assessment for SMT and ACCA algorithm generated maps Crop/No-Crop Accuracy The Error matrix for 6 Classes after truncating the samples ACCA SMT 20

More Possible Error Matrices 6 Classes + No-Crop ACCA6 Classes + No-Crop SMT 4 Classes(Merge Rain fed & Irrigated) ACCA 4 Classes(Merge Rain fed & Irrigated) SMT 21 Irrigation map has been used as mask to label ground samples

90 m Samples Verification over Google Earth 22 90m Samples Not Good for 250m

250m Samples Verification from Google Earth May result in Spatial Autocorrelation 23 Some samples are very near to the road and need to be placed more in the center of the field Most of the 250m samples are valid, a few of them need to be revised Most of the 90m samples are good for 30m Map validation but not for 250m Map

Consider Scale 2: 250m Samples only The Error matrix for 7 Classes after removing 90m samples ACCA SMT 24

Conclusion 25  The ground data with 250m homogeneous area are mostly good but the 90m samples do not cover 250m homogeneity in all samples  A number of Error Matrices can be generated to present the accuracy. The rows and columns can be reduced or expanded in detail, as necessary  The objective is to generate a valid, balanced, statistically sound error matrix with proportional representative number of samples for each class

REMEMBER 26  Anyone can generate an error matrix with any data.  Just because there is an error matrix does not mean that there is a valid accuracy assessment.  We have already provided you with a number of resources including a full reference data collection document to help you.  Our goal is to work with each team to make sure that you are thinking about all the requirements now so that our accuracy assessments are valid.

27 Some Key Topics  Classification Scheme  Sample Unit  Sample Size  Sampling Scheme  Spatial Autocorrelation

28 1. Classification Scheme  Key to any mapping project.  Must be done at beginning of project.  We have done this for our project – BUT…  Requirements of the Classification Scheme:  Meets the user’s needs  Consists of both labels and rules (definitions) that are  Mutually exclusive  Totally exhaustive  Hierarchical  Includes a minimum mapping unit.  Issue for us is crosswalking all the different classification schemes used for the various reference data sets all of us are using to our map classification scheme.  Can introduce serious error!

29 2. Sample Unit  Must consider positional accuracy and mmu  We have selected a 3x3 TM pixels (90m x 90m) sample unit that is homogeneous for the Landsat accuracy assessment.  We need at least a single MODIS pixel (250m x 250m) that is homogeneous for the MODIS accuracy assessment.

30 3. Sample Size  A useful rule of thumb: 50 sample units per map class  Need to balance proportion of samples in each map class with insuring that enough samples are taken per map class to know the accuracy of each map class  Need enough samples to insure good distribution across the map (avoid spatial autocorrelation)  Samples MUST BE INDEPENDENT of training data  We all must keep this in mind. This is the #1 reason for our coordination calls with all the mapping teams.  Need Justin’s help here.  If assess map of the continent, the results are for the entire continent. If need eco-region or country estimate, need to do assessment at that level.

31 4. Sampling Scheme

32 6. Spatial Autocorrelation  Spatial autocorrelation occurs when the presence, absence, or degree of a certain characteristic affects the presence, absence, or degree of the same characteristic in neighboring units (Cliff and Ord 1973)  Samples must be adequately spaced apart or they will be spatially autocorrelated.  This is true whether we are collecting reference data on the ground or from very high resolution imagery.

Future Work 33  Implement Object statistics on RHSeg Results  Resolve the issue of different extent of satellite imagery  Perform random forest on RHSeg result in R  Generate pure reference samples from AAFC cropland layer for Canada  Generate reference samples from CDL for North America  Continue our coordination calls with each mapping team

34