Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Multiple Regression.
Brief introduction on Logistic Regression
Forecasting Using the Simple Linear Regression Model and Correlation
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 10 Curve Fitting and Regression Analysis
Spatial Analysis and GIS Stephanie Watson Marine Mapping User Group (MMUG) Coordinator California Department of Fish and Game.
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
Objectives (BPS chapter 24)
Basic geostatistics Austin Troy.
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
The Simple Regression Model
Sampling Distributions
Applied Geostatistics
Chapter Topics Types of Regression Models
Deterministic Solutions Geostatistical Solutions
Chapter 11 Multiple Regression.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Why Geography is important.
Ordinary Kriging Process in ArcGIS
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Applications in GIS (Kriging Interpolation)
Simple Linear Regression Analysis
Correlation & Regression
1 Assessment of Imprecise Reliability Using Efficient Probabilistic Reanalysis Farizal Efstratios Nikolaidis SAE 2007 World Congress.
Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Correlation and Linear Regression
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Spatial Interpolation of monthly precipitation by Kriging method
Using ESRI ArcGIS 9.3 Spatial Analyst
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
$88.65 $ $22.05/A profit increase Improving Wheat Profits Eakly, OK Irrigated, Behind Cotton.
Practical Statistical Analysis Objectives: Conceptually understand the following for both linear and nonlinear models: 1.Best fit to model parameters 2.Experimental.
Edoardo PIZZOLI, Chiara PICCINI NTTS New Techniques and Technologies for Statistics SPATIAL DATA REPRESENTATION: AN IMPROVEMENT OF STATISTICAL DISSEMINATION.
Interpolation Tools. Lesson 5 overview  Concepts  Sampling methods  Creating continuous surfaces  Interpolation  Density surfaces in GIS  Interpolators.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Geographic Information Science
12.1 Heteroskedasticity: Remedies Normality Assumption.
Analysis of Residuals Data = Fit + Residual. Residual means left over Vertical distance of Y i from the regression hyper-plane An error of “prediction”
Spatial Statistics in Ecology: Continuous Data Lecture Three.
GEOSTATISICAL ANALYSIS Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Chimiometrie 2009 Proposed model for Challenge2009 Patrícia Valderrama
Final Project : 460 VALLEY CRIMES. Chontanat Suwan Geography 460 : Spatial Analysis Prof. Steven Graves, Ph.D.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
Statistical Surfaces Any geographic entity that can be thought of as containing a Z value for each X,Y location –topographic elevation being the most obvious.
L15 – Spatial Interpolation – Part 1 Chapter 12. INTERPOLATION Procedure to predict values of attributes at unsampled points Why? Can’t measure all locations:
Lecture 6: Point Interpolation
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
1 Heteroskedasticity. 2 The Nature of Heteroskedasticity  Heteroskedasticity is a systematic pattern in the errors where the variances of the errors.
Artificial Neural Networks (ANN). Artificial Neural Networks First proposed in 1940s as an attempt to simulate the human brain’s cognitive learning processes.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
INTERPOLATION Procedure to predict values of attributes at unsampled points within the region sampled Why?Examples: -Can not measure all locations: - temperature.
Regression Analysis Part A Basic Linear Regression Analysis and Estimation of Parameters Read Chapters 3, 4 and 5 of Forecasting and Time Series, An Applied.
Synthesis.
Spatial statistics: Spatial Autocorrelation
Correlation, Bivariate Regression, and Multiple Regression
Measures of dispersion
26134 Business Statistics Week 5 Tutorial
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
Regression Computer Print Out
LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis
Presentation transcript:

Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU

Non-spatial methods are increasingly used to model and map continuous spatial properties –Artificial Neural Networks, Decision Trees, Expert Systems... –These can use more ancillary variables than explicitly spatial methods Usually assessed using non-spatial global error measures –Summarise many data points –Cannot easily identify where model is correct

Error residuals may be mapped –But usually points –Difficult to visually identify spatial clustering –Large point symbols no multi-scale no quantification Can use spatial error analysis to detect clusters of similar prediction –Use these areas with confidence –Areas with unacceptable error indicate need for different variables or approach

To spatially assess model error a method should: –Locally calculate omission, commission & total error in original data units in one assessment one dataset each –Assess error for unsampled locations generate spatially continuous surfaces for easier interpretation –Provide confidence information about the assessment uncertainty estimate

Possible approaches: –Mean, StdDev, Range for spatial window three attributes to interpret for each of omission, commission and total error mean will often not equal zero –Co-variograms global assessment work only for sampled locations –Local Spatial Autocorrelation –Geographically Weighted Regression

Local Spatial Autocorrelation: –indices of spatial association –easy to interpret –multi-scale –calculate residuals and assess spatial clustering –some indices calculable for unsampled locations Getis-Ord G i *, Openshaw’s GAM

Local Spatial Autocorrelation: –Give difference from expected (global mean) mean will not normally be zero –Must analyse omission & commission separately partly cancel out leads to numeric and sample density problems – confidence information

Geographically Weighted Regression –multivariate spatial analysis in the presence of non-stationarity –perform regression within a moving spatial window –multi-scaled –can directly assess residual error without prior calculation –simultaneous omission, commission and total error assessment –estimates for unsampled locations –r 2 parameter gives confidence information

The approach: –Ordinary Least Squares Y = a + bX –calculated for circles of increasing radius across the entire dataset minimum 5 sample points –no spatial weight decay with distance does not force an assumed distribution on the data –optimal spatial scale when r 2 is maximum

Interpreting regression parameters for error: –error is the square root of the area between the fitted and the optimal lines –this is bounded by the min and max of the predicted distribution as b approaches 1 the intercept approaches +/- infinity causing extremely large error values –use the intersection of the fitted line with the optimal line (1:1, Y=X) to determine omission & commission

The r 2 parameter –high r 2 means reliable b parameters and therefore reliable error measures –low values indicate low confidence caused by dispersed data values these areas cannot be used as b is meaningless

Example application –feed-forward ANN to infer aluminium oxide –used topographic and vegetation indices –1100 km 2 area at Weipa, Far North Queensland, Australia –16000 drill cores –30.4% accurate within +/- 1 original unit –48.7% accurate within +/- 2 original units

Subset of study area

Total error : 4, 7 & 10 cell radius

Total OmissionCommission

Optimal spatial lag Max r 2

r 2 = red omission = green commission = blue Visualising error distribution with confidence information

Limitations –sample density & distribution –outliers data & spatial cause low r 2 landscape does not operate in circles

Extended Utility –can use the regression parameters to correct the ANN prediction –similar to universal kriging but ANN allows for the inclusion of more ancillary variables –have not taken into account r 2 values

Comparison with universal kriging

Conclusions: –GWR allows the spatial investigation of non-spatial model error –calculates total, omission and commission error in one assessment, with confidence information –identified locations of good and poor model prediction in a densely sampled dataset not immediately obvious without GWR –currently exploratory significance tests would be useful