Robust GW summary statistics & robust GW regression are used to investigate a freshwater acidification data set. Results show that data relationships can.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Psychology 202b Advanced Psychological Statistics, II February 10, 2011.
GRA 6020 Multivariate Statistics The regression model OLS Regression Ulf H. Olsson Professor of Statistics.
Deterministic Solutions Geostatistical Solutions
Data and Interpretation What have you learnt?. The delver into nature’s aims Seeks freedom and perfection; Let calculation sift his claims With faith.
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Spatial Interpolation
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Basics: Notation: Sum:. PARAMETERS MEAN: Sample Variance: Standard Deviation: * the statistical average * the central tendency * the spread of the values.
Why Geography is important.
Ordinary Kriging Process in ArcGIS
Geostatistics Mike Goodchild. Spatial interpolation n A field –variable is interval/ratio –z = f(x,y) –sampled at a set of points n How to estimate/guess.
Applications of Nonparametric Survey Regression Estimation in Aquatic Resources F. Jay Breidt, Siobhan Everson-Stewart, Alicia Johnson, Jean D. Opsomer.
Mapping Chemical Contaminants in Oceanic Sediments Around Point Loma’s Treated Wastewater Outfall Kerry Ritter Ken Schiff N. Scott Urquhart Dawn Olson.
Prelude of Machine Learning 202 Statistical Data Analysis in the Computer Age (1991) Bradely Efron and Robert Tibshirani.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Bootstrapping applied to t-tests
Correlation & Regression
Objectives of Multiple Regression
Quantitative Methods Heteroskedasticity.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Model Building III – Remedial Measures KNNL – Chapter 11.
Seabed type clustering Identification of seabed type (such as mud, sand and rock) is of value in many applications including seabed mapping, coastal management.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Robust GW summary statistics & robust GW regression are used to investigate spatial variation and relationships in a freshwater acidification critical.
Extension to Multiple Regression. Simple regression With simple regression, we have a single predictor and outcome, and in general things are straightforward.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Geographic Information Science
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
17 May 2007RSS Kent Local Group1 Quantifying uncertainty in the UK carbon flux Tony O’Hagan CTCD, Sheffield.
Mathematical Models & Optimization?
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Concepts and Applications of Kriging
Multivariate Data Analysis Chapter 1 - Introduction.
Robust Estimators.
Bootstrap Event Study Tests Peter Westfall ISQS Dept. Joint work with Scott Hein, Finance.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Esri UC2013. Technical Workshop. Technical Workshop 2013 Esri International User Conference July 8–12, 2013 | San Diego, California Concepts and Applications.
Exposure Assessment for Health Effect Studies: Insights from Air Pollution Epidemiology Lianne Sheppard University of Washington Special thanks to Sun-Young.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
By Russ Frith University of Alaska at Anchorage Civil Engineering Department Estimating Alaska Snow Loads.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
Hybrid Bayesian Linearized Acoustic Inversion Methodology PhD in Petroleum Engineering Fernando Bordignon Introduction Seismic inversion.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
JMP Discovery Summit 2016 Janet Alvarado
Chapter 7. Classification and Prediction
Correlations and simple regression analysis
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Multivariate Analysis Lec 4
…Don’t be afraid of others, because they are bigger than you
Inference for Geostatistical Data: Kriging for Spatial Interpolation
CHAPTER 29: Multiple Regression*
Microeconometric Modeling
Concepts and Applications of Kriging
Multiple Regression Chapter 14.
Presentation transcript:

Robust GW summary statistics & robust GW regression are used to investigate a freshwater acidification data set. Results show that data relationships can vary across space, but this perception can depend on only a few influential outlying observations. A new robust GW regression model is used to cater for such phenomena. GW principal components analysis is used to investigate the changing local structure in multivariate spatial data sets. Using Dublin voter turnout data, the GW PCA methodology is advanced to incorporate: (i) automatic bandwidth selection, (ii) tests for its application and (iii) visualisation techniques for its output. Furthermore, a robust GW PCA is developed to detect multivariate spatial outliers. This new extension is demonstrated using the GSI SURGE project soils geochemistry data for Dublin (Fig. 1). Geographically weighted (GW) models: advances in investigating spatial heterogeneity Paul Harris, Martin Charlton & Chris Brunsdon* Studies for spatial exploration, visualisation & outlier detection Inference related problems in GW regression are investigated using new locally-compensated GW regressions and GW PCA to address strong criticisms of GW regression regarding local collinearity. A guide to fitting and interpreting GW regressions in this respect is given. Simulated and Dublin voter turnout data sets are used in these studies (Figs. 3 & 4). Bootstrap methods are also in development with regard to improved inference in GW regression. A Bayesian spatially-varying coefficient (SVC) model is also newly developed to provide an alternative to GW regression. The Bayesian SVC model is an entirely different approach to the nonparametric GW regression model and benefits from a better handle on uncertainty. The SVC model is performance tested for spatial prediction/inference using a house price data set. GW regression as a spatial predictor is assessed in three complementary studies. First, its performance is compared to: (i) standard regression; (ii) standard kriging (from geostatistics) and (iii) new hybrids (kriging combined with GW regression), using simulated data sets. Results show promise with the hybrids but standard kriging should be preferred. In a second study, GW regression with a heteroskedastic error variance is linked and compared with a corresponding kriging model. This new GW predictor is able to provide relatively accurate confidence intervals. In a third study, indicator kriging is combined with GW regression to form a second novel hybrid that also provides promisingly accurate confidence intervals. The latter studies used freshwater acidification data. Kriging with GW variograms is a novel geostatistical-nonparametric hybrid. This non-stationary variogram technique generalises moving window kriging (MWK) where classic estimators are replaced with information-rich, GW variogram estimators. Results indicate (using four pollution data sets) much promise in the new predictor (Fig. 2). Related studies visualise outputs from non-stationary variogram predictors using comap, that include a new robust MWK model. Optimal sample re-design. Initial work used GW summary statistics and a location-allocation algorithm. Current work uses the new GW predictors above or GW PCA, all with simulated annealing to achieve both univariate and multivariate optimisations. All studies use the GSI SURGE project Dublin soils data. Research presented in this poster was funded by a Strategic Research Cluster Grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. The authors gratefully acknowledge this support. Studies for spatial prediction, its uncertainty & sample re-design Studies for model inference & statistical properties Fig. 1: Multivariate spatial outlier detection with GW PCA The GWmodel R software package The fundamental science of the above studies is transferred to studies in applied science on the StratAG project via an open source R package of statistical computing code. The R package includes existing GW models and our newly developed GW models above (including advancements from other members of the spatial heterogeneity team, not presented here). This R package will be mirrored with a set of GW modelling tools for ESRI’s ArcGIS (written in Python). * Visiting professor – University of Liverpool, UK ModelRankModelRank Std kriging4Std MWK2 Std nonlinear kriging3MWK with GW variograms1 Fig. 2: Classic local variograms (top), GW variograms (middle), global variogram (bottom left) and kriging results Fig. 4: The use of GW PCA to map matrix conditional numbers ( > 30 suggests a significant local collinearity effect). Dublin voter turnout covariate data. Fig.3. Example output from simulation studies for investigating local collinearity in GW regression. Actual parameter B1 with low levels of spatial smoothness coupled with strong correlations with B0, B2 and B3 (the other regression parameters).