Strength of Spatial Correlation and Spatial Designs: Effects on Covariance Estimation Kathryn M. Irvine Oregon State University Alix I. Gitelman Sandra.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Use of Estimating Equations and Quadratic Inference Functions in Complex Surveys Leigh Ann Harrod and Virginia Lesser Department of Statistics Oregon State.
Predicting the likelihood of water quality impaired stream reaches using landscape scale data and a hierarchical methodology Erin Peterson Geosciences.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Multi-Lag Cluster Enhancement of Fixed Grids for Variogram Estimation for Near Coastal Systems Kerry J. Ritter, SCCWRP Molly Leecaster, SCCWRP N. Scott.
MSS/MBSS # 1 Joint work with Erin E. Peterson, Andrew A. Merton, David M. Theobald, and Jennifer A. Hoeting All of Colorado State University, Fort Collins,
Model- vs. design-based sampling and variance estimation on continuous domains Cynthia Cooper OSU Statistics September 11, 2004 R
Nonparametric, Model-Assisted Estimation for a Two-Stage Sampling Design Mark Delorey, F. Jay Breidt, Colorado State University Abstract In aquatic resources,
Workshop 2: Spatial scale and dependence in biogeographical patterns Objective: introduce the following fundamental concepts on spatial data analysis:
1 STARMAP: Project 2 Causal Modeling for Aquatic Resources Alix I Gitelman Stephen Jensen Statistics Department Oregon State University August 2003 Corvallis,
Aim Considerable relations have recently been given to the environmental hazardous caused by agricultural chemicals such as excess fertilizers. Evaluating.
Correlation and Autocorrelation
Erin E. Peterson Postdoctoral Research Fellow CSIRO Mathematical and Information Sciences Division Brisbane, Australia May 18, 2006 Regional.
State-Space Models for Within-Stream Network Dependence William Coar Department of Statistics Colorado State University Joint work with F. Jay Breidt This.
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
Semiparametric Mixed Models in Small Area Estimation Mark Delorey F. Jay Breidt Colorado State University September 22, 2002.
Deterministic Solutions Geostatistical Solutions
Applied Geostatistics
What is a Multi-Scale Analysis? Implications for Modeling Presence/Absence of Bird Species Kathryn M. Georgitis 1, Alix I. Gitelman 1, Don L. Stevens 1,
1 Accounting for Spatial Dependence in Bayesian Belief Networks Alix I Gitelman Statistics Department Oregon State University August 2003 JSM, San Francisco.
0.6 – – – – – 15.9 MBSS Survey Sites 1996 Dissolved organic carbon (mg/l) 0.7 – – – –
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
Two-Phase Sampling Approach for Augmenting Fixed Grid Designs to Improve Local Estimation for Mapping Aquatic Resources Kerry J. Ritter Molly Leecaster.
Deterministic Solutions Geostatistical Solutions
Example For simplicity, assume Z i |F i are independent. Let the relative frame size of the incomplete frame as well as the expected cost vary. Relative.
Predicting Water Quality Impaired Stream Segments using Landscape-scale Data and a Regional Geostatistical Model Erin E. Peterson Postdoctoral.
SA basics Lack of independence for nearby obs
MSS/MBSS # 1 N. Scott Urquhart Joint work with Erin P. Peterson, Andrew A. Merton, David M. Theobald, and Jennifer A. Hoeting All of Colorado State University,
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
State-Space Models for Biological Monitoring Data Devin S. Johnson University of Alaska Fairbanks and Jennifer A. Hoeting Colorado State University.
Ordinary Kriging Process in ArcGIS
Optimal Sample Designs for Mapping EMAP Data Molly Leecaster, Ph.D. Idaho National Engineering & Environmental Laboratory Jennifer Hoeting, Ph. D. Colorado.
Applications of Nonparametric Survey Regression Estimation in Aquatic Resources F. Jay Breidt, Siobhan Everson-Stewart, Alicia Johnson, Jean D. Opsomer.
Random Effects Graphical Models and the Analysis of Compositional Data Devin S. Johnson and Jennifer A. Hoeting STARMAP Department of Statistics Colorado.
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Multi-scale Analysis: Options for Modeling Presence/Absence of Bird Species Kathryn M. Georgitis 1, Alix I. Gitelman 1, and Nick Danz 2 1 Statistics Department,
Weed mapping tools and practical approaches – a review Prague February 2014 Weed mapping tools and practical approaches – a review Prague February 2014.
Comparison of Variance Estimators for Two-dimensional, Spatially-structured Sample Designs. Don L. Stevens, Jr. Susan F. Hornsby* Department of Statistics.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Spatial Interpolation of monthly precipitation by Kriging method
Spatial Statistics Jonathan Bossenbroek, PhD Dept of Env. Sciences Lake Erie Center University of Toledo.
Statistical Modeling and Analysis of MOFEP Chong He ( with John Kabrick, Xiaoqian Sun, Mike Wallendorf) Department of Statistics University of Missouri-Columbia.
Geog. 579: GIS and Spatial Analysis - Lecture 21 Overheads 1 Point Estimation: 3. Methods: 3.6 Ordinary Kriging Topics: Lecture 23: Spatial Interpolation.
Statistical Methods Statistical Methods Descriptive Inferential
Mercury Concentrations in Stream Fish Throughout 12 Western States in the USA Alan Herlihy and Robert Hughes Dept. of Fisheries & Wildlife, Oregon State.
Geographic Information Science
Design of a Sampling Network for an Estuary in the Colombian Caribbean, Using Geostatistical Methods. 1.INTRODUCTION In environmental statistics, model.
Simulation of spatially correlated discrete random variables Dan Dalthorp and Lisa Madsen Department of Statistics Oregon State University
U.S. Department of the Interior U.S. Geological Survey Using Monitoring Data from Multiple Networks/Agencies to Calibrate Nutrient SPARROW* Models, Southeastern.
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
Geo479/579: Geostatistics Ch7. Spatial Continuity.
Chapter 11: Linear Regression and Correlation Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables.
SGPP: Spatial Gaussian Predictive Process Models for Neuroimaging Data Yimei Li Department of Biostatistics St. Jude Children’s Research Hospital Joint.
Controls on Catchment-Scale Patterns of Phosphorous in Soil, Streambed Sediment, and Stream Water Marcel van der Perk, et al… Journal of Environmental.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Using Regional Models to Assess the Relative Effects of Stressors Lester L. Yuan National Center for Environmental Assessment U.S. Environmental Protection.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Spatial statistics: Spatial Autocorrelation
Quantifying Scale and Pattern Lecture 7 February 15, 2005
Estimation and Model Selection for Geostatistical Models
Ch9 Random Function Models (II)
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Spatial Prediction of Coho Salmon Counts on Stream Networks
Paul D. Sampson Peter Guttorp
Stochastic Hydrology Random Field Simulation
Correlation & Trend Lines
Presentation transcript:

Strength of Spatial Correlation and Spatial Designs: Effects on Covariance Estimation Kathryn M. Irvine Oregon State University Alix I. Gitelman Sandra E. Thompson

The research described in this presentation has been funded by the U.S. Environmental Protection Agency through the STAR Cooperative Agreement CR Program on Designs and Models for Aquatic Resource Surveys at Oregon State University. It has not been subjected to the Agency's review and therefore does not necessarily reflect the views of the Agency, and no official endorsement should be inferred R

Talk Outline Stream Sulfate Concentration –Geostatistical Model –Preliminary Findings Simulations Results –Parameter Estimation Discussion

Study Objective: Model the spatial heterogeneity of stream sulfate concentration in streams in the Mid- Atlantic U.S.

Why stream sulfate concentration? –Indirectly toxic to fish and aquatic biota Decrease in streamwater pH Increase in metal concentrations (AL) –Observed positive spatial relationship with atmospheric SO 4 -2 deposition (Kaufmann et al. 1991)

The Data EMAP water chemistry data –322 stream locations Watershed variables: –% forest, % agriculture, % urban, % mining –% within ecoregions with high sulfate adsorption soils National Atmospheric Deposition Program

EMAP and NADP locations EMAP NADP

Geostatistical Model Where Y(s) is a vector of observed ln(SO 4 -2 ) concentration at stream locations (s) X(s) is a matrix of watershed explanatory variables   is a vector of unknown regression coefficients   (s) is the spatial error process Where D is matrix of pairwise distances,  is 1/range,   is the partial sill   is the nugget (1)

Effective Range Definition: 1) Distance beyond which the correlation between observations is less than or equal to ) Distance where the semi-variogram reaches 95% of the sill.

Semi-Variogram Nugget Partial Sill Effective Range 197 km 272 km

Interpretations of Spatial Covariance Parameters Patch Characteristics (Rossi et al. 1992; Robertson and Gross 1994; Dalthorp et al. 2000; Schwarz et al and more) –Effective Range ~ Size of Patch –Nugget ~ Tightness of Patches Sample Design Modifications –Effective Range: Independent Samples –Nugget: Measurement Error

Why Are the Estimates Different? Simulation Study Strength of Spatial Correlation? –Nugget:Sill ratio and/or Range Parameter Mardia & Marshall (1984): measurement error increases variability of ML estimates of range Zimmerman & Zimmerman (1991): REML and ML better when spatial signal weak (short range) Lark (2000): ML better compared to MOM when short range and large nugget:sill ratio Thompson (2001): estimation for Matern with 20% and 50% nugget under different spatial designs

Is the spatial correlation too weak? Effective Range Values for Simulations EMAP Estimates Re-Scaled: Range Parameter ~1.5 Nugget-to-Sill Ratio ~0.50

Is it the spatial sample design? - Cluster design optimal for covariance parameter estimation (Pettitt and McBratney 1993; Muller and Zimmerman 1999; Zhu and Stein 2005; Xia et al. 2006; Zimmerman 2006; Zhu and Zhang 2006)

Is it the spatial sample design? Zimmerman (2006) and Thompson (2001)

Simulation Study Spatial Designs: Lattice, Random, Cluster Range Parameter = 1 and 3 Nugget/Sill Ratio: 0.10, 0.33, 0.50, 0.67, 0.90 n=144 and n=361 (In-fill Asymptotics) 100 realizations per combination RandomFields in R Estimation using R code (Ver Hoef 2004)

1.Estimation of Covariance Parameters The Effective Range

Range Parameter = 1 Range Parameter = 3 Results for Estimation of Effective Range Estimation Error = estimate - truth

Range Parameter = 1 Range Parameter = 3 Results for Estimation of Effective Range

Range Parameter = 1 Range Parameter = 3 Results for Estimation of Effective Range

Range Parameter = 1 Range Parameter = 3 Results for Estimation of Effective Range

Summary Covariance Parameter Estimation Effective Range : –ML under-estimate the truth –REML more skewed in 90th percentile (large nugget- to-sill and range parameter) Partial Sill: –ML under-estimate the truth –REML more skewed in 90th percentile Nugget: –estimated well; particularly with cluster design

Discussion –Which estimation method to use? –Consistency Results:     Chen et al. 2000, Zhang and Zimmerman 2005) –Uncertainty estimates for REML and ML REML: Increasing Domain (Cressie and Lahiri 1996) ML: Increasing Domain and Infill Asymptotics (Zhang and Zimmerman 2005)

Acknowledgements Co-Authors Jay Ver Hoef, Alan Herlihy, Andrew Merton, Lisa Madsen

Questions

Results 1. Estimation of Covariance Parameters 2. Estimation of Autocorrelation Function

Results: 2. Estimation of Autocorrelation Function

Estimation of Autocorrelation Function Cluster Design

Summary: Estimation of Autocorrelation Function Overall Patterns: –ML and REML poor performance with stronger spatial correlation (larger effective ranges) –REML large variability –ML under-estimation –‘BEST’ case: Cluster Design with range parameter = 1 and n=361

Wet Atmospheric Sulfate Deposition

Estimated Auto-correlation Function for ln(SO 4 -2 )

Sketch of watershed with overlaid landcover map

2. Estimation of Autocorrelation Function Lattice Design

Estimation of Autocorrelation Function Lattice Design

2. Estimation of Autocorrelation Function Random Design

Estimation of Autocorrelation Function Random Design

2. Estimation of Autocorrelation Function Cluster Design