Space-time Modelling Using Differential Equations Alan E. Gelfand, ISDS, Duke University (with J. Duan and G. Puggioni)

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Pattern Recognition and Machine Learning
Emulation of a Stochastic Forest Simulator Using Kernel Stick-Breaking Processes (Work in Progress) James L. Crooks (SAMSI, Duke University)
Spatial point patterns and Geostatistics an introduction
Gibbs Sampling Methods for Stick-Breaking priors Hemant Ishwaran and Lancelot F. James 2001 Presented by Yuting Qi ECE Dept., Duke Univ. 03/03/06.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Gaussian process emulation of multiple outputs Tony O’Hagan, MUCM, Sheffield.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
1 Software Testing and Quality Assurance Lecture 36 – Software Quality Assurance.
Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.
Sérgio Pequito Phd Student
Discrete-Time Models Lecture 1. Discrete models or difference equations are used to describe biological phenomena or events for which it is natural to.
Group problem solutions 1.(a) (b). 2. In order to be reversible we need or equivalently Now divide by h and let h go to Assuming (as in Holgate,
Machine Learning CMPT 726 Simon Fraser University
Continuity and covariance Recall that for stationary processes so if C is continuous then Z is mean square continuous. To get results about sample paths.
Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.
Amos Storkey, School of Informatics. Density Traversal Clustering and Generative Kernels a generative framework for spectral clustering Amos Storkey, Tom.
Computer vision: models, learning and inference
458 Fitting models to data – I (Sum of Squares) Fish 458, Lecture 7.
Statistical Tools for Environmental Problems NRCSE.
Groundwater permeability Easy to solve the forward problem: flow of groundwater given permeability of aquifer Inverse problem: determine permeability from.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
1 Spatial and Spatio-temporal modeling of the abundance of spawning coho salmon on the Oregon coast R Ruben Smith Don L. Stevens Jr. September.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Probability theory 2 Tron Anders Moger September 13th 2006.
Model Inference and Averaging
Random Sampling, Point Estimation and Maximum Likelihood.
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Regional climate prediction comparisons via statistical upscaling and downscaling Peter Guttorp University of Washington Norwegian Computing Center
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
The Dirichlet Labeling Process for Functional Data Analysis XuanLong Nguyen & Alan E. Gelfand Duke University Machine Learning Group Presented by Lu Ren.
STA 216 Generalized Linear Models Meets: 2:50-4:05 T/TH (Old Chem 025) Instructor: David Dunson 219A Old Chemistry, Teaching.
Bayesian Reasoning: Tempering & Sampling A/Prof Geraint F. Lewis Rm 560:
CS Statistical Machine learning Lecture 24
Introduction to kriging: The Best Linear Unbiased Estimator (BLUE) for space/time mapping.
Analyzing wireless sensor network data under suppression and failure in transmission Alan E. Gelfand Institute of Statistics and Decision Sciences Duke.
A Mass-Balance, Watershed-Scale Analysis of the Chemistry of Adirondack Lakes Discussion - Day 5.
Lecture 2: Statistical learning primer for biologists
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Markov Chain Monte Carlo for LDA C. Andrieu, N. D. Freitas, and A. Doucet, An Introduction to MCMC for Machine Learning, R. M. Neal, Probabilistic.
by Ryan P. Adams, Iain Murray, and David J.C. MacKay (ICML 2009)
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/45 GEOSTATISTICS INTRODUCTION.
Generalized Spatial Dirichlet Process Models Jason A. Duan Michele Guindani Alan E. Gelfand March, 2006.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Hierarchical Models. Conceptual: What are we talking about? – What makes a statistical model hierarchical? – How does that fit into population analysis?
Prediction and Missing Data. Summarising Distributions ● Models are often large and complex ● Often only interested in some parameters – e.g. not so interested.
Modeling and Simulation CS 313
Modeling and Simulation CS 313
Model Inference and Averaging
STA 216 Generalized Linear Models
STA 216 Generalized Linear Models
Filtering and State Estimation: Basic Concepts
Generalized Spatial Dirichlet Process Models
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Statistical NLP: Lecture 4
Volume 6, Issue 5, Pages e13 (May 2018)
Chap. 7 Regularization for Deep Learning (7.8~7.12 )
Pattern Recognition and Machine Learning
Dynamic Causal Modelling for M/EEG
Learning From Observed Data
Presentation transcript:

Space-time Modelling Using Differential Equations Alan E. Gelfand, ISDS, Duke University (with J. Duan and G. Puggioni)

The Contribution Space-time data collection is increasing This talk is entirely about modelling for such data in the case of geo-coded locations Modelling the data directly or using random effects to explain the data Using process models to provide process realizations Using classes of parametric functions to provide realizations Using classes of differential equations to provide realizations

Two applications Two examples working with differential equations A realization of a space-time point pattern driven by a random space-time intensity – the application is to urban development measured through housing construction Spatio-temporal data collection - soil moisture measurements in time and space assumed to be a realization of a hydrological model – work in progress

Important Points A differential equation in time at every spatial location, i.e., parameters indexed by location The parameters in the differential equation vary spatially as realizations of a spatial process ALTERNATIVELY, the differential equation is a stochastic differential equation (SDE), e.g., a spatial Ornstein-Uhlenbeck process (Brix and Diggle) OR, the rate parameter in the differential equation is assumed to change over time. It can be modelled as a realization of a spatio-temporal process OR, the rate parameter can be modelled using a SDE, yielding an SDE embedded within the differential equation

Spatio-temporal Point Process Models Overview: Urban Development & Spatio-temporal Point Processes. Differential Equation Models for Cumulative Intensity Model Fitting & Inference Data Examples: –Simulated data –Urban development data for Irving, TX

Urban Development Problem Residential houses in Irving, TX

Our objectives Space-time point pattern of urban development using a spatio-temporal Cox process model. Work with housing development (available at high resolution) surrogate for population growth (not available at high resolution) Use population models, expect housing dynamics to be similar to population dynamics Differential equation models for surfaces. Insight into interpretable mechanisms of growth Bayesian inference and prediction: –Discretizing time and space(replace integral by sum) –Kernel convolution approximation( to handle large sample size)

Spatio-temporal Cox Process In a study region D during a period of [0,T], N T events: Point pattern: is a Poisson process with inhomogeneous intensity Specifying the intensity? are processes for parameters of interest. where

The cumulative intensity Discretize the spatio-temporal Cox process in time: during The cumulative intensity foris We consider models for the cumulative intensity Spatial point pattern:

Comments

Three Growth Models Exponential growth Gompertz growth Logistic growth local growth rate local carrying capacity

Logistic Population Growth average growth rate for region D carrying capacity for region D current population at time t population growth at time t Model for the aggregate intensity.

Proper Scaling Local growth model should scale with the global growth model: cumulate average

Process Models for the Parameters and initial intensity are parameter processes which are modeled on log scale as Hence, given the growth curve is fixed. Also, the μ’s are trend surfaces.

Diffusion Model (SDE) for Growth Rate Let Spatial Ornstein-Uhlenbeck (OU) process model: where W t (s) is spatial Brownian motion and, again It induces a stationary process with separable space-time covariance: Can not add scaled spatial Brownian motion to the logistic diff eqn. Instead, a time-varying growth rate at each location

Discretizing Time Back to the original model, the intensity for the spatial point pattern in a time interval: Difference equation model: a recursionexplicit transition

Discrete-time Model Likelihood Model parameters and latent processes: stochastic integral point i in period j

Discretizing Space Divide region D into M cells. Assume homogeneous intensity in each cell. We obtain (with r(m), k(m) average growth rate and cumulative carrying capacity): with induced transition The joint likelihood (product Poisson):

Parametrizing the spatial processes Growth rate, carry capacity and initial intensity for each cell: M is very large (2500 in our example) !

Kernel Convolution Approximation A dimension reduction approach (2500—>100) Kernel Convolution (Xia & Gelfand, 2006): study region D covering region centroid of block l block l

Kernel Convolution Approximation Approximate area of block l centroid of block l kernel function where Let centroid of cell m

Kernel Convolution Approximation We use Matérn class covariance function:

Bayesian Inference Posterior: Priors:

Bayesian Inference Random-walk Metropolis for posterior exploration: Each parameter is updated sequentially in time in every iteration. are sampled in three blocks.

Recursive Calculation Calculate random effects: Sequentially, use

Simulated Data Analysis time 0year 1 year 2year 3 Initially and successive 5 years

Simulated Data Analysis: Estimation

Posterior: Actual: rK

Simulation: One Step Ahead Prediction PredictedActual

New Houses in Irving, TX: 1952—1957 We use 1951 –1966 to fit our model, leaving 1967 and 1968 out for prediction and validation.

Data Analysis for Irving, TX centroid of block l Divide region D into M cells

Data Analysis for Irving, TX with points at t 0 rK

Prediction: New Houses in1967 & 1968 One Step Ahead 1967 Two Step Ahead 1968

Future work on this project Intensity/type of development – marked point pattern Underlying determinants of development, e.g., zoning, roads, time-varying? - add as covariates in rates, carrying capacities Holes, e.g., lakes, parks, externalities – force zero growth Fit the SDE within the DE A model where the observations drive the intensity, e.g., a self-exciting process

Conclusion Contributions –Integrate Spatio-temporal point process, Bayesian hierarchical models and SDE models –Urban growth research with high spatial resolution –Directly interpretable results Future Work –SDE model for random growth rate –Better approximation for the likelihood –Need efficient Metropolis-Hasting algorithm

Soil moisture loss, i.e., transpiration and drainage as a function of soil moisture

A simulated data set

Results

First Differences

Wilting point and field capacity