Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Improving Understanding of Global and Regional Carbon Dioxide Flux Variability through Assimilation of in Situ and Remote Sensing Data in a Geostatistical.
1 アンサンブルカルマンフィルターによ る大気海洋結合モデルへのデータ同化 On-line estimation of observation error covariance for ensemble-based filters Genta Ueno The Institute of Statistical.
Basic geostatistics Austin Troy.
Spatial Interpolation
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Lecture 4 The L 2 Norm and Simple Least Squares. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture.
Lecture 6 Resolution and Generalized Inverses. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture.
Estimating parameters in inversions for regional carbon fluxes Nir Y Krakauer 1*, Tapio Schneider 1, James T Randerson 2 1. California Institute of Technology.
Curve-Fitting Regression
Spatial Interpolation
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Applied Geostatistics
Development of Empirical Models From Process Data
Evaluating the Impact of the Atmospheric “ Chemical Pump ” on CO 2 Inverse Analyses P. Suntharalingam GEOS-CHEM Meeting, April 4-6, 2005 Acknowledgements.
Retrieval Theory Mar 23, 2008 Vijay Natraj. The Inverse Modeling Problem Optimize values of an ensemble of variables (state vector x ) using observations:
Deterministic Solutions Geostatistical Solutions
Ordinary Kriging Process in ArcGIS
Computer vision: models, learning and inference Chapter 3 Common probability distributions.
Application of Geostatistical Inverse Modeling for Data-driven Atmospheric Trace Gas Flux Estimation Anna M. Michalak UCAR VSP Visiting Scientist NOAA.
Applications in GIS (Kriging Interpolation)
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Lecture II-2: Probability Review
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Principles of the Global Positioning System Lecture 11 Prof. Thomas Herring Room A;
EXPLORING SPATIAL CORRELATION IN RIVERS by Joshua French.
Spatial Interpolation of monthly precipitation by Kriging method
Spatial Statistics Jonathan Bossenbroek, PhD Dept of Env. Sciences Lake Erie Center University of Toledo.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (1)
Modern Navigation Thomas Herring
Least SquaresELE Adaptive Signal Processing 1 Method of Least Squares.
Method of Least Squares. Least Squares Method of Least Squares:  Deterministic approach The inputs u(1), u(2),..., u(N) are applied to the system The.
Chapter 21 R(x) Algorithm a) Anomaly Detection b) Matched Filter.
Geographic Information Science
Curve-Fitting Regression
Geo479/579: Geostatistics Ch16. Modeling the Sample Variogram.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Spatial Interpolation III
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION ASEN 5070 LECTURE 11 9/16,18/09.
Research Vignette: The TransCom3 Time-Dependent Global CO 2 Flux Inversion … and More David F. Baker NCAR 12 July 2007 David F. Baker NCAR 12 July 2007.
Development of an EnKF to estimate CO 2 fluxes from realistic distributions of X CO2 Liang Feng, Paul Palmer
Generalised method of moments approach to testing the CAPM Nimesh Mistry Filipp Levin.
Quality of model and Error Analysis in Variational Data Assimilation François-Xavier LE DIMET Victor SHUTYAEV Université Joseph Fourier+INRIA Projet IDOPT,
Lecture 6: Point Interpolation
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
ANOVA, Regression and Multiple Regression March
Principle Component Analysis and its use in MA clustering Lecture 12.
Geology 6600/7600 Signal Analysis 04 Nov 2015 © A.R. Lowry 2015 Last time(s): Discussed Becker et al. (in press):  Wavelength-dependent squared correlation.
Geo479/579: Geostatistics Ch7. Spatial Continuity.
Chapter 2-OPTIMIZATION G.Anuradha. Contents Derivative-based Optimization –Descent Methods –The Method of Steepest Descent –Classical Newton’s Method.
Inverse Modeling of Surface Carbon Fluxes Please read Peters et al (2007) and Explore the CarbonTracker website.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
École Doctorale des Sciences de l'Environnement d’Île-de-France Année Universitaire Modélisation Numérique de l’Écoulement Atmosphérique et Assimilation.
Computacion Inteligente Least-Square Methods for System Identification.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
Spatial statistics: Spatial Autocorrelation
Chapter 4 Basic Estimation Techniques
Subsurface mapping, deterministics and volumes
Ch9 Random Function Models (II)
Computer vision: models, learning and inference
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Paul D. Sampson Peter Guttorp
Interpolation & Contour Maps
Principles of the Global Positioning System Lecture 11
Linear regression.
Hartmut Bösch and Sarah Dance
Presentation transcript:

Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom modelers

Plan  Motivation  Mathematical statement  Methodology  Regularization and warning  TransCom synthetic data  Structural analysis of TransCom data

Motivations and goal  Long time series inversions becomes computer intensive.  Sliding window inversions are commonly used in many sciences (Kalman filter, ARMA filter, etc…) and have been recently used in CO2 inversion (Bruhwiler et al., 2004).  Geostatistical structural analysis for stochastic inversion.

Mathematical statement of the problem We can write the system we are working on, A (f) = c, as a matrix equation: As = c Where, A (describing the forward modelling function) is an N x M matrix, obtained by Transport Simulations. s is the vector of model parameters with M elements, the CO2 fluxes c is the vector of data with N elements, the CO2 measured concentrations

Least-squares and regularization Encountered problems in CO2 flux inversion: - ill-posed - generalized inverse numerically unstable. Regularization allows to compute an inverse of A: ||As – c|| 2 + g ||K s|| 2 where K is any definite positive matrix g is a scalar.

Regularized inversion tells the truth you want to hear GurneyJacobson

We present here an example of the nonuniqueness of underdetermined problems using a simple pair of linear equations. Consider the system described by m1 + 2m2 - m3 + m4 = 6 -m1 + m2 + 2m3 - m4 = 2 or, equivalently: A simple example

This system of equations has Four unknowns (m1, m2, m3, m4 ). Two data (6, 2). It is an underdetermined system. There is no unique solution. Here are four solutions that all will satisfy this system of equations: mA = ( 2.000, 2.000, 2.000, ) mB = ( 0.444, 2.622, 0.134, ) => How to choose which model is "best"? mC = (-2.408, 2.630, 0.109, ) mD = ( 2.002, 2.846, , ) Suite…

Given multiple solutions, how do we choose one that is useful? We need a quantitative way to distinguish between acceptible models. The solution is to find a solution that is "largest" or "smallest". Norms are mathematical rulers to measure "length". We will define m to be the norm of the model. m will be called the model objective function. The procedure for selecting one model will be: 1 Define the model objective function. 2 Choose the shortest; i.e. minimize this function. As examples, one could: Find the solution with smallest magnitude by minimizing (eqn. 1), Or find the solution that is flattest by minimizing, (eqn. 2). Dealing With Nonuniqueness: Norms & Model Objective Functions

The minimum model as specified by the objective function is highlighted in colour. Using a smallest model objective function (eqn. 1) mA = ( 2.000, 2.000, 2.000, ) small = mB = ( 0.444, 2.622, 0.134, ) small = 7.23 mC = (-2.408, 2.630, 0.109, ) small = mD = ( 2.002, 2.846, , ) small = Using a flattest (most featureless) model objective function (eqn. 2) mA = ( 2.000, 2.000, 2.000, ) flat = 0.0 mB = ( 0.444, 2.622, 0.134, ) flat = mC = (-2.408, 2.630, 0.109, ) flat = mD = ( 2.002, 2.846, , ) flat = Impact of the choice of then norm

The choice of O m determines the outcome, and if the "right" model objective function is chosen, a solution close to the "true" fluxes is obtained. Just what exactly is the "right" model objective function is the next obvious question. It will be tackled in the section entitled A Generic Model Objective Function. First, however, we must discuss the important general issue of how close predicted data must match observations. This is referred to as the "data misfit". This implies the importance of exploring the data and model spaces. Structural analysis allows to regularize the solution without any a priori. Conclusions on regularization

Sliding window cokriging as a regularization tool  Cokriging is a mathematical tool that allows to interpolate an unsample variable (here, the fluxes) using a secondary measured variable (here, the concentration).  Fluxes cokriging needs the spatial and temporal covariances to be known.

Slowness covariance modelisation based on measured times cov(c,c) = H cov(f,f) H T + Co Covariances of linearly related data are related with: If E[c] =0, then cov(c,c) = E[c,c T ] Their exists several covariance functions that allow the modelization of cov(f,f).

Cokriging As cov(s,s) has been modelized, the slowness field can be cokriged. The cokriging estimator is  = (Hcov(f,f)H T + Co) -1 * c Sck =  T H cov(s,s)

TransCom Data  Synthetic CO2 fluxes using fossil fuel, Net Ecosystem Productivity and Takahashi ocean’s fluxes.  Synthetic CO2 concentrations from 253 TransCom stations.  Integration of sampled fluxes on the 22 TransCom regions.

TransCom Regions and measurement sites Longitudes Latitudes

Synthetic CO2 fluxes of the 22 TransCom regions (Michalak, 2004)

The synthetic monthly fluxes

What can we say?  Fluxes vary strongly in time.  The « shape » of the fluxes varies in time.  Consecutive fluxes seems to be more correlated than fluxes farther apart in time. => Structural analysis of the fluxes

Structural analysis  Cross-covariance of variables Y and Z: C zy (h) = cov(Z(x),Y(x+h)) where h is a distance separating 2 samples  Cross-variogram of variables Y and Z:  zy (h) = 0.5*cov((Z(x)-Z(x+h)), (Z(x)-Z(x+h)))

Exemple of CO2 flux covariance Nugget + sill

Features of the experimental variogram Features of the covariogram: Sill: maximum semi-variance; represents variability in the absence of spatial dependence. Range: separation between point-pairs at which the sill is reached; distance at which there is no evidence of spatial dependence Nugget: semi-variance as the separation approaches zero; represents variability at a point that can’t be explained by spatial structure.

Structural analysis of the monthly CO2 fluxes. It appears that the covariances of the fluxes vary in time.

Cross-covariances between january fluxes and the other eleven months. After 4 months, the fluxes are uncorrelated.

Covariance of the monthly CO2 concentrations computed using TransCom fluxes. After 4 months, the spatial structure changes dramatically!

Cross-covariance of the January CO2 concentrations and other months computed using TransCom fluxes. After 4 months, the concentrations are uncorrelated.

CO2 flux anisotropy…

CO2 concentration anisotropy…

What about real data?

Cross-covariances between june 199 fluxes and the other eleven months. After 2 months, the fluxes are uncorrelated.

CO2 concentration anisotropy for real data

Conclusions After 4 months synthetic CO2 fluxes and concentrations are uncorrelated Synthetic fluxes show an spatial anisotropy These results can be used to performed time- dependent sliding windows stochastic inversion and/or cosimulation.