Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom.

Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom modelers

Plan  Motivation  Mathematical statement  Methodology  Regularization and warning  TransCom synthetic data  Structural analysis of TransCom data

Motivations and goal  Long time series inversions becomes computer intensive.  Sliding window inversions are commonly used in many sciences (Kalman filter, ARMA filter, etc…) and have been recently used in CO2 inversion (Bruhwiler et al., 2004).  Geostatistical structural analysis for stochastic inversion.

Mathematical statement of the problem We can write the system we are working on, A (f) = c, as a matrix equation: As = c Where, A (describing the forward modelling function) is an N x M matrix, obtained by Transport Simulations. s is the vector of model parameters with M elements, the CO2 fluxes c is the vector of data with N elements, the CO2 measured concentrations

Least-squares and regularization Encountered problems in CO2 flux inversion: - ill-posed - generalized inverse numerically unstable. Regularization allows to compute an inverse of A: ||As – c|| 2 + g ||K s|| 2 where K is any definite positive matrix g is a scalar.

Regularized inversion tells the truth you want to hear GurneyJacobson

We present here an example of the nonuniqueness of underdetermined problems using a simple pair of linear equations. Consider the system described by m1 + 2m2 - m3 + m4 = 6 -m1 + m2 + 2m3 - m4 = 2 or, equivalently: A simple example

This system of equations has Four unknowns (m1, m2, m3, m4 ). Two data (6, 2). It is an underdetermined system. There is no unique solution. Here are four solutions that all will satisfy this system of equations: mA = ( 2.000, 2.000, 2.000, 2.000 ) mB = ( 0.444, 2.622, 0.134, 0.446 ) => How to choose which model is "best"? mC = (-2.408, 2.630, 0.109, 3.256 ) mD = ( 2.002, 2.846, -0.537, -2.230 ) Suite…

Given multiple solutions, how do we choose one that is useful? We need a quantitative way to distinguish between acceptible models. The solution is to find a solution that is "largest" or "smallest". Norms are mathematical rulers to measure "length". We will define m to be the norm of the model. m will be called the model objective function. The procedure for selecting one model will be: 1 Define the model objective function. 2 Choose the shortest; i.e. minimize this function. As examples, one could: Find the solution with smallest magnitude by minimizing (eqn. 1), Or find the solution that is flattest by minimizing, (eqn. 2). Dealing With Nonuniqueness: Norms & Model Objective Functions

The minimum model as specified by the objective function is highlighted in colour. Using a smallest model objective function (eqn. 1) mA = ( 2.000, 2.000, 2.000, 2.000 ) small = 16.00 mB = ( 0.444, 2.622, 0.134, 0.446 ) small = 7.23 mC = (-2.408, 2.630, 0.109, 3.256 ) small = 23.33 mD = ( 2.002, 2.846, -0.537, -2.230 ) small = 17.36 Using a flattest (most featureless) model objective function (eqn. 2) mA = ( 2.000, 2.000, 2.000, 2.000 ) flat = 0.0 mB = ( 0.444, 2.622, 0.134, 0.446 ) flat = 11.02 mC = (-2.408, 2.630, 0.109, 3.256 ) flat = 41.61 mD = ( 2.002, 2.846, -0.537, -2.230 ) flat = 15.00 Impact of the choice of then norm

The choice of O m determines the outcome, and if the "right" model objective function is chosen, a solution close to the "true" fluxes is obtained. Just what exactly is the "right" model objective function is the next obvious question. It will be tackled in the section entitled A Generic Model Objective Function. First, however, we must discuss the important general issue of how close predicted data must match observations. This is referred to as the "data misfit". This implies the importance of exploring the data and model spaces. Structural analysis allows to regularize the solution without any a priori. Conclusions on regularization

Sliding window cokriging as a regularization tool  Cokriging is a mathematical tool that allows to interpolate an unsample variable (here, the fluxes) using a secondary measured variable (here, the concentration).  Fluxes cokriging needs the spatial and temporal covariances to be known.

Slowness covariance modelisation based on measured times cov(c,c) = H cov(f,f) H T + Co Covariances of linearly related data are related with: If E[c] =0, then cov(c,c) = E[c,c T ] Their exists several covariance functions that allow the modelization of cov(f,f).

Cokriging As cov(s,s) has been modelized, the slowness field can be cokriged. The cokriging estimator is  = (Hcov(f,f)H T + Co) -1 * c Sck =  T H cov(s,s)

TransCom Data  Synthetic CO2 fluxes using fossil fuel, Net Ecosystem Productivity and Takahashi ocean’s fluxes.  Synthetic CO2 concentrations from 253 TransCom stations.  Integration of sampled fluxes on the 22 TransCom regions.

TransCom Regions and measurement sites Longitudes Latitudes

Synthetic CO2 fluxes of the 22 TransCom regions (Michalak, 2004)

The synthetic monthly fluxes

What can we say?  Fluxes vary strongly in time.  The « shape » of the fluxes varies in time.  Consecutive fluxes seems to be more correlated than fluxes farther apart in time. => Structural analysis of the fluxes

Structural analysis  Cross-covariance of variables Y and Z: C zy (h) = cov(Z(x),Y(x+h)) where h is a distance separating 2 samples  Cross-variogram of variables Y and Z:  zy (h) = 0.5*cov((Z(x)-Z(x+h)), (Z(x)-Z(x+h)))

Exemple of CO2 flux covariance Nugget + sill

Features of the experimental variogram Features of the covariogram: Sill: maximum semi-variance; represents variability in the absence of spatial dependence. Range: separation between point-pairs at which the sill is reached; distance at which there is no evidence of spatial dependence Nugget: semi-variance as the separation approaches zero; represents variability at a point that can’t be explained by spatial structure.

Structural analysis of the monthly CO2 fluxes. It appears that the covariances of the fluxes vary in time.

Cross-covariances between january fluxes and the other eleven months. After 4 months, the fluxes are uncorrelated.

Covariance of the monthly CO2 concentrations computed using TransCom fluxes. After 4 months, the spatial structure changes dramatically!

Cross-covariance of the January CO2 concentrations and other months computed using TransCom fluxes. After 4 months, the concentrations are uncorrelated.

CO2 flux anisotropy…

CO2 concentration anisotropy…

What about real data?

Cross-covariances between june 199 fluxes and the other eleven months. After 2 months, the fluxes are uncorrelated.

CO2 concentration anisotropy for real data

Conclusions After 4 months synthetic CO2 fluxes and concentrations are uncorrelated Synthetic fluxes show an spatial anisotropy These results can be used to performed time- dependent sliding windows stochastic inversion and/or cosimulation.

Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom.

Similar presentations

Presentation on theme: "Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom.

Similar presentations

Presentation on theme: "Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom."— Presentation transcript:

Similar presentations

About project

Feedback