Presentation on theme: "A Fast Diffeomorphic Image Registration Algorithm John Ashburner 2008."— Presentation transcript:
A Fast Diffeomorphic Image Registration Algorithm John Ashburner 2008
Overview Parameterization –Scaling and Squaring Objective Function Optimization Group-wise Registration Comparison with LDDMM
Julian Huxley (1932). PROBLEMS OF RELATIVE GROWTH A constant partition of growth intensity between different regions implies constant differences in their rates of growth. Thus any genes controlling relative size of parts will have to exert their action by influencing the rates of processes, … One essential fact about growth is that it is a process of self- multiplication of living substance – i.e. that the rate of growth of an organism growing equally in all its parts is at any moment proportional to the size of the organism.
Allometry Julian Huxley proposed that simpler relationships among lengths, volumes or areas of anatomical structures could be found by working with the logarithms of the measures. exp(u) = φ(1) = u φ(t) dt, where φ(0) = 1 t=0 1
Parameterization Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra Deformations parameterized by a single flow field, which is considered to be constant in time. Not really a proper Lie Group. Often referred to as a one parameter subgroup.
Euler Integration Parameterising the deformation φ (0) (x) = x φ (1) (x) = u ( φ (t) (x) ) dt u is a flow field to be estimated Scaling and squaring is used to generate deformations. –c.f. matrix exponentiation t=0 1
Euler integration The differential equation is dφ(x)/dt = u ( φ (t) (x) ) By Euler integration φ (t+h) = φ (t) + hu(φ (t) ) Equivalent to φ (t+h) = (x + hu) o φ (t)
For (e.g) 8 time steps Simple integration φ (1/8) = x + u/8 φ (2/8) = φ (1/8) o φ (1/8) φ (3/8) = φ (1/8) o φ (2/8) φ (4/8) = φ (1/8) o φ (3/8) φ (5/8) = φ (1/8) o φ (4/8) φ (6/8) = φ (1/8) o φ (5/8) φ (7/8) = φ (1/8) o φ (6/8) φ (8/8) = φ (1/8) o φ (7/8) 7 compositions Scaling and squaring φ (1/8) = x + u/8 φ (2/8) = φ (1/8) o φ (1/8) φ (4/8) = φ (2/8) o φ (2/8) φ (8/8) = φ (4/8) o φ (4/8) 3 compositions Similar procedure used for the inverse. Starts with φ (-1/8) = x - u/8
Scaling and squaring example
Deformations at different times
Jacobians Jacobian fields can also be obtained by scaling and squaring. If warps are composed by: ϕ C = ϕ B ϕ A then Jacobian matrices are obtained by the following matrix multiplication: J ϕ C =(J ϕ B ϕ A ) J ϕ A
Jacobian determinants remain positive (almost)
See also… C. Moler and C. van Loan. Nineteen Dubious Ways to Compute the Exponential of a Matrix, Twenty-Five Years Later. SIAM Review 45(1):3-49 (2003). V. Arsigny, O. Commowick, X. Pennec and N. Ayache. A Log- Euclidean Polyaffine Framework for Locally Rigid or Affine Registration. Proc. Of the 3rd International Workshop on Biomedical Image Registration (WBIR'06), 2006, pp LNCS vol Springer-Verlag, Utrecht, NL. V. Arsigny, O. Commowick, X. Pennec and N. Ayache. A Log- Euclidean Framework for Statistics on Diffeomorphisms. Proc. of the 9th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'06), 2006, pp LNCS Springer-Verlag, Berlin, Germany. M. Hernandez, M. N. Bossa, and S. Olmos. Registration of anatomical images using geodesic paths of diffeomorphisms parameterized with stationary vector fields. IEEE workshop on Math. Meth. in Biom. Image Anal. (MMBIA07), 2007.
Overview Parameterization Objective Function Optimization Group-wise Registration Future directions
Original objective function Simultaneously minimize the sum of –Likelihood component Sum of squares difference ½ i k ( t k (x i ) – μ k (φ (1) (x i )) ) 2 φ (1) parameterized by u –Prior component A measure of deformation roughness ½u T Hu
Likelihood Term Images assumed to be partitioned into different tissue classes. –E.g., a 3 class registration simultaneously matches: Grey matter with grey matter White matter wit white matter Background (1 – GM – WM) with background
Prior Term ½u T Hu DARTEL has three different models for H –Membrane energy –Linear elasticity –Bending energy H is very sparse An example H for 2D registration of 6x6 images (linear elasticity)
Membrane Energy Convolution KernelSparse Matrix Representation Penalises first derivatives. Sum of squares of the elements of the Jacobian (matrices) of the flow field.
Bending Energy Sparse Matrix RepresentationConvolution Kernel Penalises second derivatives.
Linear Elasticity Decompose the Jacobian of the flow field into –Symmetric component ½(J+J T ) Encodes non-rigid part. –Anti-symmetric component ½(J-J T ) Encodes rigid-body part. Penalise sum of squares of symmetric part. Trace of Jacobian encodes volume changes. Also penalised.
Regularization models Membrane energy Bending energy Images registered using a small deformation approximation
Overview Parameterization Objective Function Optimization –Gauss-Newton –Multi-grid Group-wise Registration Comparison with LDDMM
Optimization Uses Gauss-Newton –Requires a matrix solution to a very large set of equations at each iteration u (k+1) = u (k) - (H+A) -1 b –b are the first derivatives of objective function –A is a sparse matrix of second derivatives –Computed efficiently, making use of scaling and squaring
Computing Derivatives First derivatives w.r.t. flow field are considered as a vector field when being computed. Second derivatives are treated as a field of positive definite symmetric matrices when computed. Approximations involved, as a result of multiple interpolations used for the computations. –Derivatives resulting from evolution from time 0 to time 1 are simply the sum of the derivatives from time 0 to time 0.5 and those from time 0.5 to time 1. –Derivatives from 0.5 to 1 can be simply computed from those from time 0 to time 0.5. –Similarly derivatives from time 0 to 0.5 are the sum of derivatives from 0 to 0.25 and 0.25 to 0.5. –etc
Relaxation To solve Mx = c Split M into E and F, where E is easy to invert F is more difficult If M is diagonally dominant (membrane energy): x (k+1) = E -1 (c – F x (k) ) Otherwise regularize (bending or linear elastic energy): x (k+1) = x (k) + (E+sI) -1 (c – M x (k) ) –Diagonal dominance is when |m ii | > Σ ij |m ij |
M = H+A = E+F 2 nd derivs of prior term 2 nd derivs of likelihood term Easy to invert Difficult to invert
Relaxation Strategies Jacobis method if not done in place, such that updates are not used immediately. –Easier to implement in MATLAB Gauss-Siedel when done in place. –Faster convergence and uses less memory. –Red-black alternating update scheme is used with membrane energy. –Sweeps in alternating directions are used for updates when regularization is linear-elastic or bending energy. Both methods fit high frequencies quickly, but low frequencies slowly.
Highest resolution Lowest resolution Full Multi-Grid
A Prolongation of low resolution solution to current resolution. Add this to existing solution. Perform a few iterations of relaxation. Restrict residuals down to lower resolution.
B Prolongation of low resolution solution to current resolution. Add this to existing solution at current resolution. Perform a few iterations of relaxation. Prolongation of solution to higher resolution.
C Restrict high resolution residuals to current resolution. Perform a few iterations of relaxation. Restrict residuals down to lower resolution.
E Restrict higher resolution residuals to current resolution. Obtain exact solution by matrix inversion. Prolongation of solution to higher resolution.
See also… W. H. Press, S. A. Teukolsky, W. T. Vetterling and B. P. Flannery. Numerical Recipes in C (Second Edition). Cambridge University Press, Cambridge, UK –Chapter 15, Section 5 explains Gauss- Newton optimization (Levenberg-Marquardt without the regularisation). –Chapter 19, Section 6 explains the basics of multi-grid methods.
Overview Principles Objective Function Optimization Group-wise Registration –Multinomial distributions –Simultaneous registration of GM & WM –Tissue probability map creation Comparison with LDDMM
Average Shaped Template For CA, work in the tangent space of the manifold, using linear approximations. –Average-shaped templates give less bias, as the tangent-space at this point is a closer approximation. For spatial normalisation of fMRI, warping to a more average shaped template is less likely to cause signal to disappear. –If a structure is very small in the template, then it will be very small in the spatially normalised individuals. Smaller deformations are needed to match with an average-shaped template. –Smaller errors.
Average shaped templates Linear Average Average on Riemannian manifold (Not on Riemannian manifold)
Template Generation Initial Average After a few iterations Final template Iteratively generated from 471 subjects. Began with rigidly aligned tissue probability maps. Regularization lighter for later iterations.
Multinomial Model The extended model is multinomial for matching tissue class images. log p(t|μ, ϕ ) = Σ j Σ k t jk log(μ k ( ϕ j )) t – individual GM, WM and background μ – template GM, WM and background ϕ – deformation A general purpose template should not have regions where log( μ ) is –Inf.
Template modelled as softmax of a Gaussian process μ k (x) = exp(a k (x))/(Σ j exp(a j (x))) MAP solution determined for a, by Gauss-Newton optimisation, using multi-grid.
Determining amount of regularisation Matrices too big for Bayesian variance component estimation. Used cross- validation. Smooth an image by different amounts, see how well it predicts other images: Rigidly aligned Nonlinear registered log p(t|μ) = Σ j Σ k t jk log(μ jk )
ML and MAP templates from 6 subjects Nonlinearly Registered Rigidly registered log MAP ML
471 Subject Average
Overview Principles Objective Function Optimization Group-wise Registration Comparison with LDDMM
Slight Problem The DARTEL framework is a parameterisation over the background space through which the deformation evolves. Each part of the flow field is not uniquely associated with one part of the brain. –Not an ideal representation for morphometry. It is not able to represent the full group of diffeomorphic deformations.
Initial momentum Variable velocity framework (as in LDDMM)
Initial momentum Variable velocity framework (as in LDDMM)
Variable Velocity The variable velocity framework of LDDMM actually does associate its parameters with unique points in the brain. The entire trajectory of an evolving LDDMM deformation is (in theory) uniquely specified by the starting conditions. These initial conditions provide a more meaningful measure of shape.
Initial Momentum Maps The assumptions behind allometry fail for overlapping or adjacent structures. Initial momentum maps may be the way to go….
See also… M. F. Beg, M. I. Miller, A. Trouvé and L. Younes.Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms. International Journal of Computer Vision 61(2):139–157 (2005). M. Vaillant, M. I. Miller, L. Younes and A. Trouvé. Statistics on diffeomorphisms via tangent space representations. NeuroImage 23:S161–S169 (2004). L. Younes, Jacobi fields in groups of diffeomorphisms and applications. Quart. Appl. Math. 65:113–134 (2007).
Rigid Rotation Consider a 2D rotation y=Rx, where This can be formulated as the solution of a differential equation at time t=1 –x 1 (t) = x 2 (t) –x 2 (t) = - x 1 (t) or –x(t) = Ax(t), where
More on rotations A rigid-body transform can be parameterised by the matrix exponential of an anti-symmetric (skew-symmetric) matrix.