Wavelets and functional MRI Ed Bullmore Mathematics in Brain Imaging IPAM, UCLA July 21, 2004.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

1st level analysis - Design matrix, contrasts & inference
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Periodograms Bartlett Windows Data Windowing Blackman-Tukey Resources:
SPM 2002 C1C2C3 X =  C1 C2 Xb L C1 L C2  C1 C2 Xb L C1  L C2 Y Xb e Space of X C1 C2 Xb Space X C1 C2 C1  C3 P C1C2  Xb Xb Space of X C1 C2 C1 
Pattern Recognition and Machine Learning
Topological Inference Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London SPM Course London, May 2014 Many thanks to Justin.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Classical inference and design efficiency Zurich SPM Course 2014
Multiple testing Justin Chumbley Laboratory for Social and Neural Systems Research Institute for Empirical Research in Economics University of Zurich With.
The General Linear Model (GLM) Methods & models for fMRI data analysis in neuroeconomics November 2010 Klaas Enno Stephan Laboratory for Social & Neural.
07/01/15 MfD 2014 Xin You Tai & Misun Kim
Multiple testing Justin Chumbley Laboratory for Social and Neural Systems Research Institute for Empirical Research in Economics University of Zurich With.
Multiple comparison correction Methods & models for fMRI data analysis 18 March 2009 Klaas Enno Stephan Laboratory for Social and Neural Systems Research.
The General Linear Model (GLM)
The General Linear Model (GLM) SPM Course 2010 University of Zurich, February 2010 Klaas Enno Stephan Laboratory for Social & Neural Systems Research.
Wavelet Estimation of a Local Long Memory Parameter B. Whitcher EURANDOM, The Netherlands M. J. Jensen University of Missouri.
Spatial wavelet analysis Discrete fMRI Testing for active regions Bootstrapping functional connectivity Continuous Lidar.
Comparison of Parametric and Nonparametric Thresholding Methods for Small Group Analyses Thomas Nichols & Satoru Hayasaka Department of Biostatistics U.
Multiple comparison correction Methods & models for fMRI data analysis 29 October 2008 Klaas Enno Stephan Branco Weiss Laboratory (BWL) Institute for Empirical.
Introduction to Wavelets
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
(1) A probability model respecting those covariance observations: Gaussian Maximum entropy probability distribution for a given covariance observation.
Rician Noise Removal in Diffusion Tensor MRI
SPM short course – May 2003 Linear Models and Contrasts The random field theory Hammering a Linear Model Use for Normalisation T and F tests : (orthogonal.
ENG4BF3 Medical Image Processing
General Linear Model & Classical Inference Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London SPM M/EEGCourse London, May.
TSTAT_THRESHOLD (~1 secs execution) Calculates P=0.05 (corrected) threshold t for the T statistic using the minimum given by a Bonferroni correction and.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Multiple Comparison Correction in SPMs Will Penny SPM short course, Zurich, Feb 2008 Will Penny SPM short course, Zurich, Feb 2008.
Inference by permutation of multi- subject neuroimaging studies John Suckling Brain Mapping Unit Department of Psychiatry University of Cambridge Acknowledgements:
With many thanks for slides & images to: FIL Methods group, Virginia Flanagin and Klaas Enno Stephan Dr. Frederike Petzschner Translational Neuromodeling.
Random Field Theory Will Penny SPM short course, London, May 2005 Will Penny SPM short course, London, May 2005 David Carmichael MfD 2006 David Carmichael.
Basics of fMRI Inference Douglas N. Greve. Overview Inference False Positives and False Negatives Problem of Multiple Comparisons Bonferroni Correction.
FMRI Methods Lecture7 – Review: analyses & statistics.
SPM short course – Oct Linear Models and Contrasts Jean-Baptiste Poline Neurospin, I2BM, CEA Saclay, France.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
Bayesian Inference and Posterior Probability Maps Guillaume Flandin Wellcome Department of Imaging Neuroscience, University College London, UK SPM Course,
Functional Brain Signal Processing: EEG & fMRI Lesson 14
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
Spatial smoothing of autocorrelations to control the degrees of freedom in fMRI analysis Keith Worsley Department of Mathematics and Statistics, McGill.
1 Identifying Robust Activation in fMRI Thomas Nichols, Ph.D. Assistant Professor Department of Biostatistics University of Michigan
SPM short – Mai 2008 Linear Models and Contrasts Stefan Kiebel Wellcome Trust Centre for Neuroimaging.
Multiple comparisons problem and solutions James M. Kilner
Topological Inference Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London SPM Course London, May 2015 With thanks to Justin.
The General Linear Model Christophe Phillips SPM Short Course London, May 2013.
The General Linear Model Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London SPM fMRI Course London, October 2012.
Bayesian fMRI analysis with Spatial Basis Function Priors
Biointelligence Laboratory, Seoul National University
The General Linear Model (GLM)
The General Linear Model (GLM)
Topological Inference
Variational Bayesian Inference for fMRI time series
The general linear model and Statistical Parametric Mapping
The General Linear Model
The General Linear Model (GLM)
Topological Inference
Contrasts & Statistical Inference
The General Linear Model
The general linear model and Statistical Parametric Mapping
The General Linear Model
The General Linear Model (GLM)
Contrasts & Statistical Inference
The General Linear Model
Multiple testing Justin Chumbley Laboratory for Social and Neural Systems Research Institute for Empirical Research in Economics University of Zurich.
Multiple testing Justin Chumbley Laboratory for Social and Neural Systems Research Institute for Empirical Research in Economics University of Zurich.
The General Linear Model (GLM)
The General Linear Model
The General Linear Model
Will Penny Wellcome Trust Centre for Neuroimaging,
Contrasts & Statistical Inference
Presentation transcript:

Wavelets and functional MRI Ed Bullmore Mathematics in Brain Imaging IPAM, UCLA July 21, 2004

What’s a wavelet? Wavelet, fractal, brain Wavelets and functional MRI data analysis - resampling or “wavestrapping” - estimating noise and signal parameters - hypothesis testing Overview (See Bullmore et al [2004] NeuroImage, IPAM issue, for more comprehensive literature review)

Wavelets are “little waves”, defined by their scale j = 1,2,3…J and their location k = 1,2,3…K = N/2 j …a family of orthonormal wavelets tesselates the time-frequency plane Wavelets are little waves… ψφ

Wavelets provide an orthonormal basis for multiresolution analysis The scaling function multiplied by the approximation coefficients, plus the wavelets multiplied by the detail coefficients, recovers the signal losslessly

Wavelets are adaptive to non-stationarity “If we are interested in transient phenomena – a word pronounced at a particular time, an apple located in the left corner of an image – the Fourier transform becomes a cumbersome tool.” Mallat (1998)

Wavelets have decorrelating or whitening properties Correlation between any two wavelet coefficients will decay exponentially as a exponential function of distance between them depending on… H = Hurst exponent of time series R = regularity of wavelet (number of vanishing moments)

Wavelets can be used to estimate fractal or spectral parameters The variance of wavelet coefficients within each scale of the DWT is related to the scale j … or H = Hurst exponent - simply related to the spectral exponent , e.g. for fractional Gaussian noise,  = 2H-1… …and the spectral exponent is simply related to the (Hausdorf) fractal dimension, D = T + (3-  )/2

Wavelet variance scaling in fMRI Greater variance at coarser scales of the decomposition Gradient of straight line fitted to log-linear plot = 2H+1

The discrete wavelet transform is quick to compute Mallat’s pyramid algorithm – iterative high and low pass filtering of downsampled coefficients – O(N) complexity, compared to O(N log(N)) for FFT

Fractal, scaling or scale-invariant, self-similar or self-affine structure is very common in nature, including biological systems A wavelet basis is fractal and so a natural choice of basis for analysis of fractal data Brain imaging data often have scaling properties in space and time Therefore, wavelets may be more than just another basis for analysis of fMRI data Wavelet, fractal, brain Continuous wavelet transform of fractal fMRI time series

Biological fractals… complex, patterned, statistically-self similar, scale invariant structures, with non-integer dimensions, generated by simple iterative rules, widespread in real and synthetic natural systems, including the brain.

Fractal cardiology Apparently complex, but simply generated, dendritic anatomy Statistically self-similar (self- affine) behavior in time 1/f-like spectral properties

Brain images often have fractal properties in space (and time) Biological Synthetic

Allometric scaling and self-similar scaling in brain structure Zhang & Sejnowski (2000) PNAS 97, Kiselev et al (2004) NeuroImage 20,

Power law scaling of local field potentials Leopold et al (2003) Cerebral Cortex

Power law scaling of fMRI time series Head movement is often a slow “drift”, which tends to exaggerate low frequencies in fMRI spectra; but, even after movement correction, fMRI time series often demonstrate power law scaling of spectral power - S{f} ~ f  or log(S{f}) ~  log(f)

Resampling or “wavestrapping” * Michael Breakspear Time series modeling and estimation * Voichita Maxim Hypothesis testing * Jalal Fadili and Levent Sendur Wavelets and fMRI data analysis

Time series is autocorrelated or colored Its wavelet coefficients are decorrelated or white … exchangeable under the null hypothesis The resampled series is colored Bullmore et al (2001) Human Brain Mapping 12, Resampling or “wavestrapping”

d1d1 d2d2 d3d3 d4d4 d5d5 d6d6 s6s6 DWT iDWT Time series resampling in the wavelet domain or 1D “wavestrapping” TimeWavelets Observed Resampled NB: Boundary correction issues inform choice of Daubechies wavelets – which have most compact support for any number of vanishing moments

1D wavestrapping preserves fMRI autocorrelation function If the exchangeability assumption was not satisfied, then the wavestrapped interval for the ACF would be unlikely to include the observed ACF… …the wavestrapped data would have a whiter spectrum and a flatter ACF than the observed data. identical resampling at all voxels preserves the spatial correlations in each slice permuting blocks of coefficients, or cyclically rotating coefficients within scale, are alternative resampling strategies experimental power at frequencies corresponding to a scale of the DWT may be preserved under wavestrapping

Fourier- compared to wavelet-based fMRI time series resampling Laird et al (2004) MRM Wavelet resampling is a superior method to Fourier- based alternatives for nonparametric statistical testing of fMRI data

2 or 3D discrete wavelet transform multiresolution spatial filtering of time series statistic maps spatially extended signals are losslessly described by wavelet coefficients at mutually orthogonal scales and orientations yx x xy y increased scale j, fewer wavelet coefficients {w j } 2D (i)DWT spatial map of GLM coefficients {  }

2-dimensional wavestrapping Breakspear et al (2004) Human Brain Mapping, in press 2D wavelet coefficients can be reshuffled randomly, in blocks, or rotated cyclically within each scale (left) Wavestrapping can be restricted to a subset of the image (right)

2D wavestrapping preserves spatial spectrum (autocorrelation structure) Different versions of 2D wavestrapping algorithm preserve observed spatial spectrum (top left) more or less exactly. Random shuffling of 2D coefficients (top right), block permutation (bottom left) and cyclic rotation of coefficients (bottom right) show increasingly close convergence with observed spatial spectrum relative lack of underpinning theory on decorrelating properties of >1D processes

3 or 4D spatiotemporal “wavestrapping” Breakspear et al (2004) Human Brain Mapping, in press The only nonparametric method for inference on task-specific functional connectivity statistics?

GLM with white errors; OLS is BLU estimator y = X  +  {  } iid ~ N(0,  = I  2 ) Estimating noise and signal parameters But fMRI errors are generally colored or endogenously autocorrelated autoregressive pre-whitening estimate unsmoothing kernel pre-coloring apply smoothing kernel Autoregressive pre-whitening is more efficient but may be inadequate in the case of long-memory errors…

Autoregressive pre-whitening may not always work well FMRI data may frequently have significant residual autocorrelation after attempted pre- whitening by AR(1) and AR(3) models A serious side-effect of inadequate pre-whitening is loss of nominal type 1 error control in tests of  /SE(  )

Fractional Gaussian noise as a model for functional MRI errors For fMRI, we are interested in the GLM with fractional Gaussian noise errors y = X  +  {  } iid ~ N(0,    ) Gaussian noise is the increment process of Brownian motion… …fractional Gaussian noise is the increment process of fractional Brownian motion fGn is a zero mean stationary process characterised by two parameters, H and  

Some fractional Gaussian noises H < 0.5, anti-persistent H = 0.5, white H > 0.5, persistent or 1/f

Estimators of fractional Gaussian noise parameters in time and wavelet domains Hurst exponentVariance A maximum likelihood estimator in the wavelet domain (wavelet-ML) gives the best overall performance in terms of bias and efficiency of estimation of H and  2 : Fadili & Bullmore (2001); Maxim et al (2004)

Maximum likelihood estimation of fGn parameters in wavelet domain Likelihood function of G, a fractional Gaussian noise, includes the inverse of its covariance matrix  S is assumed to be diagonalised in the wavelet domain, considerably simplifying its inversion

Wavelet ML estimation of signal and fGn parameters

Resting fMRI data looks like fGn (after head movement correction)

FMRI noise as specimens of fGn High variance, antipersistent noise in lateral ventricles anti-persistence often located in CSF spaces High variance, persistent noise in medial posterior cortex persistence often located symmetrically in cortex

Alzheimer’s disease associated with abnormal cortical fGn parameters fGn parameter maps can be co- registered in standard space and compared between groups using a cluster-level permutation test Preliminary evidence for significantly increased persistence of fMRI noise in lateral and medial temporal cortex, precentral and premotor cortex, dorsal coingulate and insula in patients (N=10) with early AD, compared to healthy elderly volunteers (N=10) Maxim et al (2004)

Hypothesis testing V ~ 20,000 voxels  i  V = E(FP), so if  i  = 0.05, E(FP) = 1,000 Bonferroni  (i) = 0.05/V False discovery rate (FDR)  (i) >= 0.05/V GRF  (i) < 0.05/V for small d.f.

Why wavelets for multiple hypothesis testing for spatial statistic maps? Compaction (few large, many small coefficients) Multiresolution properties obviate difficulties concerning optimal choice of monoresolution Gaussian smoothing kernel (MFT) Various strategies, including wavelet shrinkage, available to reduce the search volume prior to hypothesis testing (enhanced FDR) Known dependencies of neighboring coefficients under the alternative hypothesis (Bayesian bivariate shrinkage) Non-orthogonal wavelets exist and may be advantageous for representing edges and lines (dual tree complex wavelet transform)

Wavelet shrinkage algorithms to control false discovery rate Take 2D or 3D wavelet transform of spatial {  } maps Calculate P-value for each detail coefficient Sort the P-values in ascending order and find the order statistic i such that Calculate the critical threshold Apply hard or soft thresholding to all coefficients and take the inverse wavelet transform to recover the thresholded map in space

Wavelet FDR control is multiresolution Wavelet FDR = 0.01 retains activation in bilateral motor cortex and ipsilateral cerebellum Spatial FDR = 0.01, after monoresolution smoothing with Gaussian kernel, retains activation in contralateral motor cortex with FWHM = 6 mm but loses cerebellar activation with FWHM = 18 mm

Enhanced wavelet FDR control is more powerful than “vanilla” wavelet FDR Enhanced FDR =  means FDR (and FWER) are also <=  (Shen et al, 2002)

Enhanced FDR control in wavelet domain Shen et al (2002); Sendur et al (2004) Generalised degrees of freedom is estimated by Monte Carlo integration to reduce the search volume for hypothesis testing

Bayesian bivariate shrinkage and enhanced FDR control Say w 1 is the child of w 2 … MAP estimator of w 1 is… Test only coefficients which survive shrinkage by this rule

Complex (dual tree) wavelet transform may be advantageous for mapping edges Complex wavelet coefficients are estimated by dual tree algorithm – their magnitude is shift invariant

Conclusions Wavelets are versatile tools, particularly applicable to analysis of scaling, scale-invariant or fractal processes in time (or space, or both) Brain images often have scaling or 1/f properties and wavelets can provide estimators of their fractal parameters, e.g. Hurst exponent Specific wavelet applications to fMRI data analysis include: Resampling (up to 4D) Estimation (robust and informative noise models) Hypothesis testing (multiresolution and enhanced FDR control)

Thanks John Suckling Mick Brammer Jalal Fadili Michael Breakspear Levent Sendur Raymond Salvador Voichita Maxim Brandon Whitcher Human Brain Project, National Institute of Mental Health and National Institute of Biomedical Imaging & Bioengineering Wellcome Trust GlaxoSmithKline plc