Variational filtering in generated coordinates of motion

Slides:

Advertisements

Similar presentations

Pattern Recognition and Machine Learning

Advertisements

Pattern Recognition and Machine Learning

J. Daunizeau Institute of Empirical Research in Economics, Zurich, Switzerland Brain and Spine Institute, Paris, France Bayesian inference.

Bayesian inference Lee Harrison York Neuroimaging Centre 01 / 05 / 2009.

Bayesian Belief Propagation

Mobile Robot Localization and Mapping using the Kalman Filter

Non-Gaussian Data Assimilation using Mixture and Kernel Piyush Tagade Hansjorg Seybold Sai Ravela Earth Systems and Signals Group Earth Atmospheric and.

Bayesian models for fMRI data

ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.

Biointelligence Laboratory, Seoul National University

The free-energy principle: a rough guide to the brain? Karl Friston

Visual Recognition Tutorial

Pattern Recognition and Machine Learning

Discriminative Training of Kalman Filters P. Abbeel, A. Coates, M

Visual Recognition Tutorial

Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.

G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.

CDB Exploring Science and Society Seminar Thursday 19 November 2009 at 5.30pm Host: Prof Giorgio Gabella The Bayesian brain, surprise and free-energy.

ROBOT MAPPING AND EKF SLAM

Dynamic Causal Modelling THEORY SPM Course FIL, London October 2009 Hanneke den Ouden Donders Centre for Cognitive Neuroimaging Radboud University.

Abstract We start with a statistical formulation of Helmholtz’s ideas about neural energy to furnish a model of perceptual inference and learning that.

The free-energy principle: a rough guide to the brain? K Friston Summarized by Joon Shik Kim (Thu) Computational Models of Intelligence.

Ch 6. Kernel Methods Pattern Recognition and Machine Learning, C. M. Bishop, Summarized by J. S. Kim Biointelligence Laboratory, Seoul National University.

Dynamic Causal Modelling (DCM): Theory Demis Hassabis & Hanneke den Ouden Thanks to Klaas Enno Stephan Functional Imaging Lab Wellcome Dept. of Imaging.

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.

Overview Particle filtering is a sequential Monte Carlo methodology in which the relevant probability distributions are iteratively estimated using the.

Mean Field Variational Bayesian Data Assimilation EGU 2012, Vienna Michail Vrettas 1, Dan Cornford 1, Manfred Opper 2 1 NCRG, Computer Science, Aston University,

CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.

Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.

Bayesian Inference and Posterior Probability Maps Guillaume Flandin Wellcome Department of Imaging Neuroscience, University College London, UK SPM Course,

HQ U.S. Air Force Academy I n t e g r i t y - S e r v i c e - E x c e l l e n c e Improving the Performance of Out-of-Order Sigma-Point Kalman Filters.

Dynamic Causal Modelling Will Penny Wellcome Department of Imaging Neuroscience, University College London, UK FMRIB, Oxford, May

Abstract We offer a formal treatment of choice behaviour based on the premise that agents minimise the expected free energy of future outcomes. Crucially,

Abstract This presentation questions the need for reinforcement learning and related paradigms from machine-learning, when trying to optimise the behavior.

An Introduction to Kalman Filtering by Arthur Pece

SPM short course Functional integration and connectivity Christian Büchel Karl Friston The Wellcome Department of Cognitive Neurology, UCL London UK http//:

Tracking with dynamics

Ch. 5 Bayesian Treatment of Neuroimaging Data Will Penny and Karl Friston Ch. 5 Bayesian Treatment of Neuroimaging Data Will Penny and Karl Friston 18.

6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,

Dynamic Causal Model for evoked responses in MEG/EEG Rosalyn Moran.

Bayesian inference Lee Harrison York Neuroimaging Centre 23 / 10 / 2009.

Dynamic Causal Models Will Penny Olivier David, Karl Friston, Lee Harrison, Andrea Mechelli, Klaas Stephan Mathematics in Brain Imaging, IPAM, UCLA, USA,

Bayesian selection of dynamic causal models for fMRI Will Penny Olivier David, Karl Friston, Lee Harrison, Andrea Mechelli, Klaas Stephan The brain as.

Dynamic Causal Models Will Penny Olivier David, Karl Friston, Lee Harrison, Stefan Kiebel, Andrea Mechelli, Klaas Stephan MultiModal Brain Imaging, Copenhagen,

Bayesian Model Comparison Will Penny Wellcome Department of Imaging Neuroscience, University College London, UK UCL, June 7, 2003 Large-Scale Neural Network.

Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.

Learning Deep Generative Models by Ruslan Salakhutdinov

Ch 12. Continuous Latent Variables ~ 12

Dynamic Causal Modeling of Endogenous Fluctuations

Fundamental principles of particle physics

Particle Filtering for Geometric Active Contours

Effective Connectivity

Dynamic Causal Modelling (DCM): Theory

Computational models for imaging analyses

Bayesian Models in Machine Learning

Filtering and State Estimation: Basic Concepts

Wellcome Trust Centre for Neuroimaging University College London

Dynamic Causal Modelling

The free-energy principle: a rough guide to the brain? K Friston

SPM2: Modelling and Inference

Bayesian Methods in Brain Imaging

CRIS Workshop: Computational Neuroscience and Bayesian Modelling

JFG de Freitas, M Niranjan and AH Gee

Effective Connectivity

Bayesian inference J. Daunizeau

Ch 3. Linear Models for Regression (2/2) Pattern Recognition and Machine Learning, C. M. Bishop, Previously summarized by Yung-Kyun Noh Updated.

Bayesian Model Selection and Averaging

Presentation transcript:

Variational filtering in generated coordinates of motion Approximate Inference in Stochastic Processes and Dynamical Systems Abstract This presentation reviews variational treatments of dynamic models that furnish time-dependent conditional densities on the path or trajectory of a system's states and the time-independent densities of its parameters. These obtain by maximizing a variational action with respect to conditional densities. The action or path-integral of free-energy represents a lower-bound on the model’s log-evidence or marginal likelihood required for model selection and averaging. This approach rests on formulating the optimization in generalized co-ordinates of motion. The resulting scheme can be used for on-line Bayesian inversion of nonlinear dynamic causal models and is shown to outperform existing approaches, such as Kalman and particle filtering. Furthermore, it provides for multiple inference on a models states, parameters and hyperparameters using exactly the same principles. Free-form (Variational filtering) and fixed form (Dynamic Expectation Maximization) variants of the scheme will be demonstrated using simulated (bird-song) and real data (from hemodynamic systems studied in neuroimaging).

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs notation

Hierarchical dynamic causal models

and generalised coordinates Dynamic models and generalised coordinates likelihood prior

Energies and generalised precisions Instantaneous energy General and Gaussian forms Precision matrices in generalised coordinates and time

Hierarchal forms and empirical priors Dynamic priors Structural priors

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs

Variational learning (steady-state) Aim: To optimise a free-energy bound on model evidence Free-energy: Expected energy: Entropy: Lemma 1: The free energy is maximised with respect to when Variational density: Mean-field approximation: Variational energy: Internal energy:

Ensemble learning (steady-state) Let the equations of motion for each particle be Because particles are conserved, their density over parameter space is governed by the free energy Fokker-Plank equation It is trivial to show that the stationary solution for the ensemble density is the variational density by substitution

Variational learning (dynamic) In a dynamic setting, the variational density and energy become functionals of time. By analogy with Lagrangian mechanics, denote the action by the anti-derivative or path-integral of free energy We now seek variational densities that maximise action. It is fairly easy to show that the solutions are functionals of the instantaneous energy Where and are the prior energies 21

Ensemble learning (dynamic) Lemma 2: is the stationary solution, in a moving frame of reference, for an ensemble whose equations of motion and ensemble dynamics are Proof: Substituting the variational density; gives This describes a stationary density under a moving frame of reference, with velocity as seen using the co-ordinate transform

Ensemble dynamics in generalized coordinates 20 40 60 80 100 120 -2 -1 1 2 3 4 5 -2 2 -5 5

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs

The Laplace approximation – why is it useful? Under the Laplace approximation, the variational density assumes a Gaussian form The conditional precision (a function of the mode) obtains by extremising the free-energy

… for dynamic models The conditional precisions obtain by extremising the action Conditional precisions: which leaves the variational modes that optimise variational energy and action Variational energy and actions: Mean-field terms:

Approximating the mode Lemma 3: The path of a particle, whose motion is converges exponentially to the mode Where the trajectory of the conditional mode can be realised with a local linearisation (Ozaki 1992) to give the update

… an augmented gradient ascent In dynamic systems, the trajectory of the conditional mode maximises variational action, which is the solution to the ansatz Here, can be regarded as motion in a frame of reference that moves along the trajectory encoded in generalised coordinates. The stationary solution, in this moving frame of reference, maximises variational action. This can be seen easily by noting This is sufficient for the mode to maximise variational action (by the Fundamental lemma); c.f., Hamilton's principle of stationary action

DEM and belief propagation D-Step inference E-Step learning M-Step uncertainty A dynamic recognition system that minimises prediction error

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs

A linear convolution model Prediction error Generation Inversion

Variational filtering on states and causes 5 10 15 20 25 30 -1 -0.5 0.5 1 1.5 hidden states time cause 5 10 15 20 25 30 -0.4 -0.2 0.2 0.4 0.6 0.8 1 1.2 cause time {bins}

Linear deconvolution with variational filtering (SDEs) Linear deconvolution with Dynamic expectation maximisation (ODE)

The order of generalised motion Precision in generalised coordinates The order of generalised motion 10 20 30 40 -1.5 -1 -0.5 0.5 1 1.5 2 time Accuracy and embedding (n) 1 3 5 7 9 11 13 2 4 6 sum squared error (causal states) 5 10 15 20 25 30 35 -1.5 -1 -0.5 0.5 1 1.5 2 time

DEM and extended Kalman filtering hidden states DEM and extended Kalman filtering 5 10 15 20 25 30 35 -1.5 -1 -0.5 0.5 1 time DEM(0) DEM(4) EKF true With convergence when sum of squared error (hidden states) EKF 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 DEM(0) DEM(4) 10 20 30 40 -1 -0.5 0.5 1 time hidden states DEM(0) EKF

A nonlinear convolution model level This system has a slow sinusoidal input or cause that excites increases in a single hidden state. The response is a quadratic function of the hidden states (c.f., Arulampalam et al 2002).

DEM and particle filtering Comparative performance Sum of squared error

Triple estimation (DEM) Inference on states Triple estimation (DEM) Learning parameters

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs

An fMRI study of attention Stimuli 250 radially moving dots at 4.7 degrees/s Pre-Scanning 5 x 30s trials with 5 speed changes (reducing to 1%) Task: detect change in radial velocity Scanning (no speed changes) 4 x 100 scan sessions; each comprising 10 scans of 4 different conditions F A F N F A F N S ................. A – dots, motion and attention (detect changes) N – dots and motion S – dots F – fixation PPC V5+ Buchel et al 1999

Output: a mixture of intra- and extravascular signal A hemodynamic model Visual input Motion Attention convolution kernel state equations output equation Output: a mixture of intra- and extravascular signal

Hemodynamic deconvolution Inference on states Hemodynamic deconvolution Learning parameters

… and a closer look at the states

Overview Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs

Synthetic song-birds syrinx hierarchy of Lorenz attractors

Song recognition with DEM

… and broken birds

Summary Dynamic causal models Generalised coordinates (dynamical priors) Hierarchal forms (structural priors) Variational filtering and action (free-form) Laplace approximation and DEM (fixed-form) Comparative evaluations Hemodynamics Bird songs