Download presentation

Presentation is loading. Please wait.

Published byKennedy Thorman Modified about 1 year ago

1
2011 COURSE IN NEUROINFORMATICS MARINE BIOLOGICAL LABORATORY WOODS HOLE, MA GENERALIZED LINEAR MODELS Uri Eden BU Department of Mathematics and Statistics August 17, 2011

2
Outline Quick introduction to GLM theory GLM model for inhomogeneous Poisson spiking History dependent GLM model of retinal neurons in culture A GLM model of learning in behavioral experiments

3
3 Simple linear regression How does the height of a son depend of the height of his father? Father’s Height (inches)

4
4 Simple linear regression How does the height of a son depend of the height of his father? Father’s Height (inches) A B C Which is the correct regression line?

5
5 Count data Linear regression methods are not well suited for count data Time (msec)

6
6 Binary data Linear regression methods are not well suited for binary data Trial (msec)

7
Generalized Linear Models Linear regression models of the form: are useful for relating continuous valued observations to a set of covariates. Many types of data cannot be described by a Gaussian additive noise model. Generalized linear models extend a simple class of models to these data types. Count data:Binary data:

8
The Natural Exponential Family A probability model for the data {y 1,…, y n } is in the exponential family if you can write its likelihood in the form: Some common distributions in the exponential family include: Normal, Bernoulli, binomial, Poisson, gamma, beta, exponential, chi-square, lognormal, …

9
Generalized Linear Models Set the link function,, to be a linear function of the covariates, Differentiate the log likelihood with respect to the parameters, set equal to zero, and solve the resulting system of equations of the form

10
The Natural Exponential Family Poisson Data: So the link function is:

11
The Natural Exponential Family Binomial Data: So the link function is:

12
Link Function DistributionEquation logitBinomial logPoisson GLM Models for Spike Data

13
Fitting GLM As with ISI models, use maximum likelihood to obtain GLM parameters. In general it is not possible to obtain a closed form solution for the ML estimator or for its distribution. So, use your favorite numerical optimization technique (such as Newton’s method), or my favorite: MATLAB™

14
Stochastic Models Linear Regression Properties of GLM: Convex likelihood surface Estimators asymptotically have minimum MSE Generalized Linear Models (GLM) Neural Spiking Models

15
Case 1: Inhomogeneous Poisson Model Construct an inhomogeneous Poisson spiking model for repeated trial data as a function of time Time (ms) Trial

16
Case 1: Inhomogeneous Poisson Model Construct an inhomogeneous Poisson spiking model for repeated trial data as a function of time Time (ms) Firing Rate (Hz)

17
Case 1: Inhomogeneous Poisson Model For an inhomogeneous Poisson model for repeated trial data as a function of time –Polynomial model: or:

18
Case 1: Inhomogeneous Poisson Model Inhomogeneous Poisson GLM using 5 th order polynomial in time Time (ms) Firing Rate (Hz)

19
Case 1: Inhomogeneous Poisson Model Inhomogeneous Poisson GLM using 50 th order polynomial in time Time (ms) Firing Rate (Hz)

20
Goodness-of-fit measures Akaike’s Information Criterion: For maximum likelihood estimates it measures the trade-off between maximizing the likelihood (minimizing and the numbers of parameters the model requires. ) Selecting the (parsimonious) model that minimizes the AIC: Helps prevent overfitting Is asymptotically equivalent to complete leave-one- out cross-validation Asymptotically minimizes the KL distance between the selected model and the true unknown model

21
Case 1: Inhomogeneous Poisson Model AIC plot of model order Model order AIC

22
Case 1: Inhomogeneous Poisson Model For an inhomogeneous Poisson model for repeated trial data as a function of time –Spline model: where are spline basis functions

23
Case 1: Inhomogeneous Poisson Model Inhomogeneous Poisson GLM using spline fit in time Time (ms) Firing Rate (Hz)

24
Case 2: An Analysis of the Spiking Activity of Retinal Neurons in Culture (Iygengar and Liu, 1997) Retinal neurons are grown in culture under constant light and environmental conditions. The spontaneous spiking activity of these neurons is recorded. The objective is to develop a statistical model which accurately describes the stochastic structure of this activity.

25

26

27

28

29

30
Discrete Time Spike Train Data dN 1 dN 2 dN 3 dN 4 dN 5 dN 6 dN dN k is the spike indicator function in interval k is the intensity of spiking at time k, which in the limit is given by

31
GLM History Model How do we pick a model order? The ISI distribution models we constructed assume that Now, let the conditional intensity be a function of past spiking activity using GLM

32
Partial Correlogram

33
GLM Coefficients Lag (msec) Coefficient Value

34
AIC Results GLM Order AIC

35
KS Plots Graphical measure of goodness-of-fit, based on time rescaling, comparing an empirical and model cumulative distribution function. If the model is correct, then the rescaled ISIs are independent, identically distributed random variables whose KS plot should produce a 45° line [Ogata, 1988].

36
Uniform Quantiles Model CDF Kolmogorov-Smirnov Plots

37
Uniform Quantiles Model CDF KS Plots for Different Order GLMs

38
ISI Lag Order (rescaled to Gaussian) Correlation Correlation Function for Rescaled ISIs

39
Uniform Quantiles Model CDF GLM Model Classes

40
AIC and KS Statistics PoissonBinomial Order AIC KS ExpGamma Inv. Gauss KS Statistic Parametric Models:

41
ISI Histogram ISI (msec) Probability Density Exponential Gamma Inverse Gaussian Order 50 GLM

42
Inferences and Conclusions Iyengar and Liu showed that a generalized inverse Gaussian model described these data well. The fit of history-dependent GLM model improves appreciably on the fits of the exponential, gamma and inverse Gaussian models, most notably in terms of KS plots. Our analysis shows that the GLM model describes the essential stochastic features in the data. There is a significant history dependence in the retinal neural spiking data extending back 14 msec. There is another effect going back approximately 100 msec. The shorter time-scale phenomena may reflect intrinsic dynamics of the individual neuron whereas the longer time-scale effects may also include network dynamics.

43
Remarks 1.Only 14 parameters are used to fit ~ 30,000 data points! 2.This type of strong history dependent effect is something we have seen in neurons from a number of different brain regions, animal models and experimental protocols. It was all simply described by GLM fitting. Truccolo W, Eden UT, Fellow M, Donoghue JD, Brown EN. A point process framework for relating neural spiking activity to spiking history, neural ensemble and covariate effects. Journal of Neurophysiology, 2005, 93: Kass RE, Ventura V, Brown EN. Statistical issues in the analysis of neuronal data. Journal of Neurophysiology, 2005, 94: 8-25.

44
Relating Learning Dynamically to Neural Activity Single cell recording in monkey hippocampus Trial and error learning of association between picture and response Wirth et al. Science 2003 Smith et al. Neural Computation 2003 Smith et al. Journal of Neuroscience, 2004

45
Behavioral Learning Experiment (Binary Regression) Monkeys were trained to saccade to one of four targets, based on displayed images. Each day the monkey had to learn new associations between images and targets. Single cell recording in monkey hippocampus

46
Behavioral Learning Data Model Question: Can we estimate a learning curve to characterize performance as a function of trial? Can we use it to establish a learning criterion? Logistic Regression where is the trial number is the probability of a correct response on trial k the logistic regression coefficients

47
Behavioral Learning Experiment Data Trial Number Probability of a Correct Response red incorrect blue correct Learning trial Chance performance

48
Parameter Value p-value AIC AIC difference const trial number Generalized Linear Model Summary Inference There is a significant improvement in performance during the experiment. Performance is better than would be expected by chance (0.25) from trial 21 onward. This behavior is consistent with the animal learning the task. The logistic regression function provides an estimate of the animal’s learning curve and a framework for defining the learning trial.

49
Remark Dynamic analyses of these data based on state-space models are more informative and do not require the monotonic assumption of the logistic regression model. Wirth et al. (2003), Smith et al. (2004), Smith et al. (2005)

50
Summary GLM provides a computationally tractable generalization of the Gaussian linear model to non-Gaussian regression models. Estimation is carried out using maximum likelihood. This analysis has all the properties of maximum likelihood. AIC, deviance and parameter standard errors provide measures of goodness-of-fit and an inference framework analogous to regression. Can be applied to other exponential family models. Non-canonical link functions can also be used. GLM is a standard tool in Matlab, Minitab, R, S, SAS, Splus, and SPSS.

51
Monkeys were trained to saccade to one of four targets, based on displayed images. Single cell recording in monkey hippocampus. GLM Peristimulus Time Histogram

52
Time (sec) Trial Peristimulus Time Histogram Spiking Data

53
Model Parameter vector: Basis functions: –Indicator Functions: –Splines:

54
Indicator Function Basis

55
Spline Function Basis

56
Uniform CDF Model CDF Lag ACF of rescaled TimesKS Plot Goodness-of-Fit

57
Adding History Uniform CDF Model CDF KS Plot

58
Adding History

59
Stochastic Models Linear Regression Properties of GLM: Convex likelihood surface Estimators asymptotically have minimum MSE Generalized Linear Models (GLM) Neural Spiking Models

60
GLM Neural Models By selecting an appropriate set of basis functions we can capture arbitrary functional relations. Analysis of relative contributions of components to spiking Truccolo W, Eden UT, Fellows MR, Donoghue JP, Brown EN. (2004) J. Neurophys 93:

61
Monkeys were trained to saccade to one of four targets, based on displayed images. Single cell recording in monkey hippocampus. Dynamic Peristimulus Time Histogram Acknowledgements: Gabriela Czanner

62
Motivation Characterize trial-by-trial changes in firing properties Neither the PSTH nor the spike count per trial is adequate. PSTH (bin size 10 ms)

63
Model Parameter vector: Basis functions: –Indicator Functions: –Splines: Assume stochastic continuity between trials:

64
Results: Simulated Data T = 5 sec, K = 50 trials Data simulated from a model with polynomial basis functions of 4-th order True intensity function Contour plot of the true intensity function

65
Results: Simulated Data True firing intensity function Naïve estimate of the intensity 20*50 Parameters EM estimate with step- functions 20 State Parameters EM estimate with splines 5 State Parameters

66
Back to the Data

67
Results: Real Data Naïve estimate of intensityEM estimate: state-space model with step-functions EM estimate: state-space model with splines

68
Adding History

69

70
Conclusions We can construct and fit (using maximum likelihood) simple generalized linear models that capture the statistical properties of the spike train time series. We used the sample partial correlation function, the distribution of estimators and AIC to suggest the order of the model. AIC and the KS statistic are measures of goodness-of-fit between the model and the data.

71
Empirical Quantiles Model Quantiles KS and Cross-Validation

Similar presentations

© 2016 SlidePlayer.com Inc.

All rights reserved.

Ads by Google