Classifying Event-Related Desynchronization in EEG, ECoG, and MEG Signals Kim Sang-Hyuk.

Slides:



Advertisements
Similar presentations
Support Vector Machine
Advertisements

ECG Signal processing (2)
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
An Introduction of Support Vector Machine
Data Mining Classification: Alternative Techniques
Data Mining Classification: Alternative Techniques
Pattern Recognition and Machine Learning
Support vector machine
Brain-computer interfaces: classifying imaginary movements and effects of tDCS Iulia Comşa MRes Computational Neuroscience and Cognitive Robotics Supervisors:
Visualization of dynamic power and synchrony changes in high density EEG A. Alba 1, T. Harmony2, J.L. Marroquín 2, E. Arce 1 1 Facultad de Ciencias, UASLP.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Segmentación de mapas de amplitud y sincronía para el estudio de tareas cognitivas Alfonso Alba 1, José Luis Marroquín 2, Edgar Arce 1 1 Facultad de Ciencias,
Body Sensor Networks to Evaluate Standing Balance: Interpreting Muscular Activities Based on Intertial Sensors Rohith Ramachandran Lakshmish Ramanna Hassan.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
A commonly used feature to discriminate between hand and foot movements is the variance of the EEG signal at certain electrodes. To this end, one calculates.
Data mining and statistical learning - lecture 13 Separating hyperplane.
Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.
Lecture 10: Support Vector Machines
Theory Simulations Applications Theory Simulations Applications.
Introduction to Neural Networks Kenji Nakayama Kanazawa University, JAPAN 適応システム理論 ガイダンス.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Feature Creation and Selection.
Support Vector Machines
ERP DATA ACQUISITION & PREPROCESSING EEG Acquisition: 256 scalp sites; vertex recording reference (Geodesic Sensor Net)..01 Hz to 100 Hz analogue filter;
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Machine Learning Week 4 Lecture 1. Hand In Data Is coming online later today. I keep test set with approx test images That will be your real test.
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
SVM by Sequential Minimal Optimization (SMO)
Introduction to variable selection I Qi Yu. 2 Problems due to poor variable selection: Input dimension is too large; the curse of dimensionality problem.
Ajay Kumar, Member, IEEE, and David Zhang, Senior Member, IEEE.
Support Vector Machines Mei-Chen Yeh 04/20/2010. The Classification Problem Label instances, usually represented by feature vectors, into one of the predefined.
T – Biomedical Signal Processing Chapters
1 Methods for detection of hidden changes in the EEG H. Hinrikus*, M.Bachmann*, J.Kalda**, M.Säkki**, J.Lass*, R.Tomson* *Biomedical Engineering Center.
Multimodal Information Analysis for Emotion Recognition
Functional Brain Signal Processing: EEG & fMRI Lesson 4
CS Statistical Machine learning Lecture 18 Yuan (Alan) Qi Purdue CS Oct
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
National University of Tainan 國立台南大學資訊教育研究所 人工智慧實驗室 1 A new approach in the BCI research based on fractal dimension as feature and Adaboost as classifier.
Mining Binary Constraints in Feature Models: A Classification-based Approach Yi Li.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Distributed Representative Reading Group. Research Highlights 1Support vector machines can robustly decode semantic information from EEG and MEG 2Multivariate.
Using Feed Forward NN for EEG Signal Classification Amin Fazel April 2006 Department of Computer Science and Electrical Engineering University of Missouri.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
컴퓨터 과학부 김명재.  Introduction  Data Preprocessing  Model Selection  Experiments.
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Autoregressive (AR) Spectral Estimation
Acknowledgement Work supported by NINDS (grant NS39845), NIMH (grants MH42900 and 19116) and the Human Frontier Science Program Methods Fullhead.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Lecture notes for Stat 231: Pattern Recognition and Machine Learning 1. Stat 231. A.L. Yuille. Fall 2004 Linear Separation and Margins. Non-Separable and.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
SVMs in a Nutshell.
Support Vector Machine: An Introduction. (C) by Yu Hen Hu 2 Linear Hyper-plane Classifier For x in the side of o : w T x + b  0; d = +1; For.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Support Vector Machines Reading: Textbook, Chapter 5 Ben-Hur and Weston, A User’s Guide to Support Vector Machines (linked from class web page)
Support Vector Machines
LINEAR CLASSIFIERS The Problem: Consider a two class task with ω1, ω2.
[Ran Manor and Amir B.Geva] Yehu Sapir Outlines Review
Machine Learning Feature Creation and Selection
Support Vector Machines Introduction to Data Mining, 2nd Edition by
Perceptual Echoes at 10 Hz in the Human Brain
Dynamic Causal Modelling for M/EEG
Roman F. Loonis, Scott L. Brincat, Evan G. Antzoulatos, Earl K. Miller 
Support Vector Machines
Machine Learning for Visual Scene Classification with EEG Data
Analysis of Dynamic Brain Imaging Data
Presented by Chen-Wei Liu
Valerio Mante, Vincent Bonin, Matteo Carandini  Neuron 
Presentation transcript:

Classifying Event-Related Desynchronization in EEG, ECoG, and MEG Signals Kim Sang-Hyuk

Bioimaging Introduction Experimental setup and procedure Preanalysis Data processing Generalization error estimation Contents

Bioimaging Introduction Several different technologies exist for measuring brain activity – They have their own advantages and limitations – Spatial and temporal resolution – Cost, portability and risk to the user Comparative studies are required in order to guide Motor-imagery BCI experiments based on Electroencephalography (EEG), electrocorticography (ECoG) and magnetoencephalography (MEG) A simple binary synchronous (trial-based) paradigm Present quantitative results focusing on – The effect of the number of trial – The effect of spatial filtering

Bioimaging Introduction EEG – Electrical signals are measured by passive electrodes – Very high temporal resolution – Low cost, risk, and portability – Limitation of spatial resolution ECoG – Electrical signals obtained from an array of electrodes beneath the skull – High SNR – A better response at higher frequencies – Invasive MEG – Measuring the tiny magnetic field fluctuations induced by the electrical activity of cerebral neurons – Expensive and nonportable

Bioimaging Experimental Setup and Procedure EEG – 8 untrained right handed male subjects – 39 silver chloride electrodes – Sampling frequency: 256Hz – The subjects were seat in an armchair at 1-m distance in front of a computer screen : used for data acquisition : reference Positions of electrodes

Bioimaging Experimental Setup and Procedure Each trial started with a blank screen A small fixation cross displayed in the center of the screen from second 2 to 9 At 2s, a short warning tone (beep) At 3s, the fixation cross was overlaid with an arrow at the center of the monitor for 1.5s – The direction of arrow point either to the left or to the right In order to avoid event related signals in later processing stages, only data from seconds 4 to 9 of each trial was considered

Bioimaging Preanalysis In order to identify and exclude subjects that did not show significant μ- activity at all Restricted to only the 17 EEG channels that were located over or close to the motor cortex – Calculate of the μ-band using the Welch method (short time Fourier transform) for each subject This feature extraction resulted in one parameter per trial and channel The eight data sets consisting of the Welch-features were classified with linear support vector machines including individual model selection for each subject Generalization errors were estimated by 10-fold cross validation (CV) For three subjects the preanalysis showed very poor error rates close to chance level, their data sets were excluded from further analysis

Bioimaging Preanalysis  Short Time Fourier Transform (STFT) A Fourier-related transformation used to examine the frequency and phase content of local sections of a signal over time Discrete-time STFT – W[n] is the window function – Window is sliding along time axis Examples of window overlap

Bioimaging Preanalysis  Short Time Fourier Transform (STFT) Examples of STFT

Bioimaging  Short Time Fourier Transform (STFT) 5 segment for a trial, overlapping 50% Averaging the spectra of 5 A vector of log amplitudes at different frequencies for each sensor Preanalysis 5 A trial 5 A vector Averaging

Bioimaging  Autoregressive (AR) Model AR(p) model is defined as – Where are the parameters of the model – P is order The output is modeled as a linear combination of P past values of the output For the remaining five subjects, the recorded 5s windows of each trial resulted in a time series of 1280 sample points per channel AR model of order 3 is fitted to the time series of all 39 channels using forward backward linear prediction – The three resulting coefficients per channel and trial formed the new representation of the data – The extraction of the features did not explicitly incorporate prior knowlede – They are not directly linked to the μ-rhythm Data Preprocessing

Bioimaging Support Vector Machine  Linear Support Vector Machine Choose a decision boundary between classes such that margin is maximized – Margin: the distance in feature space between the boundary and the nearest data points (support vectors) Linearly separable case

Bioimaging Support Vector Machine  Linear Support Vector Machine The function of hyperplane – : weight vector normal to hyperplane – : threshold The distance of a point from a hyperplane

Bioimaging Support Vector Machine  Linear Support Vector Machine Scale so that the value of, at the support vectors, is equal to 1 for S 1 and equal to -1 for S 2 – Margin: – Compute the parameters, of the hyperplane so that to: – Minimize – Subject to to where corresponding class indicator (+1 for, -1 for ) S2S2 S1S1

Bioimaging Support Vector Machine  Linear Support Vector Machine The Karush-Kuhn-Tucker (KKT) conditions – is the vector of the Lagrange multipliers – is the Lagrangian function defined as Finally results are

Bioimaging Support Vector Machine  Soft Margin Support Vector Machine In the case where the classes are not separable, soft margin support vector machine is available The training feature vectors categorized into three cases – Vectors that fall outside the band and are correctly classified – Vectors falling inside the band and which are correctly classified – Vectors that are misclassified

Bioimaging Support Vector Machine  Soft Margin Support Vector Machine All three cases can be treated under a single type of constraints – The first category of data: – The second: – The third: The goal is to make the margin as large as possible but at the same time to keep the number of points with as small as possible Cost function – Where is the vector of the parameters

Bioimaging Support Vector Machine  Soft Margin Support Vector Machine The parameter C is a positive constant that controls the relative influence of the two competing terms Optimization of the cost function is difficult due to a discontinuous function – A closely related cost function – Minimize – Subject to Depending on C, the optimal margin will widen and more points will become support vectors – Finding a good value for C is part of the model selection procedure

Bioimaging Generalization Error Estimation  K-Fold Cross Validation A statistical method for validating a predictive model Whole data is separated into k subsets (folds) of equal size Each fold is also divided into k subsets and k subsets are categorized into train set and test set – K-1 subsets are used for training of classifier – 1 set is used for validation Model training and evaluation is repeated k times with each of the k subsets An example of 5-fold cross-validation

Bioimaging Contents of Next Lecture Feature Selection Method – Fisher criterion – Zero-norm optimization – Recursive feature elimination (RFE) Results in EEG Procedure and results in ECoG Procedure and results in MEG Overview of results in EEG, ECoG and MEG