G. Suarez, J. Soares, S. Lopez, I. Obeid and J. Picone

Slides:



Advertisements
Similar presentations
M. Tabrizi: Seizure Detection May, A COMPARATIVE ANALYSIS OF NONLINEAR FEATURES FOR AN HMM-BASED SEIZURE DETECTION SYSTEM Masih Tabrizi and Joseph.
Advertisements

Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Take the Survey! Big data needs? How could membership benefit you? Automatic Interpretation of EEGs Statistics Acknowledgements DARPA/MTO (D13AP00065)
Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated: Corpus Development EEG signal files and reports.
Abstract EEGs, which record electrical activity on the scalp using an array of electrodes, are routinely used in clinical settings to.
Automatic Labeling of EEGs Using Deep Learning M. Golmohammadi, A. Harati, S. Lopez I. Obeid and J. Picone Neural Engineering Data Consortium College of.
JM - 1 Introduction to Bioinformatics: Lecture VIII Classification and Supervised Learning Jarek Meller Jarek Meller Division.
Data Processing Machine Learning Algorithm The data is processed by machine algorithms based on hidden Markov models and deep learning. They are then utilized.
Abstract The emergence of big data and deep learning is enabling the ability to automatically learn how to interpret EEGs from a big data archive. The.
Acknowledgements This research was also supported by the Brazil Scientific Mobility Program (BSMP) and the Institute of International Education (IIE).
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
THE TUH EEG CORPUS: The Largest Open Source Clinical EEG Corpus Iyad Obeid and Joseph Picone Neural Engineering Data Consortium Temple University Philadelphia,
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
Why Big Data is Crucial Overall progress in the field is not commensurate with the scope of investment. The existence of massive corpora has proven to.
Exploration of Instantaneous Amplitude and Frequency Features for Epileptic Seizure Prediction Ning Wang and Michael R. Lyu Dept. of Computer Science and.
Data Acquisition An EEG measurement represents a difference between the voltages at two electrodes. The signal is usually displayed using a montage which.
©2009 Mladen Kezunovic. Improving Relay Performance By Off-line and On-line Evaluation Mladen Kezunovic Jinfeng Ren, Chengzong Pang Texas A&M University,
The Goal Use computers to aid physicians in diagnosis of neurological diseases, particularly epilepsy Detect pathological events in real time Currently,
TUH EEG Corpus Data Analysis 38,437 files from the Corpus were analyzed. 3,738 of these EEGs do not contain the proper channel assignments specified in.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Big Mechanism for Processing EEG Clinical Information on Big Data Aim 1: Automatically Recognize and Time-Align Events in EEG Signals Aim 2: Automatically.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Automated Interpretation of EEGs: Integrating Temporal and Spectral Modeling Christian Ward, Dr. Iyad Obeid and Dr. Joseph Picone Neural Engineering Data.
Abstract Automatic detection of sleep state is important to enhance the quick diagnostic of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Demonstration A Python-based user interface: Waveform and spectrogram views are supported. User-configurable montages and filtering. Scrolling by time.
Market: Customer Survey: 57 clinicians from academic medical centers and community hospitals, and 44 industry professionals. Primary Customer Need: 70%
Data Analysis Generation of the corpus statistics was accomplished through the analysis of information contained in the EDF headers. Figure 4 shows some.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
Abstract Automatic detection of sleep state is an important queue in accurate detection of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Improved EEG Event Classification Using Differential Energy A.Harati, M. Golmohammadi, S. Lopez, I. Obeid and J. Picone Neural Engineering Data Consortium.
Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)
The Neural Engineering Data Consortium Mission: To focus the research community on a progression of research questions and to generate massive data sets.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Results from Mean and Variance Calculations The overall mean of the data for all features was for the REF class and for the LE class. The.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Scalable EEG interpretation using Deep Learning and Schema Descriptors
Automated Identification of Abnormal Adult EEG
Sentiment analysis algorithms and applications: A survey
Applying Neural Networks
CLASSIFICATION OF SLEEP EVENTS IN EEG’S USING HIDDEN MARKOV MODELS
Enhanced Visualizations for Improved Real-Time EEG Monitoring
THE TUH EEG SEIZURE CORPUS
Automatic Sleep Stage Classification using a Neural Network Algorithm
Epileptic Seizure Prediction
The Elements of Statistical Learning
Schizophrenia Classification Using
Enhanced Visualizations for Improved Real-Time EEG Monitoring
N. Capp, E. Krome, I. Obeid and J. Picone
Optimizing Channel Selection for Seizure Detection
EEG Recognition Using The Kaldi Speech Recognition Toolkit
Big Data Resources for EEGs: Enabling Deep Learning Research
AN ANALYSIS OF TWO COMMON REFERENCE POINTS FOR EEGS
E. von Weltin, T. Ahsan, V. Shah, D. Jamshed, M. Golmohammadi, I
Improved EEG Event Classification Using Differential Energy
Vinit Shah, Joseph Picone and Iyad Obeid
Automatic Interpretation of EEGs for Clinical Decision Support
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
feature extraction methods for EEG EVENT DETECTION
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
EEG Event Classification Using Deep Learning
A Dissertation Proposal by: Vinit Shah
Deep Residual Learning for Automatic Seizure Detection
EEG Event Classification Using Deep Learning
Machine Learning with Clinical Data
Presentation transcript:

A MACHINE LEARNING APPROACH TO AUTOMATED IDENTIFICATION OF ABNORMAL EEGS G. Suarez, J. Soares, S. Lopez, I. Obeid and J. Picone The Neural Engineering Data Consortium, Temple University College of Engineering Temple University www.isip.piconepress.com Machine Learning Though machine learning has been to classify EEG signals, prior results have yet to be sufficiently successful. Yuan et al. (2012) used a wavelet transform and the extreme learning machine algorithm to classify interictal and ictal EEGs, achieving an accuracy of 99.25% on the Bonn EEG data. Cao et al. (2008) used the support vector machines to identify residual functional abnormalities in athletes suffering from concussions, producing 77.1% accuracy on a corpus of 61 athletes at risk. Karlik & Hayta (2014) used artificial neural networks to identify epileptic seizures with 99.75% accuracy on a corpus of 200 normal/abnormal EEGs. Hussain et al. (2015) used a combination of algorithms including k-nearest neighbors to achieve a 98.22% accuracy on the detection of alcoholism using EEGs. Abstract The TUH EEG Corpus is the largest and most comprehensive publicly-released corpus representing 11 years of clinical data collected at Temple Hospital. It includes over 15,000 patients, 28,000+ sessions, 50,000+ EEGs and deidentified clinical information. About 75% of the EEGs contained in the TUH EEG Corpus have been identified as abnormal, allowing for a large pool of data that can be used to analyze the underlying factors predicting abnormalities. The goal of this work is to augment our system that automatically interprets EEGs, known as AutoEEGTM, with abnormal EEG classification. A simple machine learning approach based on random forests and principal components analysis resulted in a 40.3% classification error rate. Random guessing based on priors resulted in a 49.8% error rate. Corpus Statistics The first step in this process was collecting data from the TUH EEG Corpus: Results A random forest algorithm was used to train a two- way classifier, and provided results that were more accurate than guessing based on priors. A PCA dimension of 20 was used in this experiment. The initial vector dimension was 27x600, representing the first 60 secs of the signal. This was done to simulate the amount of data used by neurologists in making this decision. Year # Normal # Abnormal Total 2003 17 18 35 2004 23 25 48 2005 4 7 11 2006 31 49 2007 15 33 2008 14 20 34 2009 39 2010 13 28 2011 16 24 40 2012 26 46 2013 Total: 202 200 402 Figure 8. A comparison of random forests to guessing based on priors. Figure 4. An Analysis of the Data By Year of Collection Introduction Electroencephalography (EEG) is increasingly being used for preventive diagnostic procedures. Abnormalities in EEG signals can be caused by a wide range of reasons; however, much can be identified by observing the posterior dominant rhythm and vulnerable temporal lobes. Events such as spikes, sharp waves and epileptic patterns usually indicate abnormal behavior. Context plays an important role in determining whether these events are associated with an abnormality: Methodology Manually selected 139 normal and 119 abnormal .edf files and used features derived from one channel of the signal. Channel T5-O1 from a TCP montage was selected because it contains information about the posterior dominant rhythm. A standard 9-element feature vector (8 cepstral features plus a special differential energy term) were augmented with first and second derivatives, giving a 27‑dimension feature vector. A concatenated feature vector was built from the first 60 secs of the signal, giving a supervector of dimension 27 x 600. Principal components analysis was used to reduce the dimension to approximately 20 features. A “leave-one-out” cross-validation approach was used to assess performance. A random forest approach was used to classify the data (based on TreeBagger in MATLAB): Figure 5. An EEG of an epileptic patient. The intensity of the spike and waves are an indication of epilepsy. Input: EEG raw data Figure 9. Error Rate as a Function of PCA Dimension Summary Abnormal behavior in EEGs can generally be detected by the posterior dominant rhythm, making the channel T5-O1 the most effective channel to examine for abnormal behavior. The combination of random forests and principal components analysis is a promising approach. A relatively small number of dimensions can be used. Future work will include optimizing the parameters of this system, and incorporating more sophisticated temporal modeling (e.g., hidden Markov models), and developing better criteria for building a supervector representation of the data. Acknowledgements Many thanks to the Honors Program in Temple University for funding my research through the Merit Scholars Summer Stipend program. Figure 6. An EEG of a patient diagnosed with a concussion. Frequent paroxysmal sharply contoured theta discharges are usually an indication of a concussion. Figure 1. A Typical Abnormal EEG EEG Signal Nonstandard/Abnormal Benign Variant Healthy True Abnormality Unhealthy Standard/Normal Figure 7. An EEG of a patient suffering from a form of measles, subacute sclerosing panencephalitis (a progressive neurological complication of measles). Periodic synchronous discharges are observed. Figure 2. A Decision Tree to Classify Abnormal EEGs Figure 3. The TreeBagger Approach to Random Forests