Data Analysis Generation of the corpus statistics was accomplished through the analysis of information contained in the EDF headers. Figure 4 shows some.

Slides:



Advertisements
Similar presentations
M. Tabrizi: Seizure Detection May, A COMPARATIVE ANALYSIS OF NONLINEAR FEATURES FOR AN HMM-BASED SEIZURE DETECTION SYSTEM Masih Tabrizi and Joseph.
Advertisements

Abstract Electrical activity in the cortex can be recorded by surface electrodes. Electro Encephalography (EEG) machine records potential difference between.
Get Involved Download and use the data Take the survey Join the user group Neural Engineering Data Consortium Iyad Obeid PhD, Joseph Picone PhDTemple University,
Using ICD Codes and Birth Records to Prevent Mismatches of Multiple Births in Linked Hospital Readmission Data Alison Fraser 1, MSPH, Zhiwei Liu 2, MS,
Take the Survey! Big data needs? How could membership benefit you? Automatic Interpretation of EEGs Statistics Acknowledgements DARPA/MTO (D13AP00065)
She Figures 2009 European Commission, Research DG Francesca Crippa Unit L.4 - Scientific culture and gender issues
Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated: Corpus Development EEG signal files and reports.
SHHS Data Dissemination to the Research/Academic Community: Needs and Opportunities Kenneth A. Loparo Nord Professor of Engineering EECS Department Case.
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
 Communicating with Consumers About Health Care Value : A Controlled Experiment Judith Hibbard and Jessica Greene, University of Oregon Shoshanna Sofaer.
Computational Analysis of USA Swimming Data Junfu Xu School of Computer Engineering and Science, Shanghai University.
Abstract EEGs, which record electrical activity on the scalp using an array of electrodes, are routinely used in clinical settings to.
Jared A. Rowland, M.S., Michael M. Knepp, M.S., Sheri L. Towe, M.S., Chris S. Immel, M.S., Ryoichi J.P. Noguchi, M.S., Chad L. Stephens, M.S. & David W.
Automatic Labeling of EEGs Using Deep Learning M. Golmohammadi, A. Harati, S. Lopez I. Obeid and J. Picone Neural Engineering Data Consortium College of.
NEMSIS Compliance National Database Implementation National EMS Reports.
Big Data in Biomedical Engineering Iyad Obeid, PhD November 7, 2014 Temple University Neural Engineering Data Consortium
Data Processing Machine Learning Algorithm The data is processed by machine algorithms based on hidden Markov models and deep learning. They are then utilized.
Abstract The emergence of big data and deep learning is enabling the ability to automatically learn how to interpret EEGs from a big data archive. The.
Is The Prevalence of Staphylococcus aureus Increasing Among York College Students? By: Angel Newson Department of Biological Sciences, York College of.
Acknowledgements This research was also supported by the Brazil Scientific Mobility Program (BSMP) and the Institute of International Education (IIE).
Elaine Weyuker August  To determine which files of a large software system with multiple releases are likely to contain the largest numbers of.
The Neural Engineering Data Consortium: ‘Déjà Vu All Over Again’ Iyad Obeid, Associate Professor Joseph Picone, Professor Department of Electrical and.
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
THE TUH EEG CORPUS: The Largest Open Source Clinical EEG Corpus Iyad Obeid and Joseph Picone Neural Engineering Data Consortium Temple University Philadelphia,
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
Why Big Data is Crucial Overall progress in the field is not commensurate with the scope of investment. The existence of massive corpora has proven to.
Statistical Outcomes IT Key Ideas, Dynamic Database Systems, 2002 Chapter 6.
DIAGNOSTIC SERVICES What They Do * Perform tests or evaluations that aid in the detection, diagnosis, and treatment of disease, injury or other physical.
Data Acquisition An EEG measurement represents a difference between the voltages at two electrodes. The signal is usually displayed using a montage which.
Updated Wingate Anaerobic Power Test Norms Mike Borgert-Poepping, Brett Gressick & Don Bredle, PhD Department of Kinesiology, University of Wisconsin-Eau.
The Goal Use computers to aid physicians in diagnosis of neurological diseases, particularly epilepsy Detect pathological events in real time Currently,
Results (continued) Results Abstract Methods The motor imagery group was read a detailed script and in summary asked to do the following during rest intervals:
Business and Management Research
TUH EEG Corpus Data Analysis 38,437 files from the Corpus were analyzed. 3,738 of these EEGs do not contain the proper channel assignments specified in.
The Neural Engineering Data Consortium: ‘Déjà Vu All Over Again’ 1 Iyad Obeid, Associate Professor Joseph Picone, Professor Department of Electrical and.
Big Mechanism for Processing EEG Clinical Information on Big Data Aim 1: Automatically Recognize and Time-Align Events in EEG Signals Aim 2: Automatically.
Bringing Big Data to Neural Interfaces Iyad Obeid PhD, Joseph Picone PhD Temple University, Philadelphia, Pennsylvania Existing Research & Funding Model.
Instructors’ General Perceptions on Students’ Self-Awareness Frances Feng-Mei Choi HUNGKUANG UNIVERSITY DEPARTMENT OF ENGLISH.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Automated Interpretation of EEGs: Integrating Temporal and Spectral Modeling Christian Ward, Dr. Iyad Obeid and Dr. Joseph Picone Neural Engineering Data.
Abstract Automatic detection of sleep state is important to enhance the quick diagnostic of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Demonstration A Python-based user interface: Waveform and spectrogram views are supported. User-configurable montages and filtering. Scrolling by time.
Generating and Using a Qualified Medical Knowledge Graph for Patient Cohort Retrieval from Big Clinical Electroencephalography (EEG) Data Sanda Harabagiu,
Feature Extraction Find best Alignment between primitives and data Found Alignment? TUH EEG Corpus Supervised Learning Process Reestimate Parameters Recall.
Market: Customer Survey: 57 clinicians from academic medical centers and community hospitals, and 44 industry professionals. Primary Customer Need: 70%
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
Abstract Automatic detection of sleep state is an important queue in accurate detection of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Spirometry tests were carried out by a Respiratory Clinical Nurse Specialist (Respiratory CNS) Participants were referred to see their General Practitioner.
The Neural Engineering Data Consortium Mission: To focus the research community on a progression of research questions and to generate massive data sets.
Pharmacy in Public Health: Describing Populations Course, date, etc. info.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Results from Mean and Variance Calculations The overall mean of the data for all features was for the REF class and for the LE class. The.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
Scalable EEG interpretation using Deep Learning and Schema Descriptors
CLASSIFICATION OF SLEEP EVENTS IN EEG’S USING HIDDEN MARKOV MODELS
G. Suarez, J. Soares, S. Lopez, I. Obeid and J. Picone
Enhanced Visualizations for Improved Real-Time EEG Monitoring
THE TUH EEG SEIZURE CORPUS
Enhanced Visualizations for Improved Real-Time EEG Monitoring
N. Capp, E. Krome, I. Obeid and J. Picone
Fig. 2: Internal Components
Optimizing Channel Selection for Seizure Detection
EEG Recognition Using The Kaldi Speech Recognition Toolkit
Big Data Resources for EEGs: Enabling Deep Learning Research
AN ANALYSIS OF TWO COMMON REFERENCE POINTS FOR EEGS
E. von Weltin, T. Ahsan, V. Shah, D. Jamshed, M. Golmohammadi, I
Automatic Interpretation of EEGs for Clinical Decision Support
feature extraction methods for EEG EVENT DETECTION
Introduction to Databases
Presentation transcript:

Data Analysis Generation of the corpus statistics was accomplished through the analysis of information contained in the EDF headers. Figure 4 shows some information contained in the fields of the EDF file header. Figure 4. Examples of information contained in an EDF file header A Python script was developed to read the necessary information from each EDF file in the database and later analyze the data. Data Analysis Generation of the corpus statistics was accomplished through the analysis of information contained in the EDF headers. Figure 4 shows some information contained in the fields of the EDF file header. Figure 4. Examples of information contained in an EDF file header A Python script was developed to read the necessary information from each EDF file in the database and later analyze the data. Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated. Figure 2 shows the process followed to prepare a single session for release. Figure 2. Diagram of the workflow for preparing data. Each session consists of a set of EDF files and a corresponding EEG report. Figure 3. Directory Tree of the Database Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated. Figure 2 shows the process followed to prepare a single session for release. Figure 2. Diagram of the workflow for preparing data. Each session consists of a set of EDF files and a corresponding EEG report. Figure 3. Directory Tree of the Database Summary The EEG TUH Data Corpus is the largest publicly available clinical EEG database – at least two orders of magnitude larger than other corpora. The database is organized by sets of 100 patients for convenience. Each patient directory contains all sessions for that patient. Each session contains an EDF file and a redacted report. The data was obtained and analyzed through the development of a Python script that extracted the fields of interest from each EDF file in the database. The increasing demand for EEGs is reflected in the EEG year distribution. The most common demographic is a female, between 40 to 69 years old with a tendency to get between 1 and 4 EEGs in the course of 10 years. The 2002–2013 data is now publicly available. Data collected in 2014 and 2015 will be released in May See for more details. Acknowledgements Portions of this work were sponsored by the Defense Advanced Research Projects Agency (DARPA) MTO under the auspices of Dr. Doug Weber through the Contract No. D13AP00065, Temple University’s College of Engineering and Office of the Senior Vice-Provost for Research. Summary The EEG TUH Data Corpus is the largest publicly available clinical EEG database – at least two orders of magnitude larger than other corpora. The database is organized by sets of 100 patients for convenience. Each patient directory contains all sessions for that patient. Each session contains an EDF file and a redacted report. The data was obtained and analyzed through the development of a Python script that extracted the fields of interest from each EDF file in the database. The increasing demand for EEGs is reflected in the EEG year distribution. The most common demographic is a female, between 40 to 69 years old with a tendency to get between 1 and 4 EEGs in the course of 10 years. The 2002–2013 data is now publicly available. Data collected in 2014 and 2015 will be released in May See for more details. Acknowledgements Portions of this work were sponsored by the Defense Advanced Research Projects Agency (DARPA) MTO under the auspices of Dr. Doug Weber through the Contract No. D13AP00065, Temple University’s College of Engineering and Office of the Senior Vice-Provost for Research. THE TEMPLE UNIVERSITY HOSPITAL EEG CORPUS (TUH EEG) Silvia Lopez, Iyad Obeid and Joseph Picone The Neural Engineering Data Consortium, Temple University Abstract The TUH EEG Corpus is the largest and most comprehensive publicly-released corpus representing 12 years of clinical data collected at Temple Hospital. It includes over 17,000 patients, 25,000+ sessions, 50,000+ EEGs and deidentified clinical information. This study presents a comprehensive analysis of this database, including statistics about the number of records per patient, age and gender. The average number of sessions per patient is 1.7. A significant number of patients have 3 or more EEG sessions that span several years, making it ideal for studying long-term variations. The EEG reports contain brief medical histories, supporting investigations into the correlations with EEG signal events. A small amount of hand-labeled data is available for machine learning research. Abstract The TUH EEG Corpus is the largest and most comprehensive publicly-released corpus representing 12 years of clinical data collected at Temple Hospital. It includes over 17,000 patients, 25,000+ sessions, 50,000+ EEGs and deidentified clinical information. This study presents a comprehensive analysis of this database, including statistics about the number of records per patient, age and gender. The average number of sessions per patient is 1.7. A significant number of patients have 3 or more EEG sessions that span several years, making it ideal for studying long-term variations. The EEG reports contain brief medical histories, supporting investigations into the correlations with EEG signal events. A small amount of hand-labeled data is available for machine learning research. Introduction Electroencephalography (EEG), reading of electrical activity along the scalp, is increasingly being used for preventive diagnostic procedures for conditions such as epilepsy, sleep disorders and others. Previously released data resources in this area are mostly represented by experimentally acquired recordings. Figure 1 shows the comparison between the TUH EEG Data Corpus and other resources. The TUH EEG Data Corpus is the largest publicly available database of Clinical EEG data, comprising more than 25,000 records. Figure 1. Comparison between the TUH EEG Data Corpus and other Resources Introduction Electroencephalography (EEG), reading of electrical activity along the scalp, is increasingly being used for preventive diagnostic procedures for conditions such as epilepsy, sleep disorders and others. Previously released data resources in this area are mostly represented by experimentally acquired recordings. Figure 1 shows the comparison between the TUH EEG Data Corpus and other resources. The TUH EEG Data Corpus is the largest publicly available database of Clinical EEG data, comprising more than 25,000 records. Figure 1. Comparison between the TUH EEG Data Corpus and other Resources Analysis of Statistics 70% of the patients have only 1 EEG recording,27% have between 2 and 5 sessions, and 3% (over 400 patients) have more than 5 sessions. The increasing amount of records throughout the years represents an increasing demand for EEGs in the medical diagnostics field. The data collected in represents 35% of the recordings, while the data collected between 2000 and 2005 represents only 8% of the sessions. The female population represents 52% of the total. The number of female and male patients is relatively even, with a slight female majority. 60% of the population is between 40 and 69 years old, 26% is between 0 and 39 years old and 15% of the population is 70 or older. Approximately 75% of the sessions are classified as abnormal EEGs. Analysis of Statistics 70% of the patients have only 1 EEG recording,27% have between 2 and 5 sessions, and 3% (over 400 patients) have more than 5 sessions. The increasing amount of records throughout the years represents an increasing demand for EEGs in the medical diagnostics field. The data collected in represents 35% of the recordings, while the data collected between 2000 and 2005 represents only 8% of the sessions. The female population represents 52% of the total. The number of female and male patients is relatively even, with a slight female majority. 60% of the population is between 40 and 69 years old, 26% is between 0 and 39 years old and 15% of the population is 70 or older. Approximately 75% of the sessions are classified as abnormal EEGs. Corpus Statistics The following tables present a series of statistics about the corpus. Corpus Statistics The following tables present a series of statistics about the corpus. FieldDescriptionExample 1Version Number0 2Patient IDTUH GenderM 4Date of Birth57 8Firstname_LastnameTUH Startdate01-MAY Study Number/ Tech. IDTUH /TAS X 14Start Date Start Time Number of Bytes in Header Type of SignalEDF+C 19Number of Data Records207 20Dur. of a Data Record (Secs)1 21No. of Signals in a Record24 27Signal[1] PrefilteringHP:1.000 Hz LP:70.0 Hz N: Signal[1] No. Samples/Rec.250