Automatic Analysis of Ion Mobility Spectrometry – Mass Spectrometry (IMS-MS) Data Hyejin Yoon School of Informatics Indiana University Bloomington December.

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

Lipids Analytical Tool (LipidAT): automated analysis of lipidomic mass spectrometry data Jun Ma Advisor: Dr. Haixu Tang Co-Advisor: Dr. David Wild Co-Advisor.
A Multi-PCA Approach to Glycan Biomarker Discovery using Mass Spectrometry Profile Data Anoop Mayampurath, Chuan-Yih Yu Info-690 (Glycoinformatics) Final.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Mass Spectrometry The substance being analyzed (solid or liquid) is injected into the mass spectrometer and vaporized at elevated temperature and reduced.
Lecture 8. GC/MS.
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Atomic Mass Spectrometry
1 Information-Theoretic Mass Spectral Library Search Arvind Visvanathan CSCE 990 Seminar in Multi-Dimensional Chromatography Systems, Informatics, and.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Molecular Mass Spectrometry
LC-MS Lecture 7.
ProReP - Protein Results Parser v3.0©
Computational Methods for Biomarker Discovery in Proteomics and Glycomics Vijetha Vemulapalli School of Informatics Indiana University Capstone Advisor:
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
A Neural Network Predictor for Peptide Fragmentation in Mass Spectrometry Arunima Ram Advisor : Dr. Predrag Radivojac Co-Advisor : Dr. Haixu Tang Co-Advisor.
Analysis of tandem mass spectra - II Prof. William Stafford Noble GENOME 541 Intro to Computational Molecular Biology.
Proteomics Josh Leung Biology 1220 April 13 th, 2010.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Proteomics Informatics – Overview of Mass spectrometry (Week 2)
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
Chapter 9 Mass Spectrometry (MS) -Microbial Functional Genomics 조광평 CBBL.
Molecular mass spectrometry Chapter 20 The study of “molecular ions” M + e -  M e -
Introduction to Protein Chemistry October 2013 Gustavo de Souza IMM, OUS.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Laxman Yetukuri T : Modeling of Proteomics Data
MS Calibration for Protein Profiles We need calibration for –Accurate mass value Mass error: (Measured Mass – Theoretical Mass) X 10 6 ppm Theoretical.
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
INF380 - Proteomics-51 INF380 – Proteomics Chapter 5 – Fundamentals of Mass Spectrometry Mass spectrometry (MS) is used for measuring the mass-to-charge.
Clustering Features in High-Throughput Proteomic Data Richard Pelikan (or what’s left of him) BIOINF 2054 April
PEAKS: De Novo Sequencing using Tandem Mass Spectrometry Bin Ma Dept. of Computer Science University of Western Ontario.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Eat Raw & Fresh: Introducing isotopic Mass-to-charge Ratio and Envelope Fingerprinting (iMEF) and ProteinGoggle for Protein Database Search Zhixin(Michael)
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Separates charged atoms or molecules according to their mass-to-charge ratio Mass Spectrometry Frequently.
Isotope Labeled Internal Standards in Skyline
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
MC 13.3 Spectroscopy, Pt III 1 Introduction to Mass Spectrometry (cont) Principles of Electron-Impact Mass Spectrometry:  A mass spectrometer produces.
Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Constructing high resolution consensus spectra for a peptide library
Chapter 29 Mass Spectrometry. 29 A Principles of mass spectrometry In the mass spectrometer, analyte molecules are converted to ions by applying energy.
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Volume 4, Issue 6, Pages e4 (June 2017)
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
Volume 4, Issue 6, Pages e4 (June 2017)
Proteomics Informatics –
NoDupe algorithm to detect and group similar mass spectra.
Schematic of MS1 filtering.
Feature extraction and alignment for LC/MS data
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Proteomics Informatics David Fenyő
Presentation transcript:

Automatic Analysis of Ion Mobility Spectrometry – Mass Spectrometry (IMS-MS) Data Hyejin Yoon School of Informatics Indiana University Bloomington December 5, 2008 Advisor: Dr. Haixu Tang

Outline 1. Introduction 2. Motivation 4. IMS-MS Analyzer 5. Results 3. Workflow of IMS-MS Data Analysis 6. Future Work 7. References 8. Acknowledgements

Mass Spectrometry (MS) Generic mass spectrometry (MS)- based proteomics experiment [Ruedi Aebersold et al.]  Measures molecular mass (mass-to-charge ratio) of a sample  Mass spectrum  Tandem MS (MS/MS)

Application of MS  Molecule identification/quantitation accurate molecular weight confirm the molecular formula substitution of a amino acid or post-translational modification  Structural and sequence information from MS/MS

Liquid Chromatography – Mass Spectrometry  MS Combined with Liquid Chromatography (LC) LC-MS, LC-MS/MS  Advantages Provides a steady stream of different samples More precise Higher confident  Limitation Molecule at low abundance levels Low depth of coverage for complex samples Slow: Liquid phase A schematic diagram of LC-MS [

 Ion mobility spectrometry (IMS) Fast: Gas phase Ion Mobility Spectrometry – Mass Spectrometry (IMS-MS) E Buffer Gas DETECTOR Gate High-throughput proteomics platform based on ion- mobility time-of-flight mass spectrometry [Belov et. al. ASMS]

 IMS-MS Distinguish different ions having identical mass-to-charge ratios Separates out conformers Increases depth of coverage, confidence Used to measure cross-section Reduces noise Fast separation: Gas phase Advantages of IMS-MS A schematic diagram of IMS-MS [Hoaglund CS, et al. 1998]

 IMS-MS “Frame” 3-dimensional data: drift time, m/z, intensity 2D Color map Rarely done so far, Few analysis SW LC-IMS-MS LC coupled to MS-MS 4-dimensional data frame, drift time, m/z, intensity Multiple frames Advantage  Multiple measurements per LC peak  Increasing peak capacity  Increase depth of coverage  Reproducible, increase confidence MS vs. IMS-MS  MS Mass Spectrum 2-dimensional data: m/z, intensity Many tools to analyze LC-MS

Motivation for Automatic IMS-MS Analysis  Challenging data analysis, due to multi-dimensional nature of data  Need for an automatic data analysis tool for the studies using IMS-MS/LC-IMS-MS instruments  Visualize IMS-MS, LC-IMS-MS data m/z, drift time space Mass, drift time space  Feature/Peak detection Deisotope isotopic distributions to get monoisotopic mass & charge state Identify IMS-MS peaks using two dimensions (mass/ drift time)  User-friendly

Workflow of IMS-MS Analysis IMS-MS / LC-IMS-MS System IMS-MS / LC-IMS-MS System Biological sample mixture Biological sample mixture Visualization & Feature-finding Algorithm Visualization & Feature-finding Algorithm Peak-picking Algorithm Peak-picking Algorithm Visualization & Deisotoping Algorithm Visualization & Deisotoping Algorithm IMS-MS Analyzer Feature List IMS-MS Data IMS-MS Data IMS-MS Peak List IMS-MS Peak List Monoisotope (peak) List Monoisotope (peak) List LC-IMS-MS Data LC-IMS-MS Data Monoisotope (peak) Lists Monoisotope (peak) Lists Feature Lists Feature Lists IMS-MS Peak Lists IMS-MS Peak Lists

IMS-MS Analyzer: 2D Color Map and Deisotoping Visualization & Feature-finding Algorithm Visualization & Feature-finding Algorithm Peak-picking Algorithm Peak-picking Algorithm Visualization & Deisotoping Algorithm Visualization & Deisotoping Algorithm IMS-MS Analyzer Feature List IMS-MS Data IMS-MS Data Peak List Monoisotope (peak) List Monoisotope (peak) List LC-IMS-MS Data LC-IMS-MS Data Monoisotope (peak) Lists Monoisotope (peak) Lists Feature Lists Feature Lists Peak Lists Peak Lists

2D Color Map and Zoom :::: :::: Input (drift scan, TOF bin, intensity) calibration coefficients drift time, m/z, color code Plot drift time vs. m/z vs. intensity

2D Color Map and Zoom

Single drift scan view

Single Drift Scan Processing  Peak-picking on spectra Remove spectral noise  Deisotoping Algorithm THRASH [Horn et al. 2000] algorithm Detect accurate monoisotopic mass and charge state

THRASH on a frame  THRASH entire frame THRASH scan by scan a peak list in the form of monoisotopic masses observed across continuous drift-times. Results saved as a csv file

IMS-MS Analyzer: THRASH 2D map and Feature Finding Visualization & Feature-finding Algorithm Visualization & Feature-finding Algorithm Peak-picking Algorithm Peak-picking Algorithm Visualization & Deisotoping Algorithm Visualization & Deisotoping Algorithm IMS-MS Analyzer Feature List IMS-MS Data IMS-MS Data Peak List Monoisotope (peak) List Monoisotope (peak) List LC-IMS-MS Data LC-IMS-MS Data Monoisotope (peak) Lists Monoisotope (peak) Lists Feature Lists Feature Lists Peak Lists Peak Lists

THRASH 2D map 2D map of drift time vs. m/z THRASH frame 2D map of drift-time vs. monoisotopic mass

Feature Finding   Feature: a drift profile for a specific mass value   Preliminary step to Identify IMS-MS peaks   Sliding Window approach Cluster monoisotopic ions located across continuous drift-times Report representative monoisotopic mass, drift-time value, maximum intensity, total intensity, charge and range of drift-time that correspond to a particular feature   Feature profile view   Manually visualizing Gaussian fitting to the feature

Feature Finding

IMS-MS Analyzer: Peak-Picking Visualization & Feature-finding Algorithm Visualization & Feature-finding Algorithm Peak-picking Algorithm Peak-picking Algorithm Visualization & Deisotoping Algorithm Visualization & Deisotoping Algorithm IMS-MS Analyzer Feature List IMS-MS Data IMS-MS Data IMS-MS Peak List IMS-MS Peak List Monoisotope (peak) List Monoisotope (peak) List LC-IMS-MS Data LC-IMS-MS Data Monoisotope (peak) Lists Monoisotope (peak) Lists Feature Lists Feature Lists Peak Lists Peak Lists

Peak-Picking  Overlapping peaks: isomeric molecules or conformational change in a molecules  Apply Gaussian mixture models  Use Expectation-Maximization (EM) algorithm  Goodness-of-fit to find the best fitting Gaussian mixture  Choose Gaussian means to represent IMS-MS peaks

Peak-picking Examples

Gaussian Mixture Models (GMMs)  There are k components of Gaussian  i ’ th component: w i  Mean of component w i : μ i  Each component generates data from a Gaussian function with mean μ i and variance σ i 2  Each datapoint is generated according to probability of component i: P(w i ) N(μ i, σ i 2 )  We need to find μ 1, μ 2, …, μ k which give maximum likelihood

EM Algorithm  Alternate between Expectation (E) step and Maximization (M) step  E step computes an expectation of the likelihood by including the unobserved variables as if they were observed  M step computes the maximum likelihood estimates of the parameters by maximizing the expected likelihood found on the E step  Begin next round of the E step using the parameters found on the M step and repeat the process

 On the t’th iteration let our estimates be  E step  M step EM for GMMs

 How well the model fits a set of observed data  Discrepancy between observed values and the values expected under the model  Based on goodness-of-fit we determine the best fitting Gaussian mixture within user specified max components Goodness-of-Fit

Peak-picking

Peak-picking Results

IMS-MS Analyzer: LC-IMS-MS Processing Visualization & Feature-finding Algorithm Visualization & Feature-finding Algorithm Peak-picking Algorithm Peak-picking Algorithm Visualization & Deisotoping Algorithm Visualization & Deisotoping Algorithm IMS-MS Analyzer Feature List IMS-MS Data IMS-MS Data Peak List Monoisotope (peak) List Monoisotope (peak) List LC-IMS-MS Data LC-IMS-MS Data Monoisotope (peak) Lists Monoisotope (peak) Lists Feature Lists Feature Lists IMS-MS Peak Lists IMS-MS Peak Lists

Analyzing LC-IMS-MS data  Data set of multiple frames  4D data  Binary search algorithm to find the target frame  Processing all frames automatically : :

2D Map of LC-IMS-MS

THRASH/peak-picking of LC-IMS-MS

Results IMS-MS sample (Cellobiose) LC-IMS-MS sample (Human Plasma) # of Deisotoped ions 5370~266 per frame # of IMS-MS peaks 350~18 per frame

Future Work Biological sample LC-IMS-MS Systems LC-IMS-MS Systems LC-IMS-MS dataset LC-IMS-MS dataset IMS-MS/MS dataset IMS-MS/MS dataset Precursor Feature/Peak List Precursor Feature/Peak List Fragment Peak List Fragment Peak List MS/MS Spectra + Precursor information MS/MS Spectra + Precursor information Downstream Computational Analysis - Protein identification - Protein quantitation - Biological pathway reconstruction Precursor Peak List Precursor Peak List Drift Profile Aligner De- isotoping Peak Picking Feature Detector Feature Detector Fragment Feature/Peak List Fragment Feature/Peak List

References  Aebersold R, Mann M, Mass spectrometry-based proteomics, Nature Mar 13;422(6928):  Guerrera IC, Kleiner O. Application of mass spectrometry in proteomics, Biosci Rep Feb-Apr;25(1-2):  Clemmer DE, Jarrold MF, Ion mobility measurements and their applications to clusters and biomolecules, J Mass Spectrom. 1997;32:  Hoaglund CS, Valentine SJ, Sporleder CR, Reilly JP, Clemmer DE, Three-dimensional ion mobility/TOFMS analysis of electrosprayed biomolecules, Anal Chem Jun 1;70(11):  Baker ES, Clowers BH, Li F, Tang K, Tolmachev AV, Prior DC, Belov ME, Smith RD, Ion Mobility Spectrometry–Mass Spectrometry Performance Using Electrodynamic Ion Funnels and Elevated Drift Gas Pressures, J Am Soc Mass Spectrom Jul;18(7):  Horn DM, Zubarev RA, McLafferty FW, Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules, J Am Soc Mass Spectrom Apr;11(4):   ml 

Acknowledgements  Prof. Haixu Tang, School of Informatics  Lab-mates Anoop Mayampurath, Mina Rho, Jun Ma, Yong Li, Paul Yu, Chao Ji, Indrani Sarkar  Chemistry Department Stephen Valentine Manny Plasenci Ruwan Thushara Kurulugama Prof. David E. Clemmer  Faculty and staff, School of Informatics