Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
Mass Spectrometry Kyle Chau and Andrew Gioe. Computation of Molecular Mass -Mass Spectrum is a plot of intensity as a function of mass- charge ratio,
Andrew Rouff and Ben Berger
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
Mass Spectrometry in the Biosciences: Introduction to Mass Spectrometry and Its Uses in a Company Like Decode. Sigurður V. Smárason, Ph.D. New Technologies.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Molecular Mass Spectrometry
LC-MS Lecture 7.
PROTEOMICS LECTURE. Genomics DNA (Gene) Functional Genomics TranscriptomicsRNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic.
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Previous Lecture: Regression and Correlation
HOW MASS SPECTROMETRY CAN IMPROVE YOUR RESEARCH
Proteomics Informatics – Overview of Mass spectrometry (Week 2) Ion Source Mass Analyzer Detector mass/charge intensity.
My contact details and information about submitting samples for MS
Goals in Proteomics 1.Identify and quantify proteins in complex mixtures/complexes 2.Identify global protein-protein interactions 3.Define protein localizations.
Proteomics Josh Leung Biology 1220 April 13 th, 2010.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Gas Chromatography And Mass Spectrometry
Proteomics Informatics – Overview of Mass spectrometry (Week 2)
Proteome.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Chapter 9 Mass Spectrometry (MS) -Microbial Functional Genomics 조광평 CBBL.
es/by-sa/2.0/. Large Scale Approaches to the Study of Protein Levels and Activity Prof:Rui Alves
An introduction and possible applications Ariane Kahnt
INF380 - Proteomics-61 INF380 – Proteomics Chapter 6 – Mass Spectrometry – MALDI TOF The MALDI-TOF instruments are the simplest MS instruments suitable.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
MS Calibration for Protein Profiles We need calibration for –Accurate mass value Mass error: (Measured Mass – Theoretical Mass) X 10 6 ppm Theoretical.
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
INF380 - Proteomics-51 INF380 – Proteomics Chapter 5 – Fundamentals of Mass Spectrometry Mass spectrometry (MS) is used for measuring the mass-to-charge.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Genome of the week - Enterococcus faecalis E. faecalis - urinary tract infections, bacteremia, endocarditis. Organism sequenced is vancomycin resistant.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Pulsed Field Gel Electrophoresis In normal electrophoresis - electrophoretic mobility is independent of molecular weight for large DNA (> 50 kbp) elongate.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Overview of Mass Spectrometry
A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.
Separates charged atoms or molecules according to their mass-to-charge ratio Mass Spectrometry Frequently.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.
Chem. 133 – 4/26 Lecture. Announcements Return graded quiz and additional problem Lab – Lab report deadlines (2:4 – Thursday) Today’s Lecture – Mass Spectrometry.
Chapter 29 Mass Spectrometry. 29 A Principles of mass spectrometry In the mass spectrometer, analyte molecules are converted to ions by applying energy.
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: Proteomic Strategies in the Search of New Biomarkers.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
Mass Spectrometry makes it possible to measure protein/peptide masses (actually mass/charge ratio) with great accuracy Major uses Protein and peptide identification.
Proteomics Informatics – Overview of Mass spectrometry (Week 2)
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
2 Dimensional Gel Electrophoresis
S. Emonet, H.N. Shah, A. Cherkaoui, J. Schrenzel 
Tianwei Yu Department of Biostatistics and Bioinformatics
Instrumental Chemistry
Proteomics Informatics David Fenyő
S. Emonet, H.N. Shah, A. Cherkaoui, J. Schrenzel 
Feature extraction and alignment for LC/MS data
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Shotgun Proteomics in Neuroscience
Proteomics Informatics David Fenyő
Presentation transcript:

Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational issues

Background High-throughput profiling of biological samples (Picture edited from Metabolites Red line: central dogma Blue line: interaction DNA: genotype, copy number, epigenetics... RNA: expression levels, alternative splicing, microRNA … Protein: concentration, modification, interaction … Metabolite: concentration, modification, interaction …

Why Mass Spectrometry The question: In the biological system, there are tens of thousands (species) of proteins and metabolites. How to identify and quantify them from a sample? Which protein is this? Does it change significantly between control/disease samples?

Background In a complex network, even if we know the entire structure, the network behavior is hard to predict. Direct profiling gives us snapshots of the status of the system. (Picture from KEGG PATHWAY)

Proteins/metabolites could be separated according to their properties: mass/size hydrophilicity/hydrophobicity binding to specific ligands charge … … Using Chromatography Electrophoresis … … Why Mass Spectrometry

Problems with these separation techniques: Reproducibility Identification / Quantification Inability to separate tens of thousands of species Mass Spectrometry: Highly accurate, highly reproducible measurements Theoretical values easy to obtain  identification Can study protein modifications (small ligands attached) Measurements based on mass/charge ratio (m/z)

Mass Spectrometry --- getting ion from solution to gas phase Matrix assisted laser desorption ionization (MALDI) Electrospray ionization (ESI) Picture provided by Prof. Junmin Peng (Emory)

Mass Spectrometry --- finding m/z Time-of-flight: Putting a charged particle in an electric field, the time of flight is k: a constant related to instrument characteristics

Quadrupole: Radio-frequency voltage applied to opposing pair of poles. Only ions with a specific m/z can pass to the detector at each frequency. Mass Spectrometry --- finding m/z

Fourier transform MS. Ions detected not by hitting a detector, but by passing by a detecting plate. Ions detected simultaneously. Very high resolution. m/z detected based on the frequency of the ion in the cyclotron. Mass Spectrometry --- finding m/z

Why is simple MS not enough A biological sample consists of tens of thousands of species of molecules. The resolution is not enough for clear separation. Biological interactions between the molecules may interfere with ionization. The solution: Multi-dimensional separation: combining MS with protein breakage by enzymatic digestion and collision decomposition electrophoresis chromatography

Tandem Mass Spec (MS/MS) for protein identification Picture provided by Prof. Junmin Peng (Emory)

2D gel MS/MS Control samples Treatment samples Differential spots In-gel digestion MS/MS protein identification

Int J Biol Sci 2007; 3: D gel differential protein finding  in-gel digestion  MS/MS protein identification

LC/MS Liquid chromotography retention time Mass-to-charge ratio (m/z) Take “slices” in retention time, send to MS

LC/MS-MS Picture provided by Prof. JunminPeng (Emory)

LC/MS-MS Here is an example of LC/MS spectrum. The second MS serves the purpose of protein identification. Matching the sequence found by the second MS falls into the realm of sequence comparison and database search. Peak quantification is done by the first MS. (a) Original spectrum; (b) square root-transformed spectrum to show smaller peaks; (c) A portion of the spectrum showing details.

Between proteomics and metabolomics Proteomics uses LC/MS-MS. The second MS is for protein identification. Metabolomics uses LC/MS. Sometimes a second MS is used, but data interpretation for metabolite identification is much harder. What concerns statisticians: (1) The shared LC/MS part: In metabolomics: quantification, identification In proteomics: quantification (2) The second MS: Protein identification: sequence modeling/comparison Protein quantification: merging values from different peptides from the same protein.

Some computational issues in LC/MS-MS Modeling peaks. Noise reduction & peak detection Multiple peaks from one molecule caused by (1) isotopes (2) multiple charge states Retention time correction. Peak alignment. Peak quantification, especially with overlapping peaks caused by m/z sharing (mostly in metabolomics) From peptides to proteins.

General workflow for LC/MS

Modeling peaks In high-resolution LC/MS data, every peak is a thin slice --- there is no need to model the MS dimension. Modeling the LC dimension is important for quantification. Models have been developed for traditional LC data, which can be applied here. Most empirical peak shape models were derived from Gaussian model. Changes were made to account for asymmetry in the peak shape.

Modeling peaks Asymmetric peak. “asymmetry factor”: b/a at 0.1h Data Analysis and signal processing in chromatography. A. Felinger

Modeling peaks The bi-Gaussian model: The area under peak is: Data Analysis and signal processing in chromatography. A. Felinger

Modeling peaks Generalized exponential function Data Analysis and signal processing in chromatography. A. Felinger

Modeling peaks Data Analysis and signal processing in chromatography. A. Felinger Log-normal function.

Noise reduction Reviewed by Katajamaa&Oresic (2007) J Chr. A 1158:318

Noise reduction Signal-to-noise (S/N) ratio Where to make the cut? Should it be a straight line or a smoother?

Anal Chem Feb 1;78(3): Using filters to detect peak from noise in conjunction with hard cutoff. Noise reduction & peak detection

Matched filter. Calculate the convolution of the signal (x) with the reverse of the standardized peak shape model (f). Try to minimize with regard to the peak height alpha and the peak location tau. Take differential: Data Analysis and signal processing in chromatography. A. Felinger

Noise reduction & peak detection With data from Gaussian model: The above equations become: The goal is to find where The corresponding is the peak intensity. Data Analysis and signal processing in chromatography. A. Felinger

Noise reduction & peak detection Data Analysis and signal processing in chromatography. A. Felinger

Anal Chem Feb 1;78(3): Retention time correction With every run, the LC dimension data has some fluctuation. Identify “reliable” peaks in both samples, use non-linear curve fitting to adjust the retention time.

Multiple peaks from one molecule Caused by multiple charge states (z = 1, 2, 3,……), and different number of carbon isotopes present in the molecule. Example: m=1000 (all C12) charges 2 charges single charge

Peak alignment Reviewed by Katajamaa&Oresic (2007) J Chr. A 1158:318

Dynamic programming. BMC Bioinformatics 2007, 8:419 Peak alignment

Anal Chem Feb 1;78(3): Peak alignment First align m/z dimension by binning. Use kernel density estimation to find “meta-peaks”.

Dealing with overlapping peaks (1) Matched filter. (2) Some traditional methods. Data Analysis and signal processing in chromatography. A. Felinger

Dealing with overlapping peaks (3) Statistical modeling using the EM algorithm Bi-Gaussian mixture Gaussian mixture

Anal Chem Feb 1;78(3): An example of the overall strategy in LC/MS metabolomics

In a complex biological sample (cell, tissue, serum, … ), there are several thousand proteins – tens of thousands of peptides after digestion; signal from less-abundant species may be suppressed. Solution: Must reduce complexity to identify and quantify proteins. Incorporate biochemical separation techniques: LC-MS/MS LC/LC-MS/MS …… 2D gel-MS/MS 2D gel/LC-MS/MS Affinity column separation – LC-MS/MS Separate proteins in multiple dimensions. Sacrifice speed. Analyze a subset of proteins. Sacrifice coverage. Beyond LC/MS-MS

Nature. 452:571. Fig. 1 Beyond LC/MS-MS Right: LC/LC/LC- MS/MS