Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
De Novo Sequencing v.s. Database Search Bin Ma School of Computer Science University of Waterloo Ontario, Canada.
Data Processing Algorithms for Analysis of High Resolution MSMS Spectra of Peptides with Complex Patterns of Posttranslational Modifications Shenheng Guan.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
Sangtae Kim Ph.D. candidate University of California, San Diego
PROTEOMICS LECTURE. Genomics DNA (Gene) Functional Genomics TranscriptomicsRNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
My contact details and information about submitting samples for MS
Goals in Proteomics 1.Identify and quantify proteins in complex mixtures/complexes 2.Identify global protein-protein interactions 3.Define protein localizations.
Facts and Fallacies about de Novo Sequencing & Database Search.
Analysis of tandem mass spectra - II Prof. William Stafford Noble GENOME 541 Intro to Computational Molecular Biology.
Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS Gygi et al (2003) PNAS 100(12), presented by Jessica.
Spectral Counting. 2 Definition The total number of identified peptide sequences (peptide spectrum matches) for the protein, including those redundantly.
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Proteome.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Karl Clauser Proteomics and Biomarker Discovery Taming Errors for Peptides with Post-Translational Modifications Bioinformatics for MS Interest Group ASMS.
The dynamic nature of the proteome
© 2010 SRI International - Company Confidential and Proprietary Information Quantitative Proteomics: Approaches and Current Capabilities Pathway Tools.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
A Phospho-Peptide Spectrum Library for Improved Targeted Assays Barbara Frewen 1, Scott Peterman 1, John Sinclair 2, Claus Jorgensen 2, Amol Prakash 1,
Laxman Yetukuri T : Modeling of Proteomics Data
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Genome of the week - Enterococcus faecalis E. faecalis - urinary tract infections, bacteremia, endocarditis. Organism sequenced is vancomycin resistant.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Isotope Labeled Internal Standards in Skyline
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Oct 2011 SDMBT1 Lecture 11 Some quantitation methods with LC-MS a.ICAT b.iTRAQ c.Proteolytic 18 O labelling d.SILAC e.AQUA f.Label Free quantitation.
Click to add Text Sample Preparation for Mass Spectrometry Sermin Tetik, PhD Marmara University July 2015, New Orleans.
ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.
Constructing high resolution consensus spectra for a peptide library
DIA Method Design, Data Acquisition, and Assessment
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
Using Scaffold OHRI Proteomics Core Facility. This presentation is intended for Core Facility internal training purposes only.
Hanyang Univ. Introduction to Data Analyses for Mass Spectrometry-based Proteomics 1.
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Ho-Tak Lau, Hyong Won Suh, Martin Golkowski, and Shao-En Ong
Novel Proteomics Techniques
Goals in Proteomics Identify and quantify proteins in complex mixtures/complexes Identify global protein-protein interactions Define protein localizations.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Mass Spectrometry makes it possible to measure protein/peptide masses (actually mass/charge ratio) with great accuracy Major uses Protein and peptide identification.
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
2 Dimensional Gel Electrophoresis
Protein/Peptide Quantification
Mass spectrometry-based proteomics
Proteomics Informatics David Fenyő
Quantifying Ubiquitin Signaling
A perspective on proteomics in cell biology
Proteomics Informatics –
NoDupe algorithm to detect and group similar mass spectra.
Is Proteomics the New Genomics?
Shotgun Proteomics in Neuroscience
Methods for the Elucidation of Protein-Small Molecule Interactions
Proteomics Informatics David Fenyő
Identification of Post Translational Modifications
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted m/z of trypsinized proteins Tandem MS/MS (peptide sequencing): Pulls each peptide from the first MS Breaks up peptide bond Identifies each fragment based on m/z Collision cell 1 Now multiple types of collision cells: CID: collision induced dissociation ETD: electron transfer dissociation HCD: high-energy collision dissociation

Mass SpecMS Spectrum Ion sourceMass analyzerDetector Intro to Mass Spec (MS) Separate and identify peptide fragments by their Mass and Charge (m/z ratio) Basic principles: 1. Ionize (i.e. charge) peptide fragments 2. Separate ions by mass/charge (m/z) ratio 3. Detect ions of different m/z ratio 4. Compare to database of predicted m/z fragments for each genome 2

Mann Nat Reviews MBC. 5:699:711 3 How does each spectrum translate to amino acid sequence?

1.De novo sequencing: very difficult and not widely used (but being developed) for large-scale datasets 2.Matching observed spectra to a database of theoretical spectra 3.Matching observed spectra to a spectral database of previously seen spectra How does each spectrum translate to amino acid sequence? 4

Nesvizhskii (2010) J. Proteomics, 73: spectral matching is supposedly more accurate but … -limited to the number of peptides whose spectra have been observed before With either approach, observed spectra are processed to: group redundant spectra, remove bad spectra, recognized co-fragmentation, improve z estimates Many good spectra will not match a known sequence due to: absence of a target in DB, PTM modifies spectrum, constrained DB search, incorrect m or z estimate. 5

Result: peptide-to-spectral match (PSM) A major problem in proteomics is bad PSM calls … therefore statistical measures are critical Methods of estimating significance of PSMs: p- (or E-) value: compare score S of best PSM against distribution of all S for all spectra to all theoretical peptides FDR correction methods: 1.B&H FDR 2.Estimate the null distribution of RANDOM PSMs: - match all spectra to real (‘target’) DB and to fake (‘decoy) DB - often decoy DB is the same peptides in the library but reverse sequence one measure of FDR: 2*(# decoy hits) / (# decoy hits + # target hits) 3. Use #2 above to calculate posterior probabilities for EACH PSM 6

- mixture model approach: take the distribution of ALL scores S - this is a mixture of ‘correct’ PSMs and ‘incorrect’ PSMs - but we don’t know which are correct or incorrect - scores from decoy comparison are included, which can provide some idea of the distribution of ‘incorrect’ scores -EM or Bayesian approaches can then estimate the proportion of correct vs. incorrect PSM … based on each PSM score, a posterior probability is calculated FDR can be done at the level of PSM identification … but often done at the level of Protein identification 7

Error in PSM identification can amplify FDR in Protein identification Often focus on proteins identified by at least 2 different PSMs (or proteins with single PSMs of very high posterior probability) Nesvizhskii (2010) J. Proteomics, 73: Some methods combine PSM FDR to get a protein FDR 8

Some practical guidelines for analyzing proteomics results 1.Know that abundant proteins are much easier to identify 2.# of peptides per protein is an important consideration - proteins ID’d with >1 peptide are more reliable - proteins ID’d with 1 peptide observed repeatedly are more reliable - note than longer proteins are more likely to have false PSMs 3.Think carefully about the p-value/FDR and know how it was calculated 4.Know that proteomics is no where near saturating … many proteins will be missed 9

Quantitative proteomics 1.Spectral counting 2.Isotope labeling (SILAC) 3.Isobaric tagging (iTRAQ & TMT) 4.SRM Either absolute measurements or relatively comparisons 10

Spectral counting counting the number of peptides and counts for each protein Challenges: - different peptides are more (or less) likely to be assayed - analysis of complex mixtures often not saturating – may miss some peptides in some runs newer high-mass accuracy machines alleviate these challenges - quantitation comes in comparing separate mass-spec runs … therefore normalization is critical and can be confounded by error - requires careful statistics to account for differences in: quality of run, likelihood of observing each peptide, likelihood of observing each protein (eg. based on length, solubility, etc) Advantages / Challenges + label-free quantitation; cells can be grown in any medium - requires careful statistics to quantify - subject to run-to-run variation / error 11

SILAC (Stable Isotope Labeling with Amino acids in Cell culture) Cells are grown separately in heavy ( 13 C) or light ( 12 C) amino acids (often K or R), lysates are mixed, then analyzed in the same mass-spec run Mass shift of one neutron allows deconvolution, and quantification, of peaks in the same run. Advantages / Challenges: + not affected by run-to-run variation - need special media to incorporate heavy aa’s, - can only compare (and quantify) 2 samples directly - incomplete label incorporation can confound MS/MS identification 12

Isobaric Tagging iTRAQ or Tandem Mass Tags, TMTs LTQ Velos Orbitrap Each peptide mix covalently tagged with one of 4, 6, or 8 chemical tags of identical mass Samples are then pooled and analyzed in the same MS run Collision before MS 2 breaks tags – Tags can be distinguished in the small-mass range and quantified to give relative abundance across up to 8 samples. Advantages / Challenges: + can analyze up to 8 samples, same run - still need to deal with normalization 13

Selective Reaction Monitoring (SRM) Targeted proteomics to quantify specific peptides with great accuracy -Specialized instrument capable of very sensitively measuring the transition of precursor peptide and one peptide fragment -Typically dope in heavy-labeled synthetic peptides of precisely known abundance to quantify Advantages: - best precision measurements Disadvantages: - need to identify ‘proteotypic’ peptides for doping controls - expensive to make many heavy peptides of precise abundance - limited number of proteins that can be analyzed 14

Phospho-proteomics and Post-translational modifications (PTMs) 15 phosphorylated (P’d) peptides are enriched, typically through chromatography - P’d peptides do not ionize as well as unP’d peptides - enrichment of P’d peptides ensures ionization and aids in mapping IMAC: immobilized metal ion affinity chromatography - phospho groups bind charged metals - contamination by negatively-charged peptides Titanium dioxide (TiO 2 ) column: - binds phospho groups (mono-P’d better than multi-P’d) SIMAC: Sequential Elution from IMAC: - IMAC followed by TiO 2 column Goal: identify which residues are phosphorylated (Ser, Thr, Tyr), mapped based on known m/z of phospho group