Identification of Post Translational Modifications

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
From Genome to Proteome Juang RH (2004) BCbasics Systems Biology, Integrated Biology.
Post-Translational Modifications: CrossTalk Robert Chalkley Chem 204.
Proteomics Informatics – Protein characterization I: post-translational modifications (Week 10)
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Proteomics Informatics – Protein identification III: de novo sequencing (Week 6)
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
De Novo Sequencing of MS Spectra
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics Workshop Part II: Protein Characterization David Fenyö February 18, 2011 Top-down/bottom-up proteomics Post-translational modifications.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
GSAT501 - proteomics Name, home-town Students – previous lab experience –Lab you hope to end up in? Teachers – what is your current project.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Genome of the week - Enterococcus faecalis E. faecalis - urinary tract infections, bacteremia, endocarditis. Organism sequenced is vancomycin resistant.
Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted.
Isobaric tags for quantitative analysis Joshua J. Coon U. Wisconsin-Madison.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Click to add Text Sample Preparation for Mass Spectrometry Sermin Tetik, PhD Marmara University July 2015, New Orleans.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Constructing high resolution consensus spectra for a peptide library
What is proteomics? Richard Mbasu and Ben Richards.
Ho-Tak Lau, Hyong Won Suh, Martin Golkowski, and Shao-En Ong
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: Proteomic Strategies in the Search of New Biomarkers.
Goals in Proteomics Identify and quantify proteins in complex mixtures/complexes Identify global protein-protein interactions Define protein localizations.
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
Protein/Peptide Quantification
Mass spectrometry-based proteomics
Thomas BOTZANOWSKI & Blandine CHAZARIN
MCB test 2 Review M. Alex Miranda 11/5/16.
Bioinformatics Solutions Inc.
Quantitative proteomics reveals daf‐16‐mediated reduction in protein metabolism in long‐lived daf‐2(e1370) mutants. Quantitative proteomics reveals daf‐16‐mediated.
General Overview of the module and the methods
Identification of DJ-1/PARK-7 as a determinant of stroma-dependent and TNF-α–induced apoptosis in MDS using mass spectrometry and phosphopeptide analysis.
Proteomics Informatics David Fenyő
Volume 7, Issue 8, Pages (August 2000)
Quantifying Ubiquitin Signaling
Proteomic Approaches to Cancer Biomarkers
Interpretation of Mass Spectra I
Analytical Characteristics of Cleavable Isotope-Coded Affinity Tag-LC-Tandem Mass Spectrometry for Quantitative Proteomic Studies  Cecily P. Vaughn, David.
A perspective on proteomics in cell biology
Volume 31, Issue 3, Pages (August 2008)
Proteomics Informatics –
Volume 24, Issue 11, Pages e5 (November 2017)
A Photoreactive Small-Molecule Probe for 2-Oxoglutarate Oxygenases
Volume 20, Issue 12, Pages (December 2013)
Volume 7, Issue 8, Pages (August 2000)
Relative quantitation of phosphopeptides from conditioned media from subtype specific breast cancer cell lines. Relative quantitation of phosphopeptides.
A, Base peak chromatogram of apomyoglobin digest generated by 0
SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage
Fus3-Regulated Tec1 Degradation through SCFCdc4 Determines MAPK Signaling Specificity during Mating in Yeast  Song Chou, Lan Huang, Haoping Liu  Cell 
2D-LC-MS/MS analysis of tryptic digest of HEK293-SUMO3 cells (2 μg inj
Overview of the analytical workflow used in this study and a representative MS/MS spectrum.a, Overview of the analytical workflow used in this study. Overview.
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Shotgun Proteomics in Neuroscience
SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage
The Coming Age of Complete, Accurate, and Ubiquitous Proteomes
Volume 22, Issue 7, Pages (July 2015)
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra
Kuen-Pin Wu Institute of Information Science Academia Sinica
Sample preparation Protein and peptide separation techniques Karel Bezstarosti (Proteomics Center, Erasmus MC)
MS3 for peptide identification and mapping phosphorylation sites
A Primer on Concepts and Applications of Proteomics in Neuroscience
Phosphorylated tyrosine residues in FGFR3, FGFR3(K650E), FGFR3-TACC3, and FGFR3(K650E)-TACC3 identified by mass spectrometry analysis. Phosphorylated tyrosine.
Presentation transcript:

Identification of Post Translational Modifications Beatrix Ueberheide March 25th 2019

Homework from last lecture

Assigned Paper

Ubiquitination - MS

Jackson Paper: USP4 ubiquitination Catalytic dead USP4 (Cys311A) no longer interacts with CtIP and MRN Could it be that ubiquitination of USP4 is blocking the interaction with CtIP and MRN and that inactive USP4 can no longer de-ubiquitinates itself and hence prevents interaction? Co-expressed USP4 with HA tagged ubiquitin: GFP-USP4 WT; GFP-USP4 CD; GFP Performed affinity purification using the HA tag and Western Blot using the GFP handle Performed GFP affinity purification followed by LC-MS Found a sh%^$ load of ubiquitin

Table S4

Sites of ubiquitination on USP4 CD WT STLVCPECAK 1164.5388 [M+1H]+1

Ubiquitinated spectrum

Which cysteine is ubiquitinylated?

Let’s check public databases: www.thegpm.org

From The GPM

From The GPM

Could it be this?

Easy Problem to have avoided

Figure Legend

Figure Legend

Figure Legend

Figure Legend

Characterizing PTMs

Detecting PTMs In theory quite simple, only the mass of the amino acid changes In practice difficult for several reasons Peptide is too short or too long to be detected Not enough sequence coverage to assign the site of modification Modification is present only in a small percentage Peptide is never selected for MS/MS Several amino acids in the peptide could be modified Modification might interfere with fragmentation Not enough sequence coverage to identify the peptide

Biological Mass Spectrometry Proteolytic digestion Protein(s) Peptides Base Peak Chromatogram MS 500 1000 1500 m/z Time (min) Mass Spectrometer 200 600 1000 m/z MS/MS Database Search Manual Interpretation

Searching Proteomics Data GSFLYEYSRRHPEYAVSVLLRLAKEYEATLEECCAKDDPHACYSTVFDKLKHLVDEPQNLIKQNCDQFEKGEYGFQNALIVRYTRKVPQVSTPTLVEVSRSLGKVGTRCCTKPESERMPCTEDYLSLILNRLCVLHEKTPVSEKVTKCCTESLVNRRPCFSALTP Protein Digestion LFTFHADICTLPDTEK 1850.8993 RPCFSALTPDETYVPK 1823.8906 MPCTEDYLSLILNR 1667.8131 VPQVSTPTLVEVSR 1511.8427 DDPHACYSTVFDK 1497.6314 Peptide Mass Measurement 500 1000 1500 m/z 1850.8993 1850.8906 1850.8805 1850.8914 1850.8868 924.9537 MS Peptide Fragmentation 200 600 1000 m/z MS/MS

Proteomics is more than Mass Spec Enrichment Fractionation Sample Protein Extraction Protein Digestion Data Analysis LC-MS Peptide Clean up Enrichment Fractionation

Inherent difficulty of proteome characterization Proteolytic digestion Protein(s) Peptides Size of peptides depends entirely on the protein sequence (K, R, D, E, Y, F,….) Peptides too short or too long are not detectable If those peptides carry PTMs, they go undetected as well

Inherent difficulty of proteome characterization Proteolytic digestion Protein(s) Peptides Protein A Protein B Protein C Protein D Protein E One protein group is reported!

Inherent difficulty of proteome characterization Proteolytic digestion Protein(s) Peptides Protein A Cancer Protein A Healthy

The Central Dogma Proteomics Metabolomics seconds to minutes hours (seconds) Genomics Transcriptomics Proteomics Metabolomics Adapted from Patti et al (2011) Nature Reviews| Molecular Cell Biology

Proteomics

Proteomics Protein Production/Changes Protein PTM modification/Changes Protein-Protein Interaction/Changes Protein Structural Changes

Dynamic Range 200ng HeLa lysate: ~2500 proteins >8000 proteins 10 20 30 40 50 60 70 80 90 100 110 120 Time (min) Relative Abundance 444.74 477.31 408.73 492.79 652.36 472.77 425.72 599.76 416.25 590.81 566.77 567.78 420.79 533.32 644.73 757.40 895.95 446.91 655.85 584.80 680.37 533.27 726.05 911.40 455.32 899.46 200ng HeLa lysate: ~2500 proteins >8000 proteins 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 Time (min) Relative Abundance 575.31 395.24 476.23 671.82 682.70 480.79 492.75 467.26 518.21 547.32 663.58 829.38 489.96 371.10 518.03 637.98 749.80 669.59 756.43 992.46 623.58 593.83 840.41 738.06 871.38 200ng Human Plasma: ~250 proteins

Success Rate of a Proteomics Experiment Distribution of Protein Amounts Number of Proteins Fractionation Enrichment Proteins Detected Log (Protein Amount) DEFINITION: The success rate of a proteomics experiment is defined as the number of proteins detected divided by the total number of proteins in the proteome. J. Eriksson, D. Fenyö, "Improving the success rate of proteome analysis by modeling protein-abundance distributions and experimental designs". Nature Biotechnology 25 (2007) 651-655.

Fractionation and Enrichment One sample becomes 10 or more Fractions Protein identification increases from ~2000 to ~8000 ( 2 hours versus 60 hours instrument time)

Fractionation and Enrichment increases the MS instrument time

Multiplexing for Quantitation SILAC TANDEM MASS TAGS

SILAC

Multiplex Analysis (TMT)

iTRAQ

Multiplex Analysis (TMT)

TMT (iTRAQ)

MS2 of a peptide Peptide: AVDTWSWGER Protein: Galectin 3 binding protein May stimulate host defense against viruses and tumor cells

Zoom in

Quantitative multiplexed proteomic analysis of TNKS DKO cells 7,254 proteins quantified: 608 showed significant change in abundance 287 increased in the DKO 74 of the 287 contained a TNKS-binding site 23 of the 74 tankyrase targets are shown in the heatmap Highlighted proteins were selected for validation Bhardwaj A., Yang, Y., Ueberheide, B., Smith, S., Whole proteome analysis of human tankyrase knockout cells reveals targets of tankyrase-mediated degradation, Nature Communications, 8:2214 (2017)

Characterizing PTMs

Modified Peptide is not abundant enough to be selected for MS/MS enriching for the modification Phospho specific enrichment (IMAC, TiO2) Antibody based enrichment (ubiquitin, phosphotyrosine)

Fractionation and Enrichment Total Cell Lysate 1824 Protein Groups 12389 Peptides 26 phosphorylated peptides 100 % S & T 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105 110 115 120 125 130 135 Time (min) 5 10 15 Relative Abundance Total cell lysate 1 HILIC Fraction 917 Protein Groups 1208 Peptides 95 phosphorylated peptides 100 % S & T 0 % Y 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 Time (min) 5 10 15 60 65 70 75 80 85 90 95 100 Relative Abundance

Fractionation and Enrichment 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 Time (min) 5 10 15 60 65 70 75 80 85 90 95 100 Relative Abundance Total cell lysate 1 HILIC Fraction 917 Protein Groups 1208 Peptides 95 phosphorylated peptides 100 % S & T 0 % Y Total Cell Lysate 1 HILIC Fraction TiO2 Enrichment 347 Protein Groups 419 Peptides 387 phosphorylated peptides 99.2 % S & T 0.8 % Y 20 22 24 26 28 30 32 34 36 38 40 42 44 46 48 50 52 54 56 58 60 62 64 Time (min) 5 10 15 25 35 45 55 65 70 75 80 85 90 95 100 Relative Abundance

Enrichment for pY Pre-Enrichment Post-Enrichment NL: 2.44E9 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105 110 115 120 125 130 135 140 145 Time (min) Relative Abundance 477.3055 472.7690 416.2502 488.7277 442.5899 445.9025 652.3596 872.4121 590.8138 569.2763 796.9095 516.8010 489.6061 644.7284 566.7671 714.3454 452.2126 526.9341 499.7473 895.9489 600.2489 880.4102 584.8013 911.4047 933.7854 406.3527 470.9005 498.8081 989.7180 537.8790 939.4348 450.3788 434.3840 419.1702 900.5027 995.0111 445.1200 633.2947 840.0427 406.3518 681.2819 714.3449 476.3057 453.3427 432.2793 520.3320 575.7687 493.5727 608.3846 563.5903 475.8850 644.7281 516.8008 564.3422 445.1194 445.1195 567.9413 796.9098 489.3137 487.6485 511.3267 516.9537 450.3782 599.3796 445.1192 445.1191 445.1189 NL: 2.44E9 NL: 2.50E7 Pre-Enrichment Post-Enrichment

Enrichment for pY Pre-Enrichment Post-Enrichment 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105 110 115 120 125 130 135 140 145 Time (min) Relative Abundance 477.3055 472.7690 416.2502 488.7277 442.5899 445.9025 652.3596 872.4121 590.8138 569.2763 796.9095 516.8010 489.6061 644.7284 566.7671 714.3454 452.2126 526.9341 499.7473 895.9489 600.2489 880.4102 584.8013 911.4047 933.7854 406.3527 470.9005 498.8081 989.7180 537.8790 939.4348 450.3788 434.3840 419.1702 900.5027 995.0111 445.1200 633.2947 840.0427 406.3518 681.2819 714.3449 476.3057 453.3427 432.2793 520.3320 575.7687 493.5727 608.3846 563.5903 475.8850 644.7281 516.8008 564.3422 445.1194 445.1195 567.9413 796.9098 489.3137 487.6485 511.3267 516.9537 450.3782 599.3796 445.1192 445.1191 445.1189 NL: 2.44E9 NL: 2.50E7 Pre-Enrichment   Total number of Proteins Total number of Peptides Total Phospho Peptides Tyrosine Phospho Peptides Pre-Enrichment 2364 18826 221 5 Post-Enrichment 988 2919 583 574 Post-Enrichment

Enrichment Strategies for the Detection of Phosphorylated Peptides Unphosphorylated single phosphorylation multiple phosphorylation Hydrophilic Interaction Chromatography (HILIC) Phosphopeptides elute later than their unphosphorylated counterparts Stationary phase is hydrophilic Mobile phase is hydrophobic Peptide elute when it is more soluble in the solvent, the higher the charge the less soluble in the organic phase Works for PO4 but other modifications have other ways to enrich

Enrichment Strategies for the Detection of Phosphorylated Peptides Time (min) neutral peptides basic peptides SCX Strong Cation Exchange Chromatography Stationary phase is negatively charged Mobile phase is a buffer that is increasing the pH (if peptide becomes neutral it elutes) Neutral peptides elute earlier: XXpSxxxxxR/K Positive peptides elute late: XXXXHXXXXR/K

Several Strategies are often combined

Multiplexed Mouse Study 10mg of protein for each of the 27 samples

Multiplexed Mouse Study 49 x 3 hr LC-MS analysis (>147 hours of instrument time)

Is the protein or the phosphorylation changed?

What if the enrichment worked, but you still don’t ‘see’ much?

MS/MS is not informative m/z

MS/MS is not informative Peptide with two possible modification sites

MS/MS is not informative Peptide with two possible modification sites MS/MS spectrum m/z

MS/MS is not informative Peptide with two possible modification sites Matching MS/MS spectrum m/z

MS/MS is not informative Peptide with two possible modification sites Matching MS/MS spectrum m/z

MS/MS is not informative Peptide with two possible modification sites Matching MS/MS spectrum m/z

Change the way the peptide get’s dissociated in the mass spectrometer What if enough of the modified peptide is present, but still no ID is possible? Change the way the peptide get’s dissociated in the mass spectrometer

Tandem MS - Dissociation Techniques CAD: Collision Activated Dissociation (b, y ions) increase of internal energy through collisions

Tandem MS - Dissociation Techniques CAD: Collision Activated Dissociation (b, y ions) increase of internal energy through collisions ETD: Electron Transfer Dissociation (c, z ions) bombardment of peptides with electrons (radical driven fragmentation)

Tandem MS - Dissociation Techniques CAD: Collision Activated Dissociation (b, y ions) selective fragmentation ETD: Electron Transfer Dissociation (c, z ions) more random fragmentation

Detecting PTMs In theory quite simple, only the mass of the amino acid changes In practice difficult for several reasons Peptide is too short or too long to be detected Not enough sequence coverage to assign the site of modification Modification is present only in a small percentage Peptide is never selected for MS/MS Several amino acids in the peptide could be modified Modification might interfere with fragmentation Not enough sequence coverage to identify the peptide

Searching for PTMs

CAD versus ETD x x Low charge Modified from Joshua Coon, Analytical Chemistry, 81, 3208-3215 (2009)

Doubly charged peptides

CAD versus ETD x Low charge Labile modifications Modified from Joshua Coon, Analytical Chemistry, 81, 3208-3215 (2009)

Phosphopeptides

O-Sulfonated Peptides

CAD versus ETD x x Low charge Labile modifications Intact proteins Modified from Joshua Coon, Analytical Chemistry, 81, 3208-3215 (2009)

Intact Histone Protein CAD ETD

Intact Histone Protein CAD M M P Ac Ac Ac P M P Ac M Ac M M M P P M M H3 1-ARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPTVALRE-50 ETD

Large highly charged peptides 1:1000 +6 415.7 m/z 11 12 13 14 Time (min) 415 m/z