MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
The Proteomics Core at Wayne State University
Proteomics Examination Yvonne (Bonnie) Eyler Technology Center 1600 Art Unit 1646 (703)
Quantification of low-abundance proteins in complexes and in total cell lysates by mass spectrometry Bastienne Jaccard and Manfredo Quadroni Université.
Yoona Kim University of California, San Diego UCSD Mass Spectrometry Journal Club 12/03/10.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
PROTEOMICS LECTURE. Genomics DNA (Gene) Functional Genomics TranscriptomicsRNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Goals in Proteomics 1.Identify and quantify proteins in complex mixtures/complexes 2.Identify global protein-protein interactions 3.Define protein localizations.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS Gygi et al (2003) PNAS 100(12), presented by Jessica.
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Proteomics Informatics Workshop Part II: Protein Characterization David Fenyö February 18, 2011 Top-down/bottom-up proteomics Post-translational modifications.
Proteome.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
A highly abbreviated introduction to proteomics
Phosphoproteomics and motif mining Martin Miller Ph.d. student CBS DTU
The dynamic nature of the proteome
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
© 2010 SRI International - Company Confidential and Proprietary Information Quantitative Proteomics: Approaches and Current Capabilities Pathway Tools.
PROTEIN QUANTIFICATION AND PTM JUN SIN HSS.I. PROJECT 1.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
Proteomics The science of proteomics Applications of proteomics Proteomic methods a. protein purification b. protein sequencing c. mass spectrometry.
Quantification of Membrane and Membrane- Bound Proteins in Normal and Malignant Breast Cancer Cells Isolated from the Same Patient with Primary Breast.
Genome of the week - Enterococcus faecalis E. faecalis - urinary tract infections, bacteremia, endocarditis. Organism sequenced is vancomycin resistant.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted.
Overview of Mass Spectrometry
Proteome and Gene Expression Analysis Chapter 15 & 16.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Oct 2011 SDMBT1 Lecture 11 Some quantitation methods with LC-MS a.ICAT b.iTRAQ c.Proteolytic 18 O labelling d.SILAC e.AQUA f.Label Free quantitation.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
Using Scaffold OHRI Proteomics Core Facility. This presentation is intended for Core Facility internal training purposes only.
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Ho-Tak Lau, Hyong Won Suh, Martin Golkowski, and Shao-En Ong
Novel Proteomics Techniques
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: Proteomic Strategies in the Search of New Biomarkers.
Goals in Proteomics Identify and quantify proteins in complex mixtures/complexes Identify global protein-protein interactions Define protein localizations.
2 Dimensional Gel Electrophoresis
Protein/Peptide Quantification
MCB test 2 Review M. Alex Miranda 11/5/16.
Bioinformatics Solutions Inc.
General Overview of the module and the methods
Proteomics Informatics David Fenyő
Analytical Characteristics of Cleavable Isotope-Coded Affinity Tag-LC-Tandem Mass Spectrometry for Quantitative Proteomic Studies  Cecily P. Vaughn, David.
A perspective on proteomics in cell biology
Proteomics Informatics –
The potential for proteomic definition of stem cell populations
The potential for proteomic definition of stem cell populations
NoDupe algorithm to detect and group similar mass spectra.
Is Proteomics the New Genomics?
The principle of the immuno-SILAC method.
Shotgun Proteomics in Neuroscience
Methods for the Elucidation of Protein-Small Molecule Interactions
Proteomics Informatics David Fenyő
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics 1

Detection Proteomics  Which proteins are there? Which are abundant?  Which part of a protein is there?  Are there changes in protein presence or abundance? Modification Proteomics  Which proteins have posttranslational modifications?  What fraction of a protein pool is modified?  Are there changes in protein modification level? Interaction Proteomics  Which bind to each other or form complexes?  What fraction of a protein pool is bound/complexed?  Are there changes in protein interaction patterns?

Two-dimensional gel electrophoresis  Detektion (and possibly quantification) of entire proteins.  limited scope, lack of reproducibility  old school Mass spectrometry  Possibly coupled with liquid chromatography (LC)  Detection (and possibly quantification) of peptides  Requires sophisticated instrumentation Specific protein tags  Candidate proteins have to be known and be modified artificially  only suitable for special applications Antibodies  Candidate proteins have to be known  Specific antibodies have to be available  only suitable for small- and medium-scale studies

1st Dimension Isoelectric focussing Separation by isoelectric point 2nd Dimension SDS PAGE Separation by size Modification

Differences in Gel properties can be (partially) compensated Protein Recognition Can to (some degrees) be done directly on gel by a combination of IEP/MW-values and recognition of spot patterns. Usually, MS analysis of eluted spots required.

How many proteins can be detected ? In a 4 h run on the newest generation instrument: peptides proteins Sample requirement 4 µg peptide sample Sample preparation time 6h plus digest time

H – N – C – C – N – C – C – N – C – C – N – C – C – OH R1R1 R2R2 R3R3 R4R4 OOO HHHHHHHH O a 1 b 1 c 1 a 2 b 2 c 2 a 3 b 3 c 3 x 3 y 3 z 3 x 2 y 2 z 2 x 1 y 1 z 1 H+H+ a,b,c ions: charge retained in N-terminal fragment x,y,z ions: charge retained in C-terminal fragment The type of generated ion depends on the MS method

In principle, the fragmentation pattern of a peptide could result in a complete series of (e.g.) b-ions that allows the determination of the peptide sequence. Real data typically allows only identification of 3mer or 4mer peptides. Short tags can identify a peptide if combined with additional data (size) GWSV K/R This method can tolerate some degree of modification or variability, even if this is unknown/unexpected. Identification of suitable tags is difficult, often done manually

The entire fragment spectrum of a peptide is compared to a database of expected spectra for every possible peptide. Possible peptides are taken from a proteome-wide sequence database, taking the cleaving enyzme into account. Even if some ions are missing or too much, the correcpt peptide can be identified by a good correlation of expected and observed pattern. Problems with polymorphisms, modifications, other unexpected things.

Nowadays rarely done Can introduce bias Protein separation is often not very reproducible Reduces sample/spectrum complexity Spectra contain data of fewer proteins. But: more spectra have to be measured Allows quantification at protein level Protein amounts (e.g. LC peaks) more quantitative than peptide amounts or even MS peaks. Allows the detection of minor components Minor proteins are not overwhelmed by peptids from major components

missed cleavage site matches this protein, but also another one (with better score)

b* = b without NH3 b0 = b without H2O b++= b with two charges

Mass Spectrometry is not a quantitative method Different ions have different physicochemical properties, 'flyability', stability etc. Quantification before MS In some settings, it is possible to quantify the proteins or peptides before MS analysis, e.g. by gels, LC. Semi-quantitative 'label-free' approaches While the peak intensity does not correlate with protein abundance, the peptide count can be used for quantification (iBAQ, spectral counting, PAI) Quantitative labeling approaches Allow quantitative comparison of protein abundance under various conditions (iCAT, iTRAQ, SILAC)

Peptide frequency is correlated with protein abundance only semi-quantitative requires comparison over multiple runs, conditions must be stable & reproducible normalization for protein size required (big proteins generate more peptides) Observed vs. Non-observed Frequently used line of argumentation. Statistics difficult; effects are easily overestimated Difference 6 vs 0 observations typically not significant. (Audic &Claverie statistics) Protein abundance index (PAI) The number of observed peptides divided by the number of observable peptides per protein. Related to the logarithm of protein abundance

Stable isotope labeling by amino acids in cell culture A frequently used type of metabolic labeling One cell culture is fed with Lys/Arg containing light C 12 atoms One cell culture is fed with Lys/Arg containing heavy C 13 atoms Proteins from the two cultures are mixed and analysed in a single experiment. The proteins and resulting peptides behave identically During MS analysis, all labeled ions appear as a duplet with a defined size difference. The intensity ratio of these peaks is a good proxy for the ratio of protein abundance in the two cultures

Two commercially available variants: TMT: Tandem Mass Tag iTRAQ: Isobaric Tag for relative and absolute quantitation Labeling is not done metabolically but at the protein or peptide level. Isobaric properties ensure that peptide differences are only observed after fragmentation

Demonstration von Human Protein Atlas ( ArrayExpress Expression atlas (