Finding the unexpected in SWATH™ Data Sets – Implications for Protein Quantification Ron Bonner; Stephen Tate; Adam Lau AB SCIEX, 71 Four Valley Drive,

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
Using Skyline to Monitor Long- Term Performance Metrics of High-Resolution Mass Spectrometers J. Will Thompson and M. Arthur Moseley Duke Proteomics Core.
The Proteomics Core at Wayne State University
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Monitoring of temporal changes in phosphorylation states of proteins using mass spectrometry and a chemical labeling strategy RESULTS INTRODUCTION It is.
1336 SW Bertha Blvd, Portland OR 97219
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
THE APPLICATION OF A SOFTWARE TOOL FOR THE TWO-DIMENSIONAL REPRESENTATION OF LC-MS DATA TO THE ANALYSIS OF POST-TRANSLATIONAL MODIFICATIONS AND THE COMPARISON.
Smart Templates for Chemical Identification in GCxGC-MS QingPing Tao 1, Stephen E. Reichenbach 2, Mingtian Ni 3, Arvind Visvanathan 2, Michael Kok 2, Luke.
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Previous Lecture: Regression and Correlation
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Scaffold Download free viewer:
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Introduction Recent research has proposed rapid and robust identification of intact microorganisms using matrix assisted laser desorption/ ionization time-of-flight.
The dynamic nature of the proteome
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
PROTEIN QUANTIFICATION AND PTM JUN SIN HSS.I. PROJECT 1.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Common parameters At the beginning one need to set up the parameters.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
A Phospho-Peptide Spectrum Library for Improved Targeted Assays Barbara Frewen 1, Scott Peterman 1, John Sinclair 2, Claus Jorgensen 2, Amol Prakash 1,
Laxman Yetukuri T : Modeling of Proteomics Data
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
A new "Molecular Scanner" design for interfacing gel electrophoresis with MALDI-TOF ThP Stephen J. Hattan; Kenneth C. Parker; Marvin L. Vestal SimulTof.
* CORRESPONDING AUTHOR Glucagon Bioanalysis by LC-MS: “Unprecedented Level of Sensitivity (10pg/mL) for a Novel Formulation” Jean-Nicholas Mess 1, Louis-Philippe.
INF380 - Proteomics-71 INF380 – Proteomics Chap 7 –Protein Identification and Characterization by MS Protein identification in our context means that we.
June 9th, 2013 Matthew J. Rardin June 9th, 2013 Matthew J. Rardin MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Isotope Labeled Internal Standards in Skyline
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Workflows to set up acquisition methods for scheduled sMRM-HR on the TripleTOF 5600 Start from a data dependent acquisition (DDA) Perform data base search.
Data independent acquisition methods for metabolomics Stephen Tate, Ron Bonner AB SCIEX, 71 Four Valley Drive, Concord, ON, L4K 4V8 Canada A high resolution.
Target Analyses in Parallel Reaction Monitoring Mode (PRM)
Custom peptide synthesis services In the quantitative proteomics research, several MS-based methodologies for relative quantification have been introduced.
Custom peptide synthesis services In the quantitative proteomics research, several MS-based methodologies for relative quantification have been introduced.
Large Scale DIA With Skyline
Jarrett Egertson, Ph.D. MacCoss Lab
Supplemental figures. Observed extracted ion chromatograms (XICs) of each of the ten peptides from five standard proteins using dual ion funnel and standard.
LC-MS/MS Identification of Impurities Present in Synthetic Peptide Drugs Dr Anna Meljon*, Dr Alan Thompson, Dr Osama Chahrour, and Dr John Malone Almac.
MassMatrix Search Results Explained
View  text zoom  large Set properties text size to 14 point
Agenda Welcome from the Skyline team!
UniProtKB - Q15165 (PON2_HUMAN) is quantified by two signature peptide
Bioinformatics Solutions Inc.
Presentation Title NEMC 2018 Dale Walker, Bruce Quimby Agilent
Proteomics Informatics David Fenyő
Volume 138, Issue 4, Pages (August 2009)
Volume 20, Issue 12, Pages (December 2013)
Top-down protein identification.
Skyline MS1 filtering graphical user interface.
2D-LC-MS/MS analysis of tryptic digest of HEK293-SUMO3 cells (2 μg inj
Shotgun Proteomics in Neuroscience
Tryptic phosphopeptides of AdIGFBP-5, [γ-32P]ATP-labeled in vitro by phosphorylation with CK2, were separated by HPLC and detected and sequenced by mass.
Sim and PIC scoring results for standard peptides and the test shotgun proteomics dataset. Sim and PIC scoring results for standard peptides and the test.
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra
Operation manual of AI SIDA
Presentation transcript:

Finding the unexpected in SWATH™ Data Sets – Implications for Protein Quantification Ron Bonner; Stephen Tate; Adam Lau AB SCIEX, 71 Four Valley Drive, Concord, ON, L4K 4V8 Canada RESULTS ABSTRACT Data independent acquisition (DIA) methods such as SWATH-MS™ acquisition allow the analysis of many compounds in complex mixtures and have demonstrated a very high degree of quantitation fidelity compared to MRM [1]. For quantitation the major advantages of SWATH acquisition over MRM are the number of analytes and the significant reduction in method development time, but the unbiased acquisition approach also provides the ability to perform additional quantitative and qualitative experiments. In fact, the “digital sample record” can provide a holistic view of the data, and sample, that is not possible by other methods. Here we describe a prototype qualitative browser and illustrate some of the insights that can be obtained. INTRODUCTION To date the main use of SWATH-MS™ acquisition is for reproducibly quantifying known compounds in complex mixtures. Chromatograms of known fragments are extracted from the Swath (25 amu window) expected to contain the precursor ion and the resulting peak groups (peaks appearing in all or several fragment traces) are scored and filtered in comparison to a decoy model. Scoring is based on the similarity of the fragment ion peaks (shape, retention time), their mass accuracy and similar parameters. Qualitative analysis can extract spectra in an untargeted manner or, as described here, can use a semi-targeted approach with the same scoring approach using either predicted precursor and fragments or by searching for the same fragment pattern in different SWATHs (different precursor mass) and at different retention times. Useful insights and capabilities include: Detecting process induced artifacts (miscleavages, deamidation, Met oxidation, etc.) caused by subtle sample preparation differences between laboratories and between researchers in a single lab that can adversely affect quantitation. Confirming protein identification and providing additional peptide matches that may have been missed. Fragment chromatograms are valuable because of the stochastic nature of shotgun ID and the likelihood of mis-identifications at low levels or from mixed peptides when only a single spectrum is available. Detection of unexpected fragments within the instrument that generate a perceived increase in sample complexity and redundant MS/MS spectra. Such in source fragments may also be fragmented but all have the same elution time. Detection of trypsin-induced artifacts, such as truncation, that resemble in source fragmentation but where the products have different retention times Distributing a peptide across several forms will adversely affect its quantitation hence detection is important particularly as the quantitation of compounds in complex samples becomes increasingly important. Similarly, targeting the correct peptides, and hence validation, is critical. The use of MRM is impossible given the large number of possibilities and the time required, but SWATH provides holistic data that can be used to confirm identifications and search for process artifacts. MATERIALS AND METHODS A series of samples were analyzed using a standard SWATH-MS™ acquisition method with 25 amu windows using a Nano LC introduction system and a AB Sciex 5600 TripleTOF® system. A library of peptide spectra was generated from IDA (shotgun) data using the ProteinPilot TM V4.5 software and the SWATH TM acquisition data were processed by extracting fragment chromatograms of confidently identified species in a research version of the PeakView® software with the SWATH Application which performed automated peak detection, transitions/peptides selection, and area extraction. The SWATH-MS data was also used in a prototype qualitative browser that automates the extraction and scoring of target fragments, from shotgun runs or manually specified, in the expected or all Swaths, and at the expected retention time (where known) or across the entire LC run. CONCLUSIONS 1.Extraction of ions for specific peptides from SWATH™-MS acquisition data allows the identity and reproducibility of modified species to be determined 2.Extraction of low confidence identifications, or undetected peptides, improves protein identification confidence and sequence coverage for quantitation 3.SWATH allows holistic review of the sample and identifies issues which could cause problems for quantification, such as the distribution of a peptide across many unexpected forms, and provides a consistent method for monitoring them REFERENCES 1.Gillet LC et al, Mol Cell Proteomics 2012, O Picotti P et al, Mol Cell Proteomics 2007, 6, 1589 TRADEMARKS/LICENSING For Research Use Only. Not for use in diagnostic procedures. The trademarks mentioned herein are the property of AB Sciex Pte. Ltd. or their respective owners. AB SCIEX ™ is being used under license. © 2013 AB SCIEX. Identification of Ladder Sequences Trypsin is expected to produce peptides with a consistent terminal group. While the notion of hindered and missed cleavage is well known, the idea that cleavage can occur at off target sites is poorly understood, poorly recognised and much debated. These peptides have previously been reported in samples at a higher concentration [2] but, as shown here, we also observe them in less concentrated samples. The browser can find “ladder” sequences (truncated forms) by searching for known peptide fragments across all SWATH™ acquisition experiments. If peak groups are found at different retention times they must correspond to distinct forms. Note, however, that this simple approach requires that either the C- or N-terminus is intact in the truncated form. Extending Sequence Coverage The example above shows the investigation of all predicted peptides from one protein in order to assist the identification by extending the sequence coverage. The two peptides shown were not identified by the IDA analysis but are clearly identified in the SWATH™ analysis. The unbiased approach provides spectra for all peptides, even those that would not have been targeted in a shotgun experiment because of mass, charge state, etc. The example below shows another example of this and compares the evidence for a confidently identified peptide to the unidentified peptide of comparable quality. In extreme cases, many ladder sequences are identified in the most abundant proteins. This complicates the identification of peptides suitable for quantification as the reproducibility of these different mis-cleaved forms is unknown. This figure shows all peptide forms detected for a single protein and reveals that only two peptides (green) are represented by a single tryptic peptide; all others have multiple forms which in general will invalidate their use for both normalization and also for quantification. (See also poster TP144). Although it is possible to identify these in shotgun experiments SWATH™acquisition allows determination of their reproducibility and hence their suitability for quantification. 1 The target protein can be transferred from the quantitative browser, entered manually or retrieved from a repository and digested. 2 One or all peptides can be processed by looking for selected fragments in the expected Swath(s) or all. A window around the expected retention time (if known from a shotgun experiment) or the whole run can be processed. 3 After processing the display shows the score (yellow -> red) in each Swath (if processed) and indicates the Swaths expected to contain the precursor (marked 1+, 2+…). 4 Selecting a peptide/Swath generates (lower, left to right): The overlaid fragment chromatograms for the highest scoring Swath. A summary of the scores for all Swaths and retention times; the cross-hairs indicate the one displayed. Here it is clear that several SWATHs have reasonable scores at the same retention time. The MS spectrum for the selected Swath, here showing the 2+ molecular ion of the selected peptide. A background subtracted MS/MS spectrum marked with the sequence ions that match those predicted (top right pane). The browser provides a highly interactive display of information concerning the selected peptides, the scores for the selected fragment ions in all Swaths and their retention time behaviour. The panes are linked so that changing the selected peptide updates all of the others. This example, from the SWATH™ acquisition analysis of an E. coli sample confirms the detection of four truncated versions of the same peptide. Since they are at different retention times they are distinct compounds and not the result of in source fragmentation. Augmenting Protein Identification Results Shotgun protein identification is based on a single snapshot MSMS spectrum of the parent ions isolated by Q1 region which introduces a number of issues. It can be difficult to confirm low level identifications since there is no way to verify that the fragment ions are from the same peptide and not from a mixed spectrum. Using SWATH™ acquisition it is possible to check the presence of the compound by showing that the product ions all elute with a similar profile and thereby increase confidence in the identification. These examples show confirmation of two peptides that were identified but with low confidence values. Extracting the expected fragment chromatograms reveals that they generate peak groups with reasonable scores at the expected retention times.