Targeted Quantification of O-Linked Glycosylation Site for Glycan Distribution Analysis Scott M. Peterman 1, Amol Prakash 1, Bryan Krastins 1, Mary Lopez.

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

Improvements in Mass Spectrometry for Life Science Research – Does Agilent Have the Answer? Ashley Sage PhD.
Proteomics Informatics – Protein characterization I: post-translational modifications (Week 10)
Conclusion The workflow presented provides a strategy to incorporate unbiased glycopeptide identification to generate an initial list of targets for data.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,
Knowledge Enabled Information and Services Science What can SW do for HCLS today? Panel at HCSL Workshop, WWW2007 Amit Sheth Kno.e.sis Center Wright State.
Data Processing Algorithms for Analysis of High Resolution MSMS Spectra of Peptides with Complex Patterns of Posttranslational Modifications Shenheng Guan.
Sangtae Kim Ph.D. candidate University of California, San Diego
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Scaffold Download free viewer:
Improving Throughput for Highly Multiplexed Targeted Quantification Methods Using Novel API-Remote Instrument Control and State-Model Data Acquisition.
My contact details and information about submitting samples for MS
Overview Purpose: Accurately estimate peptide retention based on spectrum library data utilizing commonly observed peptides in place of synthetic standards.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Conclusion  Comprehensive workflow identified approximately 70% more high confident peptide as compare to general search strategy.  The comprehensive.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
The dynamic nature of the proteome
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
Improving Peptide Searching Workflow to Maximize Protein Identifications Shadab Ahmad 1, Amol Prakash 1, David Sarracino 1, Bryan Krastins 1, MingMing.
Integrated Targeted Quantitative Method for Insulin and its Therapeutic Analogs Eric Niederkofler 1, Dobrin Nedelkov 1, Urban Kiernan 1, David Phillips.
A Phospho-Peptide Spectrum Library for Improved Targeted Assays Barbara Frewen 1, Scott Peterman 1, John Sinclair 2, Claus Jorgensen 2, Amol Prakash 1,
Laxman Yetukuri T : Modeling of Proteomics Data
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
A new "Molecular Scanner" design for interfacing gel electrophoresis with MALDI-TOF ThP Stephen J. Hattan; Kenneth C. Parker; Marvin L. Vestal SimulTof.
Knowledge Enabled Information and Services Science Glycomics project overview.
C. Other Enzymes PCA1 PCA2 glycolytic HSPB2 CK Other Enzymes PCA1 PCA2 Other Enzymes PC1 glycolytic HSPB2 CK glycolytic HSPB2 CK Quantitation of Changes.
Finding a Needle in a Haystack: Using High Resolution Mass Spectrometry in Targeted and Non Targeted Searching for Food Contaminants Erik Verschuuren.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
June 9th, 2013 Matthew J. Rardin June 9th, 2013 Matthew J. Rardin MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent.
Improving the Detection of Hydrophilic Peptides for Increased Protein Sequence Coverage and Enhanced Proteomic Analyses Brian S. Hampton 1 and Amos H.
Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
© Copyright 2009 by the American Association for Clinical Chemistry Plasma Renin Activity by Liquid Chromatography– Tandem Mass Spectrometry (LC-MS/MS):
Isotope Labeled Internal Standards in Skyline
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
1 The world leader in serving science DIA: the Why, How, and When…Really…
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Data independent acquisition methods for metabolomics Stephen Tate, Ron Bonner AB SCIEX, 71 Four Valley Drive, Concord, ON, L4K 4V8 Canada A high resolution.
Finding the unexpected in SWATH™ Data Sets – Implications for Protein Quantification Ron Bonner; Stephen Tate; Adam Lau AB SCIEX, 71 Four Valley Drive,
Forensic Toxicology Use Only Analysis of ETG, ETS using the Thermo Scientific Exactive Mass Spectrometer Kent Johnson Fortes lab, Portland Oregon.
Target Analyses in Parallel Reaction Monitoring Mode (PRM)
Database Search Algorithm for Identification of Intact Cross-Links in Proteins and Peptides Using Tandem Mass Sepctrometry 신성호.
Table 1. Quality Parameters Being Considered for Evaluation
DIA: the Why, How, and When…Really…
MassMatrix Search Results Explained
Accelerating Research in Life Sciences
Pinpointing phosphorylation sites using Selected Reaction Monitoring and Skyline Christina Ludwig group of Ruedi Aebersold, ETH Zürich.
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
Now, More Than Ever, Proteomics Needs Better Chromatography
A perspective on proteomics in cell biology
Proteomics Informatics –
Shotgun Proteomics in Neuroscience
What Determines the Specificity and Outcomes of Ubiquitin Signaling?
Accelerating Research in Life Sciences
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

Targeted Quantification of O-Linked Glycosylation Site for Glycan Distribution Analysis Scott M. Peterman 1, Amol Prakash 1, Bryan Krastins 1, Mary Lopez 1, Julian Saba 2, Jennifer Cushing 3, Audra Ann Harget 3, and Matthew Renfrow 3 1 Thermo Fisher Scientific BRIMS, Cambridge, MA, 2 Thermo Fisher Scientific, San Jose, CA, University of Alabama Birmingham, Birmingham, AL Conclusion The targeted protein workflow presented facilitated detection, verification, and quantification of Apo CIII across biological samples. Targeting at the protein level provides significant advantages of evaluating unmodified peptides to provide landmarks for modified peptide confirmation.  Incorporation of MSIA enrichment increased sensitivity ca. 100-fold compared to whole serum digest analysis.  Unbiased HR/AM MS and data dependent/dynamic exclusion acquisition facilitates post-acquisition data processing workflow.  The Pinpoint Screening Tool generating a list of highly confident set of modified and unmodified targeted peptides from 100,000’s of sequences.  Pinpoint data processing incorporates multiple scoring levels significantly increasing confidence in the final relative quantification results. References 1.Maeda, H., Hashimoto, R. K., Ogura, T., Hiraga, S., Uzawa, H. J. of Lipid Research, 1987, 28, pp Zhao, P., Viner, R., Teo, C. F., Boons, G., Horn, D., Wells, L. J. Prot. Research, 2011, 10, Nedelkov, D., Kiernan, U. A., Niederkofler, E. E., Tubbs, K. A., Nelson, R. W. PNAS, 2005, 102(31), Acknowledgements We would like to thank Dr. MingMing Ning from Massachusetts General Hospital for providing the samples used in the experiment. Overview Purpose: Develop an automated workflow for identification and quantification of O- linked glycopeptides and corresponding glycoforms per targeted protein. Methods: Combine IP-MS using MSIA extraction with unbiased HR/ AM LC-MS and MS/MS data acquisition. Perform a novel iterative searching strategy based on retention time correlation of precursor and product ion accurate mass values creating a targeted list to perform qualitative and quantitative analysis across samples. Results: Incorporation of MSIA IP enrichment strategy resulted in ca fold increase in measured area under the curve (AUC) values per Apo CIII peptide compared to the measured AUC values extracted from whole serum digest. The increase in sensitivity facilitated the detection, characterization and quantification of key O-linked glycosylation region of Apo CIII and enabled direct comparison to the unmodified form of the peptides. 1 Introduction The detection of glycoproteins, characterization of various glycoforms and identification of the modified region of a protein can be challenging due to the complexity of the background, dilution of the precursor ion signal, and lack of common PTM mass shift. Our approach to increase the throughput for identification and quantification for O- linked glycopeptides and corresponding glycoforms is to perform targeted data extraction from protein/peptide sequences. Multilevel scoring attributes are implemented to automate data reduction resulting in a refined list of highly confident peptides. Incorporation of unbiased data acquisition of high resolution/accurate mass (HR/AM) MS and tandem MS dramatically increases the scoring routine. 2 The final report provides a score and integrated peak area across biological samples for modified and unmodified peptides attributed to the targeted protein(s). Methods Sample Preparation To perform initial testing, serum samples were collected from normal and stroke patients. Each sample was divided into two equal aliquots. One set of samples were reduced, alkylated, and digested with trypsin and used without further preparation. The second set of samples were aspirated into a MSIA D.A.R.T. tip (Thermo Fisher Scientific, Tempe, AZ) 3 covalently loaded with an anti-Apo CIII antibody. Following MSIA extraction, each sample was washed then reduced, alkylated, and digested using a similar protocol as that for the serum samples with the ratio of analyte:enzyme held constant. Prior to analysis, a constant amount of the PRTC kit (Thermo Fisher Scientific, Rockford, IL) was spiked as an internal standard. Liquid Chromatography LC separation was performed using a Hypersil Gold 100 x mm column with 3 µm particle size (Thermo Fisher Scientific) and a binary solvent system comprised of A) 0.1% formic acid and B) 0.1% formic aid in MeCN. A linear gradient of 5 – 45% B was performed over 40 minutes prior to column washing and re-equilibration. Mass Spectrometry All experiments were performed on a Q Exactive mass spectrometer (Thermo Scientific, Bremen, Germany) operated in data dependent/dynamic exclusion mode using a Top 10 acquisition scheme. Full scan MS spectra were acquired using a resolution setting of 70k and all HCD product ion spectra were acquired using 15k. All data was acquired using internal lock mass. Data Analysis Initial unbiased data searching was performed using Proteome Discoverer 1.4 to identify all proteins and corresponding peptides. The initial search strategy was performed with typical variable modifications of phosphorylation, oxidation, and Cys alkylation. The Apo CIII protein sequence and initial set of identified peptides from step 1 was transferred to prototype version of Pinpoint 1.4 to search for O-linked glycopeptides. The Screening Tool incorporates base peptide sequence information with individual glycan information to calculate the composite chemical formula facilitating HR/AM MS data extraction resulting in a list of putative glycopeptides and corresponding glycoforms. The list was used to perform higher level data extraction, verification/scoring, and relative quantitation using HR/AM MS data. Relative quantitation was compared between the glycopeptides and unmodified peptides originating from Apo CIII. Results In addition to MS data, product ion data was evaluated to provide additional confidence to the assigned glycopeptide sequence. Figure 6 shows an example of automated product ion determination used to distinguish each base peptide and glycan composition. Acquiring the product ion spectra in the orbitrap facilitated product ion charge state and accurate m/z value determination which significantly increased product ion assignment based on the peptide sequence and proposed glycan composition identified from the Screening Tool. The key fragments used to confirm the base peptide sequence was m/z 2137 for the peptide and m/z 2381 fragment for the peptides assigned as the base peptide sequence. The mass errors calculated for each fragment ion was less than 5 ppm. Due to the degree of sequence overlap attributed to the missed cleavage site, additional product ions were used to further confirm the sequence, specifically the b-type ions. Similar data analysis was completed for each of The incorporation of the PRTC kit into the mix helps to map the measured RT values to the expected times. Figure 3 shows the overlap of the PRTC peptides to those of the Apo CIII peptides. All of the tryptic peptides align with the PRTC peptides except peptides with missed cleavage sites. The goal is to incorporate additional scoring metrics to each peptide. In addition, the establishment of a known RT for unmodified This information is not intended to encourage use of these products in any manners that might infringe the intellectual property rights of others. FIGURE 1. Representation of the comparative sample preparation workflow to evaluate O-linked glycopeptide detection and quantification. A common stock of serum sample from two different patient types was divided into two equal aliquots. The only difference was the introduction of an automated IP extraction using MSIA D.A.R.T. tips loaded with anti-Apo CIII antibody. The histogram for an Apo CIII tryptic peptide is presented comparing the measured AUC values with and without MSIA extraction prior to digestion and LC-MS analysis. FIGURE 3. Plot of measured RT values as a function of HF values for the PRTC peptides and the targeted Apo CIII peptides from Table 1. FIGURE 4. MS-level data analysis of the O-linked glycopeptide DKFSEFWDLDPEVRPTSAVAA[GalNAc1Gal1NeuAc1]. Figure 5a shows the overlaid XIC trace for the 6 isotopes of the +3 charge state. Figure 5b shows comparative isotopic distribution analysis across each sample – MSIA extracted as well as serum digests and the corresponding histogram comparing the integrated peak areas across each sample. FIGURE 7. Comparative distribution of unmodified and O-linked glycopeptides from Apo CIII. Integrated peak areas from HR/AM MS data was used for the comparison of glycoform distribution as well as the relative amounts across each sample. PositionTargeted PeptideHF%CV RT (min)AUC SerumAUC MSIA AUC Ratio Dot- Product ControlDiseaseControlDisease 1-17 SEAEDASLLSFM[Oxid]Q[Dea mid]GYMKHATK E+03N/A1.6E+079.2E SEAEDASLLSFMQGYMK E+061.0E+053.1E SEAEDASLLSFM[Oxid]QGYM K E+05N/A1.2E+091.3E SEAEDASLLSFM[Oxid]QGYM[ Oxid]K E+052.7E+054.5E+085.3E HATKTAK E+031.3E+043.5E+062.9E TAKDALSSVQESQVAQQAR N/A 1.4E+071.8E DALSSVQESQVAQQAR E+072.7E+071.1E GWVTDGFSSLK E+077.0E+066.9E+096.7E GWVTDGFSSLKDYWSTVK E+041.9E+052.0E DYWSTVK E+071.1E+076.1E+095.6E DKFSEFWDLDPEVRPTSAVAA E+058.0E+041.7E+082.5E FSEFWDLDPEVRPTSAVAA N/A 6.5E+079.1E Serum Samples - Control and Disease MSIA Extraction Reduction, Alkylation, and Digestion Spike PRTC Kit HR/AM LC-MS and MS/MS Analysis Whole Serum 41 GWVTDGFSSLK 51 ~100x MonoA+1A+2A+3A+4 A+5 MSIA Control MSIA Disease Serum Disease Serum Control Theoretical FIGURE 5. Comparative plots of the dot-product correlation coefficients as a function of measured retention times for the identified glycoforms for DKFSEFWDLDPEVRPTSAVAA and FSEFWDLDPEVRPTSAVAA. FIGURE 2. Screen capture showing the Pinpoint Screening Tool used to identify putative modified and unmodified peptide sequences from a RAW file. The Screening Tool utilizes in silico sequence generation to create a list of unmodified peptides. The list of unmodified peptides are also subjected to all possible glycan additions (N- and/or O-linked modifications) to create a master list of peptides and m/z values used to extract data. The resulting table lists the peptide sequences and modifications as well as all corresponding LC and MS information such as retention times, precursor charge states, mass errors, and protein sequence location. The results from the screening tool are directly exported to the Pinpoint Main Workbook integrated analysis. Table 1. List of peptides attributed to the tryptic digestion of Apo CIII with and without standard modifications. The hydrophobicity factors (HF) were calculated using Krokhin’s SSRCalc algorithm and dot product correlation coefficient was calculated based on precursor isotopic distribution overlap of experimental to theoretical values. AUC values were determined based on the MS peak profile. The information generated in the screening provides significant data reduction to eliminate peptide candidates in the unmodified as well as modified forms. Even for a small protein such as Apo CIII which has only 79 residues (omitting the first 20 N-terminal residues identified as the signaling chain) the use of common in-silico processing parameters (common PTMs and 1 missed cleavage) generates over 8,000 possible sequences. The Screening Tool reduces this list down to 16 peptides in less than one minute. Enabling the possibility of O-linked glycopeptides increases the number of possible sequences to over 100,000 and provides a list of 92 putative peptides. The list is further reduced based on the presence of additional retention time and MS information to a final list that is further evaluated, scored, and quantified in an automated routine. Table 1 lists the identified set of peptides unmodified or modified with non O-linked glycans. Each peptide has an MS/MS spectrum acquired under the elution peak profile. The AUC ratios for the list of peptides are close to 1:1 except for those peptides with missed cleavage sites. Three of the peptides covering residues 22-40, 59-79, and show greater response for the “disease” sample compared to the “normal”. Each of the missed cleavage peptides also show greater variance than the tryptic peptides peptides provides landmarks for the O-linked glycosylated forms as they are expected to elute under similar times. In addition to the peptides listed in Table 1, O-linked glycopeptides were also identified for the peptides covering residues 22-40, 59-79, and An example of the MS-level data extraction for the peptide modified with GalNAc 1 Gal 1 NeuAc 1 is shown in Figure 4. The isotopic distribution profiles for the MSIA extracted samples match the theoretical distribution with correlation coefficients >0.98. The isotopic overlap for the serum digest samples shows little response. Figure 5c shows the relative AUC values for the Control vs. Disease samples. The AUC ratio for the O-linked glycopeptide 1.3 matches that for the unmodified peptide. Three additional O-linked glycoforms were identified for the a. 5b. 5c. peptide and 5 O-linked glycoforms were identified for the fully tryptic peptide Figure 5 shows the results of the initial screening based on HR/AM MS data and retention time correlation. The group of O-linked glycopeptides and corresponding glycoforms should elute in proximity (<20%) of the unmodified peptide when using a C18-based column. The isotopic distribution analysis was also calculated for each glycoform. The goodness of fit for the relative AUC values per isotope are directly dependent on the mass accuracy of the predicted chemical composition of the peptide and glycan as well as the overall measured intensities. FIGURE 6. Comparative product ion spectra for the O-linked glycopeptides FSEFWDLDPEVRPTSAVAA and DKFSEFWDLDPEVRPTSAVAA with the same glycan modification. Both HR/AM HCD spectra were acquired under the precursor elution profiles. each glycoform. The results for the O-glycoform distribution are presented in Figure 7. The unmodified forms of the peptide (with and without missed cleavage site) showed a lower relative response compared to the modified forms. Incorporation HR/AM MS HCD data acquisition facilitates peptide sequence determination and glycan composition, and for those O-linked glycopeptides modified at only one residue, site determination. For glycoforms modified at multiple sites, specific site determination generally requires ETD product ion data collection. The workflow presented here can automatically create a secondary experimental method for targeted ETD data acquisition.