N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,

Slides:



Advertisements
Similar presentations
Tandem MS (MS/MS) on the Q-ToF2
Advertisements

Proteomics Informatics – Protein characterization I: post-translational modifications (Week 10)
Analysis of human haptoglobin, digest with trypsin and Glu-C – six putative N-motif peptides. Glycopeptide separation by hydrophilic interaction liquid.
Proteomics and Glycoproteomics (Bio-)Informatics of Protein Isoforms Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown.
UC Mass Spectrometry Facility & Protein Characterization for Proteomics Core Proteomics Capabilities: Examples of Protein ID and Analysis of Modified Proteins.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Proteomics Informatics – Protein identification III: de novo sequencing (Week 6)
PepArML: A model-free, result-combining peptide identification arbiter via machine learning Xue Wu, Chau-Wen Tseng, Nathan Edwards University of Maryland,
De Novo Sequencing v.s. Database Search Bin Ma School of Computer Science University of Waterloo Ontario, Canada.
Bin Ma, CTO Bioinformatics Solutions Inc. June 5, 2011.
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
20-30% of a trypsinised proteome are constituted of peptides with Mw≥3000 (TReP) Identification of large peptides by shotgun MS is not efficient Isolation.
ProReP - Protein Results Parser v3.0©
Basics of 2-DE and MALDI-ToF MS
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
De Novo Sequencing of MS Spectra
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
Facts and Fallacies about de Novo Sequencing & Database Search.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Mueller LN, Brusniak MY, Mani DR, Aebersold R
Karl Clauser Proteomics and Biomarker Discovery Taming Errors for Peptides with Post-Translational Modifications Bioinformatics for MS Interest Group ASMS.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
Towards the Management of Information Quality in Proteomics David Stead University of Aberdeen.
Analysis of human haptoglobin, after digest with trypsin and Glu-C – six putative N-linked motif peptides. Glycopeptide separation by hydrophilic interaction.
Dinosaur Proteomics. 2 Claims Proteins can be extracted from fossilized bones Extracted proteins can be analyzed by LC-MS/MS MS/MS can be matched to.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
Laxman Yetukuri T : Modeling of Proteomics Data
Protein bioinformatics and systems biology Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical Center.
Knowledge Enabled Information and Services Science Glycomics project overview.
Poster produced by Faculty & Curriculum Support (FACS), Georgetown University Medical Center Application of meta-search, grid-computing, and machine-learning.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Glycoprotein Microheterogeneity via N-Glycopeptide Identification Kevin Brown Chandler, Petr Pompach, Radoslav Goldman, Nathan Edwards Georgetown University.
Peptide Identification via Tandem Mass Spectrometry Sorin Istrail.
Overview of Mass Spectrometry
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
By Jay Krishnan. Introduction Information gathered from Proteomic techniques + neuroscientific research = Information on protein composition and function.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Poster produced by Faculty & Curriculum Support (FACS), Georgetown University Medical Center Application of meta-search, grid-computing, and machine-learning.
The observed and theoretical peptide sequence information Cal.MassObserved. Mass ±da±ppmStart Sequence EndSequenceIon Score C.I%modification FLPVNEK.
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Application of meta-search, grid-computing, and machine-learning can significantly improve the sensitivity of peptide identification. The PepArML meta-search.
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Top-down characterization of proteins in bacteria with unsequenced genomes Colin Wynne Catherine Fenselau University of Maryland, College Park Nathan Edwards.
ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Constructing high resolution consensus spectra for a peptide library
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
10/30/2013BCHB Edwards Project/Review BCHB Lecture 17.
MassMatrix Search Results Explained
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra I
The Haystack Is Full of Needles: Technology Rescues Sugars!
Proteomics Informatics –
Protein Identification Using Tandem Mass Spectrometry
Bioinformatics for Proteomics
Shotgun Proteomics in Neuroscience
Accelerating Research in Life Sciences
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach, Radoslav Goldman, Nathan J. Edwards Georgetown University, Department of Biochemistry and Molecular & Cellular Biology, Washington, DC Over half of all proteins are glycosylated (this rate is higher for secreted, cell-surface and extracellular matrix proteins) Glycosylation mediates cell-cell & cell-matrix interactions N-linked glycosylation is enzyme-directed, occurs on Asn residues within the motif NXS/T (X ≠ Pro) Tandem mass spectrometry (MS/MS) is used to study protein glycosylation; however, there are few software tools to aid in processing and interpretation of large glycopeptide MS/MS datasets, and manual interpretation of datasets is time consuming Introduction and Background GlycoPeptideSearch Software – Glycopeptide Discovery Workflow 1. Ranzinger, Herget, von der Lieth, Frank. Nucleic Acids Res. 39(Database issue):D (2011). 2. Fujimura, Shinohara, Tossot, Pang, Kurogochi, Saito, Arai, Sadilek, Murayama, Dell, Nishimura, Hakomori. Int. J. Cancer 122:39–49 (2008). 3. Goldberg, Sutton-Smith, Paulson, Dell. Proteomics 5: (2005). 4. Pompach, Chandler, Lan, Edwards, Goldman. J.Proteome Res. 11 (3); (2012). References Acknowledgements Kevin B Chandler is supported by a Graduate Research Fellowship from the National Science Foundation. Nathan J Edwards is supported, in part, by NIH/NCI/CPTI grant CA Rado slav Goldman is supported by NCI’s R01 CA and CA GlycoPeptideSearch Scheme MS/MS Spectra w/ glycan oxonium ion (204, 366) peaks w/ 2+ “peptide” peaks GlycomeDB Distinct Glycopeptides x LC-MS/MS Proteolytic digest of Haptoglobin with trypsin and GluC Hydrophilic interaction liquid chromatography (HILIC) of glycopeptides Eleven glycopeptide fractions analyzed by nanoC18 RP LC-MS/MS using a Q-STAR Elite mass-spectrometer. IDA: Four most abundant ions with 20 sec exclusion. 15,780 MS and 3,288 MS/MS spectra (msconvert) Test Dataset (Haptoglobin Glycopeptides) Automated in silico digestion (Trypsin, GluC) of user-submitted protein sequence & N-glycosylation site ID Fixed (carbamidomethylation) & variable modifications (Methionine oxidation) considered Spectra filtered for glycan oxonium ions & peptide + N-linked core fragments & mass-lookup in GlycomeDB (glycan database) Isotope Cluster Scoring performed on precursor ion Decoy (non-motif containing) peptides submitted to search to enable estimation of the False Discovery Rate Open format (XML) spectra input and Excel output. Methods: GlycoPeptideSearch Software We hypothesize that N-glycopeptide MS/MS data interpretation can be automated by adopting an algorithm that uses (1) oxonium marker ions and intact peptide peak filters, (2) the sequence(s) of the protein(s) of interest and (3) mass-matching to publicly available glycan databases to match glycan-peptide pairs to glycopeptide MS/MS spectra. The aim of this research is to develop a novel software tool for rapid interpretation of glycopeptide MS/MS datasets to facilitate the study of glycoprotein microheterogeneity Hypothesis and Aims Controlling Error Using Spectra Filters and Hit Filters FDR Determination Decoy Peptides (without motif) GPS Target and Decoy Spectra Hits FDR Target Peptides (with NXS/T) 52 glycan-peptide pairs matched 263 spectra (3.9% FDR). 52% (136) of filtered spectra matched a single glycopeptide pair ( 1 peptide. 27 distinct non-isobaric glycans at 4 sites were discovered, consistent with published reports 2. Conclusion: Using characteristics of glycopeptide spectra including oxonium ions and intact peptide peaks, it is possible to automate glycopeptide CID MS/MS data interpretation with low false discovery. Results and Conclusion Summary of Haptoglobin Glycopeptides Sample MS/MS Spectrum: Fraction 17, Scan 1407