Biomarker discovery by automatic annotation of N-glycan species in MALDI-TOF-TOF spectra Chuan-Yih, Yu 2010-4-8 Capstone Advisor: Prof. Haixu Tang.

Introduction
Post-translational modification (PTM)
– Nitrosylation
– Phosphorylation
– Glycosylation
50% of all eukaryotic proteins are glycosylated [1]
1. Apweiler, R., H. Hermjakob, and N. Sharon, On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim Biophys Acta, (1): p. 4-8

Glycoprotein
Protein glycosylation
– N-linked glycosylation
  Core structure: 2 GlcNAc + 3 Man
  Attached at Asn-X-Ser or Asn-X-Thr, where X can be any residue except Pro
  Glycosylation occurs before folding
– O-linked glycosylation
  Core structures
  Attached at serine or threonine
  Glycosylation occurs after folding
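The N-linked sequon rule above (Asn-X-Ser or Asn-X-Thr, X not Pro) can be sketched as a short pattern search. This is only an illustration: the function name `find_sequons` and the example sequence are made up here, and a matching sequon marks a potential site, not a confirmed glycosylation.

```python
import re

# Asn followed by any residue except Pro, then Ser or Thr.
# The lookahead keeps the match one residue wide, so overlapping sequons are all found.
SEQUON = re.compile(r"N(?=[^P][ST])")

def find_sequons(protein):
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon (X != Pro)."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]

if __name__ == "__main__":
    # Hypothetical sequence: N2 (N-G-S), N9 (N-N-T) and N10 (N-T-S) match; N6 (N-P-T) does not.
    print(find_sequons("MNGSANPTNNTS"))
```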

Monosaccharides
Building blocks
Diverse linkage
Three types of N-linked glycan
– High mannose
– Complex
– Hybrid
412 combinations -> 7,000 structures [1]

Name             Molecular formula
Mannose (Man)    C6H12O6
Galactose (Gal)  C6H12O6
Fucose (Fuc)     C6H12O5
GlcNAc           C8H15NO6
NeuNAc           C11H19NO9
NeuNGc           C11H19NO10

Graphs: Varki, A., Essentials of glycobiology. 2nd ed. 2009, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press. xxix, 784 p
1. Krambeck, F.J. and M.J. Betenbaugh, A mathematical model of N-linked glycosylation. Biotechnol Bioeng, (6): p
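As a worked example of how the formulas in the table turn into a mass, the sketch below computes the monoisotopic mass of a glycan composition. It assumes a native (not permethylated), neutral glycan in which each glycosidic bond releases one water molecule; the names `FORMULAS`, `formula_mass` and `glycan_mass` are illustrative and not taken from the presented software.

```python
# Monoisotopic atomic masses in daltons.
MONO_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}

# Elemental formulas of the free monosaccharides, as listed in the table.
FORMULAS = {
    "Man": {"C": 6, "H": 12, "O": 6},
    "Gal": {"C": 6, "H": 12, "O": 6},
    "Fuc": {"C": 6, "H": 12, "O": 5},
    "GlcNAc": {"C": 8, "H": 15, "N": 1, "O": 6},
    "NeuNAc": {"C": 11, "H": 19, "N": 1, "O": 9},
    "NeuNGc": {"C": 11, "H": 19, "N": 1, "O": 10},
}

WATER = 2 * MONO_MASS["H"] + MONO_MASS["O"]

def formula_mass(formula):
    """Monoisotopic mass of an elemental formula given as {element: count}."""
    return sum(MONO_MASS[element] * count for element, count in formula.items())

def glycan_mass(composition):
    """Mass of a native glycan given as {monosaccharide: count}.

    Assumption: n residues are joined by n - 1 glycosidic bonds,
    and each bond loses one water molecule.
    """
    residues = sum(composition.values())
    if residues == 0:
        raise ValueError("composition is empty")
    total = sum(formula_mass(FORMULAS[name]) * count
                for name, count in composition.items())
    return total - (residues - 1) * WATER

if __name__ == "__main__":
    # N-glycan core plus two mannoses: 2 GlcNAc + 5 Man.
    print(round(glycan_mass({"GlcNAc": 2, "Man": 5}), 4))
```

Permethylation and the sodium adduct mentioned later in the slides shift these masses and are deliberately left out of this sketch.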

Mass Spectrometry
A weighing scale for molecules
Ion source
– Electrospray ionization (ESI)
– Matrix-assisted laser desorption/ionization (MALDI)
Mass analyzer
– Time of flight (TOF)
– Quadrupole
– Fourier transform mass spectrometry (FTMS)
Detector
– Measures the induced charge or the current produced

MALDI-TOF-TOF Graph:MALDI-TOF Mass Analysis. (2008, 11 16). Retrieved May 2, 2009, from The Protein Facility of the Iowa State University Office of Biotechnology

Problem
Isotope pattern overlap
– Permethylated, sodium added: 2 GlcNac + 9 Man = 2, GlcNac + 3 Man = 2,
High-throughput glycan screening
– Find significant differences between groups of samples
Graphs: Isotope Pattern Calculator v

Major Features
Glycan profile correlation
– Report scores for non-overlapping and overlapping profiles
– Glycan examination
Glycan profile comparison
– Report significant glycans between groups
– Glycan biomarker discovery

Glycan Profile Correlation
For each glycan combination
– 412 different glycan combinations
– Generate a theoretical isotope pattern (Mercury algorithm [1])
– Calculate the correlation for the following cases
  Glycan alone
  Glycan + glycan, linear combination applied
  Glycan + unknown, linear combination applied
1. Rockwood, A., S. Van Orden, and R. Smith, Rapid Calculation of Isotope Distributions. Analytical Chemistry, : p
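The correlation steps above can be sketched roughly as follows. Note the substitutions: the theoretical pattern is built by direct convolution of per-element isotope abundances at nominal-mass resolution, not by the Mercury algorithm the slide cites, and the two-glycan case is fitted by ordinary least squares for the weights α and β. All function names, the native elemental formula in the demo, and the 2 Da offset are assumptions made for illustration.

```python
from math import sqrt

# Natural isotope abundances indexed by nominal mass offset (+0, +1, +2).
ABUNDANCE = {
    "C": [0.9893, 0.0107],
    "H": [0.999885, 0.000115],
    "N": [0.99636, 0.00364],
    "O": [0.99757, 0.00038, 0.00205],
}

def convolve(a, b, width):
    """Convolve two intensity lists, keeping only the first `width` peaks."""
    out = [0.0] * min(width, len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            if i + j < len(out):
                out[i + j] += x * y
    return out

def isotope_pattern(formula, width=8):
    """Relative intensities of the M, M+1, ... peaks for {element: count}."""
    pattern = [1.0]
    for element, count in formula.items():
        base, power, n = ABUNDANCE[element], [1.0], count
        while n:  # exponentiation by squaring: power = base convolved `count` times
            if n & 1:
                power = convolve(power, base, width)
            base = convolve(base, base, width)
            n >>= 1
        pattern = convolve(pattern, power, width)
    total = sum(pattern)
    return [p / total for p in pattern]

def place(pattern, offset, length):
    """Put a pattern into a zero vector of `length`, starting at `offset`."""
    out = [0.0] * length
    for i, p in enumerate(pattern):
        if offset + i < length:
            out[offset + i] = p
    return out

def pearson(x, y):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy) if sxx and syy else 0.0

def fit_two(observed, a, b):
    """Least-squares weights (alpha, beta) so that alpha*a + beta*b approximates observed."""
    aa = sum(x * x for x in a)
    bb = sum(y * y for y in b)
    ab = sum(x * y for x, y in zip(a, b))
    ao = sum(x * o for x, o in zip(a, observed))
    bo = sum(y * o for y, o in zip(b, observed))
    det = aa * bb - ab * ab
    if det == 0:
        raise ValueError("patterns are linearly dependent")
    return (ao * bb - bo * ab) / det, (bo * aa - ao * ab) / det

if __name__ == "__main__":
    # Native 2 GlcNAc + 5 Man (C46H78N2O36), and the same pattern shifted by 2 Da.
    p = isotope_pattern({"C": 46, "H": 78, "N": 2, "O": 36})
    a, b = place(p, 0, 12), place(p, 2, 12)
    observed = [2.0 * x + 0.5 * y for x, y in zip(a, b)]  # synthetic overlap
    alpha, beta = fit_two(observed, a, b)
    fitted = [alpha * x + beta * y for x, y in zip(a, b)]
    print(alpha, beta, pearson(observed, fitted))
```

The glycan-plus-unknown case is not covered here, since the slides do not say how the unknown component is modelled.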

Three Cases
[Figure: experimental spectrum compared with glycan patterns weighted by α and β, an unknown component, and the resulting score]

Glycan Profiling Comparison
Multiple spectra comparison
Biomarker discovery
– Given spectra from several conditions
– Find distinct glycans between samples
Graph: Ressom, H.W., et al., Analysis of MALDI-TOF mass spectrometry data for discovery of peptide and glycan biomarkers of hepatocellular carcinoma. J Proteome Res, (2): p
HCC: hepatocellular carcinoma (liver cancer)
CLD: chronic liver disease

Concept
Healthy spectra (H1, H2, H3, …, Hk)
Disease spectra (D1, D2, D3, …, Dk)
Remove the least significant component.
Repeat until all scores are above the threshold.
1. Hastie, T., et al., 'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns. Genome Biol, (2): p. RESEARCH0003
% identical with a cutoff at 0.5
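One possible reading of the remove-and-repeat loop above is sketched below. The slide does not define the score, so everything here is an assumption: each glycan's abundance profile across the healthy and disease spectra is standardised, the score is its Pearson correlation with the mean of the profiles still retained, and the lowest-scoring glycan is dropped until every score reaches the threshold. This is a much simplified stand-in for gene shaving, which uses a leading principal component. The names `shave` and `profiles` and the demo numbers are hypothetical.

```python
from math import sqrt

def zscore(values):
    """Standardise to zero mean and unit variance; constant input maps to zeros."""
    mean = sum(values) / len(values)
    sd = sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return [(v - mean) / sd if sd else 0.0 for v in values]

def pearson(x, y):
    """Pearson correlation computed from standardised values."""
    zx, zy = zscore(x), zscore(y)
    return sum(a * b for a, b in zip(zx, zy)) / len(x)

def shave(profiles, threshold=0.5):
    """Iteratively drop the glycan least correlated with the consensus profile.

    profiles: {glycan name: abundances across all spectra, e.g. H1..Hk then D1..Dk}
    Returns {glycan name: score} for the glycans that remain.
    """
    if not profiles:
        raise ValueError("no profiles given")
    kept = {name: zscore(values) for name, values in profiles.items()}
    while True:
        # Consensus = per-spectrum mean of the standardised profiles still kept.
        consensus = [sum(column) / len(kept) for column in zip(*kept.values())]
        scores = {name: pearson(z, consensus) for name, z in kept.items()}
        worst = min(scores, key=scores.get)
        if scores[worst] >= threshold or len(kept) == 1:
            return scores
        del kept[worst]  # remove the least significant component and repeat

if __name__ == "__main__":
    # Made-up abundances: three healthy spectra followed by three disease spectra.
    demo = {
        "A": [1.0, 1.2, 0.9, 3.0, 3.1, 2.8],
        "B": [2.0, 2.1, 1.9, 4.2, 4.0, 4.1],
        "C": [0.5, 0.6, 0.4, 1.5, 1.6, 1.4],
        "X": [1.0, 3.0, 2.0, 2.0, 1.0, 3.0],  # unrelated to the grouping
    }
    print(sorted(shave(demo, threshold=0.5)))
```

Because the score is a signed correlation with a mean profile, glycans that fall in disease while others rise would partly cancel in the consensus; a real implementation would need to handle that case.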

Multi N-Glycan Software
Requirements
– .NET Framework 2.0 using C#
– C++ runtime
– R
– Thermo Scientific Xcalibur
Input
– Spectrum
  Plain text (peak list)
  mzXML [1]
  RAW (instrument raw file)
– Glycan list
  CSV file
1. Pedrioli, P., et al., A Common Open Representation of Mass Spectrometry Data and its Application in a Proteomics Research Environment. Nature Biotechnology, (11): p

Software Interface

HTML result export
Biomarker discovery settings

Result
Filtered out: glycan structures that cannot be found in the CFG database

Result

Future Work
– Test on more clinical samples
– Verify the correlation between glycan modification and disease
– Perform these tasks on O-linked glycans

References
Apweiler, R., H. Hermjakob, and N. Sharon, On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim Biophys Acta, (1): p. 4-8.
Hastie, T., et al., 'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns. Genome Biol, (2): p. RESEARCH0003.
Krambeck, F.J. and M.J. Betenbaugh, A mathematical model of N-linked glycosylation. Biotechnol Bioeng, (6): p
Pedrioli, P., et al., A Common Open Representation of Mass Spectrometry Data and its Application in a Proteomics Research Environment. Nature Biotechnology, (11): p
Ressom, H.W., et al., Analysis of MALDI-TOF mass spectrometry data for discovery of peptide and glycan biomarkers of hepatocellular carcinoma. J Proteome Res, (2): p
Rockwood, A., S. Van Orden, and R. Smith, Rapid Calculation of Isotope Distributions. Analytical Chemistry, : p
Tang, Z., et al., Identification of N-glycan serum markers associated with hepatocellular carcinoma from mass spectrometry data. J Proteome Res, (1): p
Varki, A., Essentials of glycobiology. 2nd ed. 2009, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press. xxix, 784 p.

Acknowledgements
Advisor: Prof. Haixu Tang
Co-worker: Anoop Mayampurath
Collaborator: Yehia Mechref, Department of Chemistry
This work will be presented on May 26 at the 58th ASMS Conference in Salt Lake City, Utah, and submitted to Bioinformatics as an Application Note.

Thank You