Computational Methods for Biomarker Discovery in Proteomics and Glycomics Vijetha Vemulapalli School of Informatics Indiana University Capstone Advisor:

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
Improvements in Mass Spectrometry for Life Science Research – Does Agilent Have the Answer? Ashley Sage PhD.
Mass Spectrometry Kyle Chau and Andrew Gioe. Computation of Molecular Mass -Mass Spectrum is a plot of intensity as a function of mass- charge ratio,
Lipids Analytical Tool (LipidAT): automated analysis of lipidomic mass spectrometry data Jun Ma Advisor: Dr. Haixu Tang Co-Advisor: Dr. David Wild Co-Advisor.
A Multi-PCA Approach to Glycan Biomarker Discovery using Mass Spectrometry Profile Data Anoop Mayampurath, Chuan-Yih Yu Info-690 (Glycoinformatics) Final.
HPLC Coupled with Quadrupole Mass Spectrometry and Forensic Analysis of Cocaine.
Bioinformatic Treatment of Human Metabolome Profile for Diagnostics Dr. Petr Lokhov & Dr. Alexander Archakov Institute of Biomedical Chemistry, RAMS.
Lecture 8. GC/MS.
In carbon-13 NMR, what do the number of peaks represent?
Metabolomics DNA RNA Protein Biochemicals (Metabolites) Genomics – 25,000 Genes Transcriptomics – 100,000 Transcripts Metabolomics – 2,800 Compounds Proteomics.
Microarrays: Tools for Proteomics
ProReP - Protein Results Parser v3.0©
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Previous Lecture: Regression and Correlation
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
A Neural Network Predictor for Peptide Fragmentation in Mass Spectrometry Arunima Ram Advisor : Dr. Predrag Radivojac Co-Advisor : Dr. Haixu Tang Co-Advisor.
Proteomics Josh Leung Biology 1220 April 13 th, 2010.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics Workshop Part III: Protein Quantitation
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Proteomics Informatics Workshop Part II: Protein Characterization David Fenyö February 18, 2011 Top-down/bottom-up proteomics Post-translational modifications.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Chemical Ideas 7.6 Chromatography. The general principle. Use – to separate and identify components of mixtures. Several different types - paper, thin.
Current Proteome Profiling Methods and Applications Yetrib Hathout, Ph.D. Center for Genetic Medicine Children’s National Medical Center & The George Washington.
2007 GeneSpring MS GeneSpring for Metabolite BioMarker Analysis using Mass Spectrometry data Agilent Q-TOF VIP Visit Jan 16-17, 2007 Santa Clara, CA Thon.
© 2010 SRI International - Company Confidential and Proprietary Information Quantitative Proteomics: Approaches and Current Capabilities Pathway Tools.
BIOMARKERS Diagnostics and Prognostics. OMICS Molecular Diagnostics: Promises and Possibilities, p. 12 and 26.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Laxman Yetukuri T : Modeling of Proteomics Data
Automatic Analysis of Ion Mobility Spectrometry – Mass Spectrometry (IMS-MS) Data Hyejin Yoon School of Informatics Indiana University Bloomington December.
Quantification of Membrane and Membrane- Bound Proteins in Normal and Malignant Breast Cancer Cells Isolated from the Same Patient with Primary Breast.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
High throughput Protein Measurement Techniques Harin Kanani.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Methods of Chemical Analysis. Selecting an Analytical Technique  Organic vs. Inorganic materials  Organic:  Inorganic:  Quantitative vs. Qualitative.
Central dogma: the story of life RNA DNA Protein.
June 9th, 2013 Matthew J. Rardin June 9th, 2013 Matthew J. Rardin MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent.
Innovative Paths to Better Medicines Design Considerations in Molecular Biomarker Discovery Studies Doris Damian and Robert McBurney June 6, 2007.
PRINCIPLE, APPLICATIONS & ADVANCES IN
1 I. Introduction 1.Definition: Protein Characterization/Proteomics i.Classical Proteomics ii.Functional Proteomics 2.Mass spectrometery I.Advantages in.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
What is proteomics? Richard Mbasu and Ben Richards.
Objective  To develop methods for analysis of compounds in organic aerosol particles Why is this important?  Environmental impact  Alternative fuels.
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Novel Proteomics Techniques
Target Analyses in Parallel Reaction Monitoring Mode (PRM)
Chemical Ideas 7.6 Chromatography.
Bioinformatics Solutions Inc.
General Overview of the module and the methods
V. Protein Chips 1. What is Protein Chips 2. How to Make Protein Chips
Proteomics Informatics David Fenyő
Now, More Than Ever, Proteomics Needs Better Chromatography
A perspective on proteomics in cell biology
Nat. Rev. Nephrol. doi: /nrneph
NoDupe algorithm to detect and group similar mass spectra.
Diagnostics and Prognostics
Pierre P. Massion, MD, Richard M. Caprioli, PhD 
Is Proteomics the New Genomics?
Shotgun Proteomics in Neuroscience
Proteomics Informatics David Fenyő
Kuen-Pin Wu Institute of Information Science Academia Sinica
Mass spectrometry (MS) is an analytical technique that can be used to determine the mass, elemental composition or chemical structure of molecules. Mass.
Presentation transcript:

Computational Methods for Biomarker Discovery in Proteomics and Glycomics Vijetha Vemulapalli School of Informatics Indiana University Capstone Advisor: Dr. Haixu Tang

What are Biomarkers?  Substances present in increased or decreased amounts in body fluids or tissues that indicate exposure, disease or susceptibility to disease. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Some Uses of Biomarkers Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References  Biomarkers are increasingly being used for the following purposes: Prognosis / Diagnosis of disease Monitoring response to medication  With high sensitivity and throughput, proteomics and glycomics is capable of identifying many potential biomarkers simultaneously.

More on Biomarkers Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References  A lot of times biomarkers have not been identified clearly. But based on the signature pattern of glycans and proteins, samples can be classified as healthy and diseased.

What is Proteomics?  Proteomics: Proteomics is the study of proteins and proteomes using high- throughput technology.  Proteome: All the proteins in a cell or bodily fluid at a given point of time under certain conditions.  Proteins: A chain of amino acids including hormones, enzymes and antibodies. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

What is Glycomics?  Glycoproteins: Proteins with attached polysaccharides.  Glycans: Polysaccharide chain attached to a protein  Glycome: The entire set of glycans that are present in a cell or a bodily fluid at a certain point of time under certain conditions.  Glycomics: Study of structure and function of oligosaccharides in a cell or organism. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

High Throughput Technologies to Identify Biomarkers Genome Scale Scanning Genome level Micro - arrays Transcriptome level Proteomics Proteome level Glycomics Glycome level Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Transcriptome Why the Focus on Proteomics and Glycomics?  Information content Genome Transcriptome Proteome Glycome Static Dynamic Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Biomarker Discovery using Proteomics

Liquid Chromatography / Mass Spectrometry (LC/MS)  Why LC/MS for analysis of proteomes? LC spreads complexity of the sample over time. MS identifies ions based on their mass/charge value.  Software exists currently to identify proteins in a sample using data from a LC-MS experiment. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Liquid Chromatography Mass Spectrometry Data Protein sample

Liquid Chromatography (LC)  Liquid Chromatography is a technique that separates ions or molecules dissolved in a solvent based on size of the ion/molecule, adsorption, ion-exchange or other similar characteristics. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

What is Mass Spectrometer?  Mass Spectrometry (MS) is an instrument that identifies ions based on their mass-to-charge ratio. Source: & Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Visualization of LC/MS Data : 2D Map Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

How Do We Find Biomarkers From LC-MS Data? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References MS View Quantities of peptides identified from the sample Liquid Chromatography Mass Spectrometry Data Protein sample Identification software Identified Proteins and Peptides

How Do We Find Biomarkers From LC-MS Data? Continued… Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Sample 1 Sample 2 Sample 3 Sample N MSView Quantification 1 Quantification 2 Quantification 3 Quantification N Analyze to find Biomarkers

MSView Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References MSView Visualization Relative Quantification Components Purpose Visual comparison /Analysis Further analysis for Biomarker Discovery

Extracted Ion Chromatogram (XIC)  Chromatogram created by plotting the intensity of the signal observed at a chosen m/z value in a series of mass spectra recorded as a function of retention time. Source: Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Visualization: XIC Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Relative Quantification using Peptide Identification Results Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Data from LC-MS experiment Identification of peptides Extracted Ion Chromatogram of peptide Peak selection Area calculation MS View

Actual data: Quantification: Peak Selection Algorithm Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References After Smoothing: Minima Maxima Selecting local maxima and minima Selecting peaks: Minima Maxima

Quantification: Sample Results Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Biomarker Discovery using Glycomics

How does Capillary Electrophoresis (CE) work? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References 84&sz=25&hl=en&start=3&um=1&tbnid=_JDf4X3dJn170M:&tbnh=108&tbnw=139&prev=/images%3Fq%3Dcapillary%2Belectrophoresis%26svnum%3D10%26um%3D1%26hl%3Den

What does the data look like? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Samples from different CE experiments:

Biomarker Discovery using Glycomics – Overview Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Data from different samples Analysis of quantification for identifying Biomarkers Quantification of mapped peaks Mapping areas corresponding the same glycan from different samples CE Analyze

Direct Comparison: Dynamic Time Warping (DTW)  DTW algorithm aligns two time series having similar curves but are skewed differently over time. Time Source: Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Direct Comparison: DTW continued…  Sakoe-Chuba Band is used to reduce time & space complexity.  Parameters used in DTW: - Band width- Peak extention penalty - Difference in peak intensities. - Difference in peak direction Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Stan Aslvador and Philip Chan. FastDTW:Toward Accurate Dynamic Time Warping in Linear Time and Space, KDD Workshop on Mining Temporal and Sequential Data, 2004

Method: Dynamic Time Warping Consensus Sample Align to consensus sample Align next sample to consensus sample Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Method continued… Corresponding peaks Aligned sample Unaligned sample Corresponding peaks Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Calculate Area Peak 1

Results Corresponding peaks Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

Summary Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References  Proteomics - MSView  Glycomics - CE Analyze LC-MS data Identified Peptides Quantification results for Biomarker Discovery CE Data Quantification results for Biomarker Discovery

Acknowledgements Dr. Haixu Tang- My advisor Dr. Randy J.ArnoldDr. Yehia Mechref Dr. Milos NovotnyDr. David E.Clemmer Dr. Sun Kim Dr. Jeong-Hyeon Choi Dr. Stephen J. Valentine Yin Wu Manolo D.Plasencia School of Informatics Funding: NIH/NCRR MetaCyt Indiana University Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References

[ 1] Higgs, R.E., Knierman, M.D., Gelfanova, V., Butle,r J.P. and Hale, J.E. (2005) Comprehensive label-free method for the relative quantification of proteins from biological samples. J. Proteome Res., 4, [2] Linsen, L., Locherbach, J., Berth, M., Becher, D. and Bernhardy, J. (2006) Visual Analysis of Gel-Free Proteome Data. IEEE Transactions on Visualization and Computer Graphics,12, [3] Prakash, A., Mallick, P., Whiteaker, J., Zhang, H., Paulovich, A., Flory, M., Lee, H., Aebersold, R., and Schwikowski, B. (2006) Signal maps for mass spectrometry-based comparative proteomics. Mol. Cell. Proteomics 5, 423 –432 [4] Leptos, K. C., Sarracino, D. A., Jaffe, J. D., Krastins, B., and Church, G. M. (2006) MapQuant: open-source software for large-scale protein quantification. Proteomics 6, 1770 –1782 [5] Aebersold, R., and Mann, M. (2003) Mass spectrometry-based proteomics. Nature 422, 198 –207 Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References