Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Sequencing and Identification by Mass Spectrometry.
The Proteomics Core at Wayne State University
Micromass Quattro Ultima triple quadrupole mass spectrometric detector HPLC system (LC) Electrospray ionisation source (-ve & +ve ion) Photodiode array.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Peptide Mass Fingerprinting
De Novo Sequencing v.s. Database Search Bin Ma School of Computer Science University of Waterloo Ontario, Canada.
By Jonathan Haltiwanger and Teresa Leslie.  Analytical technique to determine the elemental composition of a sample  Vaporization  ionization  electric.
CSE182 CSE182-L12 Mass Spectrometry Peptide identification.
Protein Sequencing and Identification by Mass Spectrometry.
Fa 05CSE182 CSE182-L7 Protein sequencing and Mass Spectrometry.
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Basics of 2-DE and MALDI-ToF MS
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
A Neural Network Predictor for Peptide Fragmentation in Mass Spectrometry Arunima Ram Advisor : Dr. Predrag Radivojac Co-Advisor : Dr. Haixu Tang Co-Advisor.
1 Mass Spectrometry-based Proteomics Xuehua Shen (Adapted from slides with textbook)
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Protein sequencing and Mass Spectrometry. Sample Preparation Enzymatic Digestion (Trypsin) + Fractionation.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Mass Spectrometry Mass spectrometry (MS) is not true “spectroscopy” because it does not involve the absorption of electromagnetic radiation to form an.
The dynamic nature of the proteome
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Pharmaceutical analysis Bioavailability studies Drug metabolism studies, pharmacokinetics Characterization of potential drugs Drug degradation product.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Laxman Yetukuri T : Modeling of Proteomics Data
In-Gel Digestion Why In-Gel Digest?
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
PEAKS: De Novo Sequencing using Tandem Mass Spectrometry Bin Ma Dept. of Computer Science University of Western Ontario.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Overview of Mass Spectrometry
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Error tolerant search Large number of spectra remain without significant score. Reasonable number of fragment ion peaks might have not match. – Underestimated.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Tag-based Blind Identification of PTMs with Point Process Model 1 Chunmei Liu, 2 Bo Yan, 1 Yinglei Song, 2 Ying Xu, 1 Liming Cai 1 Dept. of Computer Science.
De Novo Peptide Sequencing via Probabilistic Network Modeling PepNovo.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
Constructing high resolution consensus spectra for a peptide library
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
An introduction to Mass spectrometry. What is mass spectrometry? Analytical tool measuring molecular weight (MW) of sample Only picomolar concentrations.
Peptide de novo sequencing Peptide de novo sequencing is the analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
A Database of Peak Annotations of Empirically Derived Mass Spectra
The Covalent Structure of Proteins
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
Mass spectrometry-based proteomics
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra I
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Shotgun Proteomics in Neuroscience
Using Bayesian Network in the Construction of a Bi-level Multi-classifier. A Case Study Using Intensive Care Unit Patients Data B. Sierra, N. Serrano,
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra
Presentation transcript:

Mass Spectrometry

What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total molecular weight of a large sample (biomolecule) which is enough to identify substitutions, post translational modifications.

Use to BioChemists Accurate molecular weight measurements Determine sample purity Verify amino acid substitutions, post translational modifications. Amino acids sequencing Oligonucleotide structure Protein structure determination (protein folding, macromolecular structure determination)

Use in the Industry Biotechnology (Analysis of proteins, peptides, oligonucleotides) Pharmaceuticals (Drug discovery, pharmacokinetics, drug metabolism) Clinical (neonatal screening, haemoglobin analysis) Geological (Oil composition) Environmental (Water quality, food contamination)

Mass Spectrometer has… 3 parts: Ionization source Analyzer Detector

Matrix-Assisted Laser Desorption/Ionization (MALDI) From lectures by Vineet Bafna,UCSD

Protein to peptide to fragment ion Proteins are digested by using a protease like Trypsin. Trypsin breaks the protein backbone at L and R which are basic residues and form positive ions. The mass spectrometer further breaks these peptides into fragment ions.

Peptide Cleavage

Glycan Cleavage

Mass Spectrum

Experimental SpectrumTheoretical Spectrum

Peptide sequencing problem Goal: Find a peptide whose theoretical spectrum matches the given experimental spectrum the best.

How can it be done? Database search De Novo search

Database search Given a experimental spectrum and the parent mass of the experimental peptide, find candidate peptides with the same parent mass in the database that match the experimental peptide the best

De novo Search Build a spectrum graph from the masses to create the nodes Use mass differences to create the edges Find the best path

De novo SearchSequence

Database vs De Novo Search Database search is very successful in identification of already known proteins. De novo helps in identification of proteins not in the database. Database search is not as fast as De novo. De novo needs good quality spectra and without any modifications to work with. De novo is not very accurate. Database: SEQUEST (Yates et al) De Novo: PepNovo (Frank and Pevzner)

Our Project Database search tool for Peptide Identification from Mass Spectrometry data using a Machine Learning approach

What makes it different from the traditional search tools? * Traditional search tools use ad-hoc rules and or unified probabilistic models * Our tool is based on the Machine learning approach

What is Machine Learning? * An area of artificial intelligence concerned with the development of techniques which allow computers to learn from data. *The researcher feeds a set of training examples to a computer program that aims to learn the connection between features of the examples and a specified target concept.

Examples of Machine learning techniques *Linear Regression *Decision Tree learning *Artificial Neural Networks *Bayesian Learning *Analytical Learning *Reinforcement Learning etc.....

Our Choice.... Artificial Neural Networks Reason? * Peptide fragmentation is a non-linear problem governed by complex rules.

A brief overview of Neural Networks

Brain Cells

From Human Neurons to Artificial Neurons....

Feed Forward Networks

Work flow of the project

Data Preparation Protein Samples were isolated from rat brains Samples were digested with trypsin and passed through LCQ Deca Xp ion-trap mass spectrometer and spectra of peptide ions were recorded. All spectra were searched against protein sequences for Rattus in Swiss- Prot database using Mascot

Precursor peptides were divided into double and triple charged sets. Peak intensities were estimated for the following ion types precursor-H2O b, b-H2O, b-NH3, b-H2O-NH3 y, y-H2O, y-NH3, y-H2O-NH3

Network Training Features for each ion were extracted. Target is the peak intensity for each ion. Seperate ensembles of two layer feed- forward networks were constructed for each ion type and trained on the data.

Prediction of Spectrum The predicted fragment spectrum was constructed combining the outputs of individual predictors for each ion type. A blackbox was constructed from the trained Neural Network models The blackbox when presented with a peptide as input will output the predicted spectrum

Thank You Questions?