Example of regression by RBF-ANN

Presentation on theme: "Example of regression by RBF-ANN"— Presentation transcript:

1 Example of regression by RBF-ANN
Prediction of the charge on peptides after electrospray ionization in mass spectrometry.
What are the best attributes to predict charge?

2 Review of molecular biology
DNA sequence determines protein sequence

3 Amino acids with different side chains have different names
What are amino acids? (Figure labels: N-terminus, C-terminus, side chain)

Name, three-letter code, one-letter code:
glycine        gly  G
alanine        ala  A
valine         val  V
leucine        leu  L
isoleucine     ile  I
methionine     met  M
proline        pro  P
phenylalanine  phe  F
tryptophan     trp  W
serine         ser  S
cysteine       cys  C
threonine      thr  T
glutamine      gln  Q
asparagine     asn  N
histidine      his  H
tyrosine       tyr  Y
glutamic acid  glu  E
aspartic acid  asp  D
lysine         lys  K
arginine       arg  R
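The naming table on this slide can be used directly as a lookup. The sketch below is only an illustration: the function name `to_one_letter` and the dash-separated input format are assumptions, not something the slides define.

```python
# Three-letter to one-letter codes, copied from the slide's table.
THREE_TO_ONE = {
    "gly": "G", "ala": "A", "val": "V", "leu": "L", "ile": "I",
    "met": "M", "pro": "P", "phe": "F", "trp": "W", "ser": "S",
    "cys": "C", "thr": "T", "gln": "Q", "asn": "N", "his": "H",
    "tyr": "Y", "glu": "E", "asp": "D", "lys": "K", "arg": "R",
}

def to_one_letter(three_letter_sequence):
    """Convert e.g. 'gly-ala-arg' to 'GAR' (hypothetical helper and input format)."""
    codes = three_letter_sequence.lower().split("-")
    return "".join(THREE_TO_ONE[code] for code in codes)
```

For example, `to_one_letter("ala-ala-arg")` gives `"AAR"`, the one-letter form used for the peptide sequences later in the deck.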

4 Chemical properties of amino acids

5 More properties of amino acids
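Slide columns: code, mass, pI, pK1, pK2, charge, Hydrophobic?, Polar?

code  pI     pK1   pK2
A     6.01   2.35  9.87
R     10.76  1.82  8.99
N     5.41   2.14  8.72
D     2.85   1.99  9.9
C     5.05   1.92  10.7
E     3.15   2.1   9.47
Q     5.65   2.17  9.13
G     6.06   ?     9.78
H     7.6    1.8   9.33
I     6.05   2.32  9.76
L     ?      2.33  9.74
K     9.6    2.16  9.06
M     5.74   2.13  9.28
F     5.49   2.2   9.31
P     6.3    1.95  10.64
S     5.68   2.19  9.21
T     5.6    2.09  9.1
W     5.89   2.46  9.41
Y     5.64   ?     ?
V     6.0    2.39  ?

Entries marked ? are missing from this transcript. The mass column and most of the charge, Hydrophobic?, and Polar? columns are also missing. The surviving entries are A: hydrophobic T, polar F; R: charge +; D: charge -.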

6 Amino Acids Polymerize to Form Proteins (polypeptides)
Formation of the peptide bond.
[Figure: peptide backbone -N-C-C-N-C-C-N- with H and R labels]

7 Proteases: enzymes that cut proteins at the peptide bond
[Figure: peptide backbone -N-C-C-N-C-C-N- with H and R labels]
Most proteases have cleavage specificity.
Trypsin cleaves mainly at arginines (R) and lysines (K).
Digestion of a protein with trypsin produces peptides of various lengths.
Analysis of the digestion mixture yields information about the proteins in the sample.
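The digestion step can be sketched in code. This is a simplified illustration, assuming that trypsin cuts on the C-terminal side of every K and R; known exceptions (such as reduced cleavage before proline) are ignored, and the function name `trypsin_digest` is hypothetical.

```python
def trypsin_digest(protein):
    """Split a one-letter protein sequence after every K or R.

    Simplifying assumption: every K/R site is cleaved, with no exceptions
    and no missed cleavages.
    """
    peptides = []
    current = []
    for residue in protein:
        current.append(residue)
        if residue in "KR":
            # Cleavage site reached: close the current peptide.
            peptides.append("".join(current))
            current = []
    if current:
        # The C-terminal fragment need not end in K or R.
        peptides.append("".join(current))
    return peptides
```

Under this rule every peptide except possibly the last ends in K or R, which matches the example sequences in the dataset slide (ending in ...FAR, ...ANR, ...AAK, ...AER).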

8 Liquid chromatography coupled to mass spectrometry
[Figure: digested protein mixture, LC column, electrospray ionization, mass spectrometer]
Peptides are retained for differing times on the LC column.
Peptides may have multiple charges.
Charges in the dataset are averages from several runs.

9 First 4 of ~23,000 data pairs

Sequence                          Charge
AAAAAAPDDVAAQLVVADLDLVGGHVEDAFAR  2.8
AAAAADLANR                        2
AAAAAQASASAAAK
AAAAAVAQGGPIEDAER

Can peptide sequence be an input?
What inputs can we calculate from the input sequence?

10 Some suggestions for inputs from properties of amino acids
Length of peptide
Mass of peptide
First amino acid
Last amino acid
Fractions of amino acids of each type
Fractions of hydrophobic, polar, and charged residues
Net formal charge
Average isoelectric point
Average dissociation constant
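Several of the suggested inputs can be computed from the sequence alone. The following is a minimal sketch, not the presenter's feature code: the function name `peptide_features` is hypothetical, and the net formal charge assumes K and R count as +1 and D and E as -1, with histidine treated as neutral. Mass, average isoelectric point, and average dissociation constant are left out because they need a complete per-residue property table.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
POSITIVE = "KR"  # assumption: lysine and arginine carry +1
NEGATIVE = "DE"  # assumption: aspartic and glutamic acid carry -1

def peptide_features(seq):
    """Return a dict of simple sequence-derived inputs for one peptide."""
    if not seq:
        raise ValueError("empty peptide sequence")
    counts = Counter(seq)
    n = len(seq)
    n_pos = sum(counts[a] for a in POSITIVE)
    n_neg = sum(counts[a] for a in NEGATIVE)
    features = {
        "length": n,
        "first": seq[0],
        "last": seq[-1],
        "net_formal_charge": n_pos - n_neg,
        "frac_charged": (n_pos + n_neg) / n,
    }
    # One fraction per amino acid type (20 numeric inputs).
    for aa in AMINO_ACIDS:
        features["frac_" + aa] = counts[aa] / n
    return features
```

The first and last residues are categorical, so they would need to be encoded (for example as nominal attributes) before being passed to a numeric model.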

11 RBF-ANN is under Classify and is called RBFNetwork
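To show what an RBF network computes in regression, here is a minimal pure-Python sketch. It is not the RBFNetwork implementation named on the slide. It assumes Gaussian basis functions with one shared width, centers supplied by the caller, and output weights fitted by ridge-regularized least squares. All function names and the `ridge` parameter are illustrative.

```python
import math

def rbf(x, center, width):
    """Gaussian basis function of the distance between x and center."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, center))
    return math.exp(-d2 / (2.0 * width ** 2))

def hidden_layer(x, centers, width):
    """Activations of all basis functions for one input, plus a bias term."""
    return [rbf(x, c, width) for c in centers] + [1.0]

def solve(A, b):
    """Solve A w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (M[r][n] - s) / M[r][r]
    return w

def fit_rbf(X, y, centers, width, ridge=1e-8):
    """Fit linear output weights via the regularized normal equations."""
    Phi = [hidden_layer(x, centers, width) for x in X]
    m = len(centers) + 1
    A = [[sum(row[i] * row[j] for row in Phi) + (ridge if i == j else 0.0)
          for j in range(m)] for i in range(m)]
    b = [sum(row[i] * t for row, t in zip(Phi, y)) for i in range(m)]
    return solve(A, b)

def predict_rbf(x, centers, width, weights):
    """Network output: weighted sum of basis activations and bias."""
    return sum(w * h for w, h in zip(weights, hidden_layer(x, centers, width)))
```

In practice the centers are usually chosen by clustering the training inputs rather than given by hand, and each input `x` would be a numeric feature vector derived from a peptide sequence, with the average charge as the target `y`.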

12

13

14

