Presentation is loading. Please wait.

Presentation is loading. Please wait.

Identify proteins. Proteomic workflow Trypsin A typical sample We add a solution of 50 mM NH 4 HCO 3 (pH 7.8) containing trypsin (0.01-5 µg/µl). Volume.

Similar presentations


Presentation on theme: "Identify proteins. Proteomic workflow Trypsin A typical sample We add a solution of 50 mM NH 4 HCO 3 (pH 7.8) containing trypsin (0.01-5 µg/µl). Volume."— Presentation transcript:

1 Identify proteins

2 Proteomic workflow

3 Trypsin A typical sample We add a solution of 50 mM NH 4 HCO 3 (pH 7.8) containing trypsin (0.01-5 µg/µl). Volume depends on the dimension of the sample Incubation o/n at 37°C Surnatants are 0.22 µm filtered Depending on the volume, samples are concentrated / vacuum dried Peptide mixtures are then analysed by mass spectrometry

4 Trypsin

5 525.3144 HIQK 615.3283 LHSMK 689.3828 VNELSK 748.3698 TTMPLW 831.3843 EDVPSER 910.4741 EGIHAQQK 1267.7045 YLGYLEQLLR 1384.7300 FFVAPFPEVFGK 1580.8279 VPQLEIVPNSAEER 1694.0761 LLILTCLVAVALARPK 1759.9450 HQGLPQEVLNENLLR 1767.7589 DIGSESTEDQAMEDIK 2316.1369 EPMIGVNQELAYFYPELFR 2321.0813 QMEAESISSSEEIVPNSVEQK This set of masses consitutes a fingerprint of the protein. An MS analysis can allow identification of this protein. MH + Peptide sequence

6 Identification by search in protein sequences databases

7 http://www.matrixscience.com/search_form_select.html

8

9

10 525.14 831.34 1384.68 2321.08 MALDI-TOF spectrum of the trypsin digestion of the pictorial sample containing milk Signals corresponding to peptides of bovine α-S1 casein

11 Database search with MS data only MH + data

12 Database search results Several unassigned signals All the signals in the spectrum are inserted in the search box

13 Effect of multiple Peptide Masses on Protein Identification by Mass Fingerprinting Search m/z Mass Tolerance Da N° of Hits 1529.730.1204 1529.730.1 7 1252.700.1 1529.730.1 1252.700.1 1 1833.880.1 Moreover…… The number of peptides detected that belong to the same protein strongly influences the identification

14 Effect of Mass Accuracy and Mass Tolerance on Peptide Mass Fingerprinting Search Results Search m/z Mass Tolerance Da N° of Hits 15291478 1529.70.1164 1529.730.01 25 1529.7340.001 4 1529.73480.0001 2 Moreover……

15 Identification by MS data, (generally MALDI-TOF), suffers from: The complex mixtures of proteins of organic materials The uneven relative quantities of the different components

16 MS analysis What can give us more informations? The sequence of two or more peptides

17 Despite the quite similar name Casein α–S1 and Casein α–S2 are quite different proteins

18 Peptide sequencing by MS

19 MS/MS of Peptide Mixtures LC MS MS/MS

20 CHIP LC-MSMS Q-Tof

21 Single peptides are selected for fragmentation MS/MS Fragmentation spectra Tandem mass spectra Peptide mixture MS of the single peptides + + + What is MS/MS?

22 Interpretation of an MSMS spectrum to derive structural information is analogous to solving a puzzle Use the fragment ion masses as specific pieces of the puzzle to put the intact molecule back together. + + +

23 -HN--CH--CO--NH--CH--CO--NH- RiRi CH-R’ cici z n-i R” d i+1 v n-i w n-i low energy high energy Cleavages Observed in MS/MS of Peptides aiai x n-i bibi y n-i

24 Simple Fragmentation rules Ions of the “b” serieIons of the “y” serie

25 Fragmentation rules

26 Precursor ion Doubly charged m/z = 634.36 MH+ = 1267.72 C-term Arg

27 525.3144 HIQK 615.3283 LHSMK 689.3828 VNELSK 748.3698 TTMPLW (C-terminus of the protein) 831.3843 EDVPSER 910.4741 EGIHAQQK 1267.7045 YLGYLEQLLR 1384.7300 FFVAPFPEVFGK 1580.8279 VPQLEIVPNSAEER 1694.0761 LLILTCLVAVALARPK 1759.9450 HQGLPQEVLNENLLR 1767.7589 DIGSESTEDQAMEDIK 2316.1369 EPMIGVNQELAYFYPELFR 2321.0813 QMEAESISSSEEIVPNSVEQK Since the proteolytic enzyme is trypsin, all the peptides end either with Arg (R) or Lys (K). Y 1 ion will always be either 147 (K), or 175 (R) MH + Peptide sequence

28 m/z = 634.36 MH+ = 1267.72 C-term Arg 288 – 175 = 113 = Leu (L) RL 401 – 288 = 113 = Leu (L) L 175 288 401 529 – 401 = 128 = GLn (Q) Q 529 E 658 129=E 113=L 163=Y 113=L 57=G L 771 Y 934 G 991 L 1104 MH + - y 9 = 163 = Tyr (Y) Y Precursor ion

29 m/z = 634.36 MH+ = 1267.72 RLL 175 228 401 Q 529 E 658 129=E 113=L 57=G L 771 Y 934 G 991 L 1104 Y Precursor ion 277 334 163=Y 497 610 739 128=Q 867

30 m/z = 634.36 MH+ = 1267.72 RLLQELYGLY y1y1 y8y8 y7y7 y6y6 y5y5 y4y4 y3y3 y2y2 y9y9 b2b2 b3b3 b4b4 b6b6 b5b5 b7b7

31 MSMS Peptide Fragmentation signal b 1 y 1 b 2 y 2 b 3 y 3 b 4 y 4 b 5 y 5 Ala-Gly-His-Leu-….Phe-Glu-Cys-Tyr

32 Should we manually interpeter each fragmentation spectrum?

33 Peptide Sequencing This set of numbers are identificative of the fragmentation spectrum of this peptide

34 Signals in the fragmentation spectra can be predicted b seriey serie ---b1b1 Y y 10 --- 277.1547b2b2 L y9y9 1104.6412 334.1761b3b3 G y8y8 991.5571 497.2395b4b4 Y y7y7 934.5356 610.3235b5b5 L y6y6 771.4723 739.3661b6b6 E y5y5 658.3883 867.4247b7b7 Q y4y4 529.3457 980.5088b8b8 L y3y3 401.2871 1093.5928b9b9 L y2y2 288.2030 ---b 10 R y1y1 175.1190 b seriey serie ---b1b1 Y y 10 --- 277.1547b2b2 L y9y9 1177.6364 334.1761b3b3 G y8y8 1064.5524 497.2395b4b4 Y y7y7 1007.5309 683.3188b5b5 W y6y6 844.4676 812.3614b6b6 E y5y5 658.3883 940.4199b7b7 Q y4y4 529.3457 1053.5040b8b8 L y3y3 401.2871 1166.5881b9b9 L y2y2 288.2030 ---b 10 R y1y1 175.1190 A single change in the sequence changes the profile of expected signals

35 m/z 1410.6 Database IIGHFYDDWCPLK SPAFDSIMAETLK AFDSLPDDIHEK GGILAQSPFLIIK Real spectrum cross- correlated with theoretical spectrum 25050075010001250 20 40 60 80 100 x8 185.3 255.7 360.9 403.0 519.1 662.3 805.5 1007.4 1155.5 1226.8 1324.8 892.6 m/z Database searches compares the experimental MSMS spectra with the virtual spectra of the peptides generated by the in silico digestion of the proteins in the database

36 100 fmol BSA injected on column. BPC of m/z 300-2200, and typical MS/MS spectrum (right inset). 304050 Time [min ] MS trace MS/MS trace MS m/z 600200 1000 y2 b3 y3 y4 y5 b7 y7 b8 y6 b9 y9 y10 b11 b12 MS/MS NanoLC MS/MS

37 Database search

38

39 Database search with raw data from LC-MSMS MH + data Fragmentation spectra are automatically uploaded

40

41

42

43 Why Trypsin is preferred? Because MS sees only ions….. Upon fragmentation, the presence of a positively charged residue (Lys or Arg), will ensure the presence of a charge on fragments on the C- terminal side, generating the y serie The amino group at the N-terminus should ensure the presence of a charge on fragments on the N- terminal side, generating the b serie

44

45 For this kind of sample we do not use any fixed modification

46 Variable Modifications take into account modifications induced by sample treatment and/or sample deterioration

47 These informations depend on the instrument

48 A typical output of the results

49

50

51 Proteins are ranked as a function of decreasing score Proteins are grouped into families using a novel hierarchical clustering algorithm. If the family contains multiple members, the accessions, scores and descriptions are aligned with a dendrogram, which illustrates the degree of similarity between members. A short description is given of the hit, with the organism of provenance

52 -The Report Builder tab allows you to build a customised table of protein hits, which is particularly useful if you need a minimal list of proteins for a publication. Number of peptide matchesNumber of significant peptide matches (above the significance threshold) Number of indipendent sequences Number of significant indipendent sequences (above the significance threshold)

53 A hit Experimental m/z valueExperimental m/z transformed to a relative molecular mass molecular mass calculated from the matched peptide sequence Difference (error) between the experimental and calculated masses Ions score - If there are duplicate matches to the same peptide, then the lower scoring matches are shown in brackets. Expectation value for the peptide match. (The number of times we would expect to obtain an equal or higher score, purely by chance. The lower this value, the more significant the result). A letter U if the peptide sequence is unique to the protein family sequence Sequence of the peptide in 1-letter code. If the peptide sequence is modified, each affected residue is underlined.

54 Clicking the number These are other possible interpretations of the same fragmentation spectrum Highlighting the number (it is the spectrum number in the query)

55 The matched fragment ions are shown in tabular format below the spectrum. The ion series are those specified by the INSTRUMENT search parameter. If you choose to label the matches used for scoring, bold italic red means the series contributed to the score. Bold red means that the number of matches in the ion series is greater than would be expected by chance, indicating that the ion series is present. Non-bold red means that the number of matches in the ion series is no greater than would be expected by chance, so that the matches themselves may be by chance.

56 Advantages: - Univoque identifications - Multiple identifications - Few peptides are sufficient - Proteins in mixtures can be distinguished - Reduced relevance of protein contamination - Deductive results: no hypothesys is requested - Organisms can be recognized and differentiated


Download ppt "Identify proteins. Proteomic workflow Trypsin A typical sample We add a solution of 50 mM NH 4 HCO 3 (pH 7.8) containing trypsin (0.01-5 µg/µl). Volume."

Similar presentations


Ads by Google