Presentation is loading. Please wait.

Presentation is loading. Please wait.

Corrections. SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY.

Similar presentations


Presentation on theme: "Corrections. SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY."— Presentation transcript:

1 Corrections

2

3 SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY RKLQYTWIDGEWSTPSRAYSHVTPENLASSAPTTGLKADDVALRRTYFGP NVMPVKLSPFYELVYKEVLSPFYIFQAISVTVWYIDDYVWYAALIIVMSL YSVIMTLRQTRSQQRRLQSMVVEHDEVQVIRENGRVLTLDSSEIVPGDVL VIPPQGCMMYCDAVLLNGTCIVNESMLTGESIPITKSAISDDGHEKIFSI DKHGKNIIFNGTKVLQTKYYKGQNVKALVIRTAYSTTKGQLIRAIMYPKP ADFKFFRELMKFIGVLAIVAFFGFMYTSFILFYRGSSIGKIIIRALDLVT IVVPPALPAVMGIGIFYAQRRLRQKSIYCISPTTINTCGAIDVVCFDKTG TLTEDGLDFYALRVVNDAKIGDNIVQIAANDSCQNVVRAIATCHTLSKIN NELHGDPLDVIMFEQTGYSLEEDDSESHESIESIQPILIRPPKDSSLPDC QIVKQFTFSSGLQRQSVIVTEEDSMKAYCKGSPEMIMSLCRPETVPENFH DIVEEYSQHGYRLIAVAEKELVVGSEVQKTPRQSIECDLTLIGLVALENR LKPVTTEVIQKLNEANIRSVMVTGDNLLTALSVARECGIIVPNKSAYLIE HENGVVDRRGRTVLTIREKEDHHTERQPKIVDLTKMTNKDCQFAISGSTF SVVTHEYPDLLDQLVLVCNVFARMAPEQKQLLVEHLQDVGQTVAMCGDGA NDCAALKAAHAGISLSEAEASIAAPFTSKVADIRCVITLISEGRAALVTS YSAFLCMAGYSLTQFISILLLYWIATSYSQMQFLFIDIAIVTNLAFLSSK TRAHKELASTPPPTSILSTASMVSLFGQLAIGGMAQVAVFCLITMQSWFI PFMPTHHDNDEDRKSLQGTAIFYVSLFHYIVLYFVFAAGPPYRASIASNK AFLISMIGVTVTCIAIVVFYVTPIQYFLGCLQMPQEFRFIILAVATVTAV ISIIYDRCVDWISERLREKIRQRRKGA

4 Compute pI/Mw tool !!! If you choose the wrong format for the sequence… With the correct format:

5 ProtParam

6

7 SAPS

8 SAPS (1)

9 SAPS (2)

10

11 doi: 10.1093/bioinformatics/bti797

12 http://en.wikipedia.org/wiki/Coiled_coil

13

14 The coiled-coil domains are annotated according to 3D structure data (experimental data)

15

16 Coiled-coil prediction Coils

17 http://www.ch.embnet.org/software/COILS_form.html Coils prediction

18

19 Coiled-coil prediction PairCoil (not always working…)

20 Paircoil prediction

21 Coiled-coil prediction PairCoil2

22 Parcoil2 results

23

24 Coiled-coil prediction Sliding window (Protscale)

25 Sliding window amino acid scale- example:

26 Bad results---- Bad results….

27

28 Sliding windows and amino acid scales Transmembrane domain: alpha-helix of 20 amino acids (hydrophobic) -> amino acid scales: hydrophobicity and alpha helix -> sliding window size: 20 amino acids

29 Protscale Amino acid scale: Kyte and Doolittle (hydrophobicity) Sliding window size: 21 amino acids

30 Protscale Amino acid scale: Chou&Fasman (alpha helix) Sliding window size: 21

31 Sliding windows and amino acid scales Transmembrane domain: alpha-helix of 20 amino acids (hydrophobic) -> amino acid scales: hydrophobicity and alpha helix -> sliding window size: 20 amino acids

32 Method based HMM or NN

33 HMMTOP

34 Protein: seq4 Length: 1127 N-terminus: IN Number of transmembrane helices: 8 Transmembrane helices: 65-82 409-432 445-468 916-940 970-993 1020-1039 1052- 1072 1089-1106

35 TMHMM (1)

36 TMHMM (2)

37 TMpred (1)

38 PSORT II (1)

39 - Look for the presence of a signal peptide.

40 No signal peptide Signal peptides are often predicted as ‘transmembrane’ domains (or vice versa) as they amino acids with similar biochemical properties (hydrophic and alpha helix).

41 Transmembrane: resume HMMTOP (8 TM) PSORT II (10 TM) Tmpred (10 TM) TMHMM (11 TM) 1 1130 1 1 1 in out Big loop

42 ? missed TM

43 The protein is known to contain 12 TM: one TM is missing at the N-terminus The possible ways to find the correct protein topology is to do a multiple alignment with other family members, or to do some 3D experiment (which are difficult with proteins containing transmembrane domains) Kristian Axelsen: personnal communication SEQ4 = Q9N323Q9N323

44

45 The Aquaglyceroporin contains ½ transmembrane regions which can not be predicted by programs, because the region is too short (less than 20 amino acids). There is no way to predict such transmembrane regions, except by doing 3D experiments. 3D experiments is the only way to confirm and ‘predict’ correctly transmembrane domains. Similarity analysis could then help to predict such regions in other protein of the same family. P0AER0

46

47 M3 and M7 are ‘demi’ transmembrane: not predictable

48 http://www.uniprot.org/uniprot/P0AER0#section_features

49 Look for the transmembrane regions of P31243 (try the different transmembrane prediction programs): your conclusions ?

50

51 No transmembrane domains are found by any program because this protein, a porin, is anchored in the membrane by a specific 3D structure called beta barrel which does not have any alpha helix….

52 ‘beta barrel’ Mainly composed of beta-sheets in a 16-stranded beta-barrel formation and forms a pore in the membrane 1.7 - 2.5 nm in diameter. Note that the orientation of the strands is such that side chains alternately point into the interior and exterior of the pore; the former are strongly polar residues while the latter are very hydrophobic.

53 Beta barrel Porin from Rhodobacter

54

55 Alignment of the 2 isoforms The gene has two in-frame initiation codons and two different proteins are made by alternative initiation (of translation)

56 According to this publication (PubMed: 11274159), there is a 'Dual targeting of spinach protoporphyrinogen oxidase II to mitochondria and chloroplasts by alternative use of two in- frame initiation codons'.

57 Immunoblot analysis of Protox II in spinach leaf. Watanabe N et al. J. Biol. Chem. 2001;276:20474-20481 ©2001 by American Society for Biochemistry and Molecular Biology chloromitoTotal leaf

58

59

60

61

62

63 Q94IG7 – Long isoform wolfPSORT: chloroplast TargetP: chloroplast CH score: 0.826 MI score: 0.026 ER score: 0.101 Other location: 0.060 SignalP-NN: not secreted score (D): 0.285 SignalP-HMM: not secreted SP probability: 6.2% SA probability: 0.2% ChloroP: chloroplast prediction score: 0.549 MITOPROT: mitochondria !!! exported to mitochondria with a probability of 0.71 !!!! Q94IG7 – Short isoform wolfPSORT: mitochondrial TargetP: mitochondrial CH score: 0.123 MI score: 0.504 ER score: 0.048 Other location: 0.400 SignalP-NN: not secreted score (D): 0.195 SignalP-HMM: not secreted SP probability: 3.1% SA probability: 5% ChloroP: not in chloroplast prediction score: 0.473 MITOPROT: other location exported to mitochondria with a probability of 0.33 !!!!!!

64

65

66

67 Cystein (61 modifications) and serine (46 modifications) are the amino acids with the highest number of known associated PTM. Beware: Resid considers the selenocystein as a PTM…this is not the case !

68

69

70 Phosphorylation

71 P03372

72 UniProt data: Experimentally proved P03372

73 http://www.phosphosite.org/proteinAction.do?id=968&showAllSites=true

74 The phosphorylation sites are localized on the ‘surface’ of the protein (homodimer) (where the amino acid are accessible to the kinases !)

75 O-glycosylation

76 P02724

77 Myristoylation

78 P51876

79

80 NMT

81 Myristoylator

82

83 Protein: secreted protein (P02751, fibronectin) Can be predicted: -Subcellular location (PSORT, TargetP) -Domains (InterPro) -Signal -Sulfation -N-glycosylation -O-glycosylation -Phosphorylation (Not predictable…) (predictable…)

84 THE END


Download ppt "Corrections. SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY."

Similar presentations


Ads by Google