Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Functional prediction in proteins (purifying and positive selection)

Similar presentations


Presentation on theme: "1 Functional prediction in proteins (purifying and positive selection)"— Presentation transcript:

1 1 Functional prediction in proteins (purifying and positive selection)

2 2 1. Introduction: evolution & sequence analysis

3 3 Darwin – the theory of natural selection  Adaptive evolution: Favorable traits will become more frequent in the population

4 4 Adaptive evolution  When natural selection favors a single allele and therefore allele frequency continuously shifts in one direction

5 5 Kimura – the theory of neutral evolution  Neutral evolution: Most molecular changes have no effect on the phenotype (neutral) Selection operates to preserve a trait (no change)

6 6 Purifying Selection  Stabilizes a trait in a population: Small babies  more illness Large babies  more difficult birth…  Baby weight is stabilized round 3-4 Kg

7 7 Purifying selection (conservation) - the molecular level  Histone 3

8 8 Synonymous vs. non-synonymous substitutions Purifying selection: excess of synonymous substitutions relative to non-synonymous substitutions Synonymous substitution: GUU  GUC Non-synonymous substitution: GUU  GCU

9 9 Synonymous vs. non-synonymous substitutions  Histone 3 Non-syn. Syn.

10 10 Conservation as a means of predicting function Infer the rate of evolution at each site

11 11 Conservation as a means of predicting function Low rate of evolution  constraints on the site to prevent disruption of function/structure: active sites, protein-protein interactions, protein core etc. 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE

12 12 Which site is more conserved? 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE

13 13 Use phylogenetic information 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE A G A A A G A A A A G G

14 14 ConSurf/ConSeq web servers: Prediction of conserved residues by estimating evolutionary rates at each site

15 15 Working process Input a protein with a known 3D structure (PDB ID or file provided by the user) Find homologous protein sequences (psi-blast) Perform multiple sequence alignment (removing doubles)Construct an evolutionary tree Project the results on the 3D structureCalculate the conservation score for each site

16 16 ConSurf example: potassium channel  An integral membrane protein with sequence similarity to all known K+ channels, particularly in the pore region.  PDB ID: 1bl8 chain A

17 17 ConSurf results

18 18 http://conseq.bioinfo.tau.ac.il/  ConSeq performs the same analysis as ConSurf but presents the results on the sequence.  Predicts buried/exposed relation exposed & conserved  functionally important sites exposed & conserved  functionally important sites buried & conserved  structurally important sites buried & conserved  structurally important sites

19 19 2. Positive selection & drug resistance

20 20 Darwin – the theory of natural selection  Adaptive evolution: Favorable traits will become more frequent in the population

21 21 Adaptive evolution at the molecular level

22 22 Adaptive evolution at the molecular level Look for changes which confer an advantage

23 23 Naïve detection  Observe a multiple sequence alignment: variable regions = adaptive evolution??

24 24 Naïve detection  The problem – how do we know which sites are not under any selection pressure (“non-important” sites) and which are under adaptive evolution?

25 25 Solution – we look at the DNA synonymous non- synonymous

26 26 Solution – we look at the DNA Purifying selection Syn > Non-syn Adaptive evolution = Positive selection Non-syn > Syn Neutral selection Syn = Non-syn

27 27 Also known as… Ka/Ks (or dn/ds, or ω) ratio  Purifying selection: Ka < Ks (Ka/Ks <1)  Neutral selection: Ka = Ks (Ka/Ks = 1)  Positive selection: Ka > Ks (Ka/Ks >1) Non- synonymous substitution rate Synonymous substitution rate

28 28 Examples for positive selection  Proteins involved in the immune system  Proteins involved in host-pathogen interaction (‘arms-race’)  Proteins following gene duplication  Proteins involved in reproduction systems

29 29 Accumulation of substitutions (syn. or non-syn.) depends on the evolutionary time that elapsed since the divergence of the analyzed species. When distant species are analyzed saturation of syn. substitutions is often encountered Synonymous vs. non-synonymous substitutions

30 30 Selecton – a server for the detection of purifying and positive selection http://selecton.bioinfo.tau.ac.il Stern et al., Nucleic Acids Res 35, W506 (2007).

31 31 Detecting drug resistance using Selecton

32 32 HIV: molecular evolution paradigm Rapidly evolving virus: 1.High mutation rate (low fidelity of reverse transcriptase) 2.High replication rate

33 33 Drug resistance No drug Drug Adaptive evolution (positive selection)

34 34 HIV Protease Protease is an essential enzyme for viral replication Drugs against Protease are always part of the “cocktail”

35 35 Ritonavir Inhibitor  Ritonavir (RTV) is a specific protease inhibitor (drug) C 37 H 48 N 6 O 5 S 2

36 36 Used Selecton to analyse HIV-1 protease gene sequences from patients that were treated with RTV only

37 37

38 38 Example: HIV Protease  Primary mutations  Secondary mutations   novel predictions (experimental validation)

39 39 Rate shifts and HIV sub-types

40 40 Rate shifts V Chimp V Rhesus A Squirrel K Rat M Mouse V Human

41 41 Rate shifts V V A K M V Low evolutionary rate High evolutionary rate

42 42 Rate shifts Specificity determinants:  Different phylogenetic groups V Chimp V Rhesus A Squirrel A Rat A Mouse V Human Gain of function?

43 43 Rate shifts Specificity determinants:  Following gene duplication V S. paradoxus V S. mikatae A S. cervisiae A S. paradoxus A S. mikatae V S. cervisiae Tropomyosin 1 Tropomyosin 2

44 44 Rate shifts in HIV subtypes

45 45 HIV subtypes

46 46 Which sites are responsible for the differences between the subtypes?  Detection of rate-shifts in all 9 subtypes

47 47 Significant rate shift in all HIV genes proportion # rate-shift sites 0.184Env 0.0420Gag 0.1721Nef 0.0333Pol 0.2529Rev 0.1513Tat 0.0713Vif 0.054Vpr 0.3529Vpu

48 48 Gag Position12  Wild-type (E)  Site which contributes to Protease Inhibitor (Amprenavir) drug resistance (K) E E E K Q R K K

49 49 C C A G F D J KEQNR

50 50 Summary  Sequence analysis can provide valuable information about protein function  The basic signal: conservation: http://consurf.tau.ac.il  Positive “Darwinian” selection: http://selecton.bioinfo.tau.ac.il http://selecton.bioinfo.tau.ac.il  Rate-shifts (specificity determinants)


Download ppt "1 Functional prediction in proteins (purifying and positive selection)"

Similar presentations


Ads by Google