Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evolution & Design Principles in Biology: a consequence of evolution and natural selection Rui Alves University of Lleida Course Website:http://web.udl.es/usuaris/pg193845/Bioinformatics_2009/

Similar presentations


Presentation on theme: "Evolution & Design Principles in Biology: a consequence of evolution and natural selection Rui Alves University of Lleida Course Website:http://web.udl.es/usuaris/pg193845/Bioinformatics_2009/"— Presentation transcript:

1 Evolution & Design Principles in Biology: a consequence of evolution and natural selection Rui Alves University of Lleida ralves@cmb.udl.es Course Website:http://web.udl.es/usuaris/pg193845/Bioinformatics_2009/

2 Part I: Molecular Evolution

3 Theory of Evolution Evolution is the theory that allows us to understand how organisms came to be how they are In probabilistic terms, it is likely that all living beings today have originated from a single type of cells These cells divided and occupied ecological niches, where they adapted to the new environments through natural selection

4 How did the first cell create different cells? Neutral Mutation (e.g. by error in genome replication)

5 How did the first cell create different cells? Neutral Mutation (e.g. by error in genome replication)

6 How did the first cell create different cells? Neutral Mutation (e.g. by error in genome replication)

7 How did the first cell create different cells? Deleterious Mutation (e.g. by error in genome replication)

8 How did the first cell create different cells? Deleterious Mutation (e.g. by error in genome replication)

9 How did the first cell create different cells? Deleterious Mutation (e.g. by error in genome replication)

10 How did the first cell create different cells? Advantageous Mutation (e.g. by error in genome replication)

11 How did the first cell create different cells? Advantageous Mutation (e.g. by error in genome replication)

12 And then there was sex…

13 Why Sex??? Asexual reproduction is quicker, easier  more offspring/individual. Sex may limit harmful mutations – Asexual: all offspring get all mutations – Sexual: Random distribution of mutations. Those with the most harmful ones tend not to reproduce. Generate beneficial gene combinations – Adaptation to changing environment – Adaptation to all aspects of constant environment – Can separate beneficial mutations from harmful ones – Sample a larger space of gene combinations

14 New Niche/ New conditions in old niche What drives cells to adapt?

15 New (better adapted) mutation What drives cells to adapt?

16 How do New Genes and Proteins appear? Genes (Proteins) are build by combining domains New proteins may appear either by intradomain mutation of by combining existing domains of other proteins Cell Division Cell Division … …

17 The Coalescent This model of cellular evolution has implications for molecular evolution Coalescent Theory: a retrospective model of population genetics that traces all alleles of a gene in a sample from a population to a single ancestral copy shared by all members of the population, known as the most recent common ancestoralleles genemost recent common ancestor

18 Why is the coalescent the de facto standard today? Alternatives? Current sequences have evolved from the same original sequence (Coalescent) Current sequences have converged to a similar sequence from multiple origins of life

19 Back of the envelop support for ? ACDEFGHIKLMNPQRSTVWY 20 A EDYAHIKLMNPQRGTVWY 20 AAiAAk AAiAAk AAi Convergence Divergence Which is more likely? Back of the envelop support for divergence

20 About the mutational process Point mutations: Transitions (A↔G, C↔T) are more frequent than transversions (all other substitutions) In mammals, the CpG dinucleotide is frequently mutated to TG or CA (possibly related to the fact that most CpG dinucleotides are methylated at the C-residues) Microsatellites frequently increase or decrease in size (possibly due to polymerase slippage during replication) Gene and genome duplications (complete or partial), may lead to: pseudogenes: function-less copies of genes which rapidly accumulate (mostly deleterious) mutations, useful for estimating mutation rates! new genes after functional diversification Chromosomal rearrangements (inversions and translocation), may lead to meiotic incompatibilities, speciation Estimated mutation rates: Human nuclear DNA: 3-5×10 -9 per year Human mitochondrial DNA: 3-5×10 -8 per year RNA and retroviruses: ~10 -2 per year

21 Consequences of the coalescent model?

22 So what if we accept the coalescent model? A1TSRISEIRR A2TSRISEIRR A3TSRISEIRR A4TSRISEIRR A5TSRISEIRR A6TSRISEIRR A7PSRISEIRR A8PKRISEVRR A9PKRISEVRR A10PQRISAIQR A11PQRISAIQR A12PQRISTIQR A13PQRISTIQR A14ASHLHNLQR A15TKHLQELQRE A16TKHLQELQRE A17TKHLQELQRE A18SKHLHELQRD A19PKNLHELQKD A20SKRLHEVQSE A1-6TSRISEIRR A7PSRISEIRR A8-9PKRISEVRR A10-11PQRISAIQR A12-13PQRISTIQR A14ASHLHNLQR A15-17TKHLQELQR A18SKHLHELQR A19PKNLHELQK A20SKRLHEVQS

23 So what if we accept the coalescent model? A1-6TSRI SEI RR A7PSRI SEI RR A8-9PKRI SEVRR A10-11PQRI SAI QR A12-13PQRI STI QR A14ASHLHNLQR A15-17TKHLQELQR A18SKHLHELQR A19PKNLHELQK A20SKRLHEVQS A1-6 A7 A10-11 A12-A13 A’1-7 A’10-13

24 So what if we accept the coalescent model? A’1-7(p-t) SRI S E I RR A8-9 P KRI S E VRR A’10-13 P QRI S(a-t)I QR A14 A SHLH N LQR A15-17 T KHLQ E LQR A18 S KHLH E LQR A19 P KNLH E LQK A20 S KRLH E VQS 4 3324 5 323 The study of sequence alignments can gives information about the evolution of the different organisms!!!!

25 Phylogenetic tree reconstruction, overview Computational challenge : There is an enormous number of different topologies even for a relatively small number of sequences: 3 sequences: 1 4 sequences: 3 5 sequences: 15 10 sequences: 2,027,025 20 sequences: 221,643,095,476,699,771,875 Consequence: Most tree construction algorithm are heuristic methods not guaranteed to find the optimal topology. Input data for two major classes of algorithms: 1.Input data distance matrix, examples UPGMA, neighbor-joining 2. Input data multiple alignment: parsimony, maximum likelihood Distance matrix methods use distances computed from pairwise or multiple alignments as input.

26 Building phylogenetic trees of proteins Genome 1 Genome 2 Genome 3 Genome … Protein AProtein BProtein CProtein D Protein AProtein BProtein C Protein D Protein A Protein BProtein C Protein D …

27 Distance based phylogenetic trees ACTDEEGGGGSRGHI… A-TEEDGGAASRGHI… ACFDDEGGGGSRGHL… … A1 A2 A3 … A1 A2 A3 A1 5 substitutions 3 substitutions A2 A3 8 substitutions A2 A3 A1 3 5

28 Maximum likelihood phylogenetic trees ACTDEEGGGGSRGHI… A-TEEDGGAASRGHI… ACFDDEGGGGSRGHL… … Alignment Probability of aa substitution A - E D … A 1 0.01 0.2 0.09 … - 0.01 1 0.0001 0.0001 … E 0.2 0.0001 1 0.5 D 0.09 0.0001 0.5 1 …

29 Maximum likelihood phylogenetic trees ACTDEEGGGGSRGHI… A-TEEDGGAASRGHI… ACFDDEGGGGSRGHL… … Alignment A1 A2 A3 A1 5 substitutions 3 substitutions A2 A3 8 substitutions p(1,2) p(1,3) p(2,3) p(2,3)>p(1,2)>p(1,3) A1 A3 A2 A3 A1

30 Statistical evaluation of trees: bootstrapping 1 2 5 4 3 7 6 8 Motivation: Some branching patterns in a tree may be uncertain for statistical reasons (short sequences, small number of mutational events) Goal of bootstrapping: To assess the statistical robustness for each edge of the tree. Note that each edge divides the leave nodes into two subsets. For instance, edge 7–8 divides the leaves into subsets {1,2,3} and {4,5}.However, is this short edge statistically robust ? Method: Try to generate tree from subsets of input data as follows: Randomly modify input MSA by eliminating some columns and replacing them by existing ones, This results in duplication of columns. Compute tree for each modified input MSA. For each edge of the tree derived from the real MSA, determine the fraction of trees derived from modified MSAs which contain an edge that divides the leaves into the same subsets. This fraction is called the bootstrap value. Edges with low bootstrap values (e.g. <0.9) are considered unreliable.

31 Statistical evaluation of trees: bootstrapping

32 Other Trees Use genomes Use Enzymomes Use whatever group of molecules are important for a given function

33 Part II: Design principles

34 Outline What are design principles  How to study design principles Examples

35 What are design principles? Recurrent qualitative or quantitative rules that are observed in similar types of systems as a solution to a given functional problem Exist at different levels Nuclear Targeting Sequences Operon Gene 1 Gene 2Gene 3

36 How can design principles emerge in molecular biology? Inteligent design? Not a scientific hypothesis; out of the table Evolution? Makes sense, but how could such regularities emerge?

37 Climbing down mount improbable Overtime, edged stones would accumulate on the slope. Smooth, round, stones accumulate at the bottom. Design Principles: - Smooth, roundish rocks roll down the mountain. - Edged, flat, rocks don’t.

38 Design principles in molecular biology Similarly, if a topology or set of parameters has appeared through mutation and it can be shown to create a molecular network that functionally outperforms all other possible alternatives in a given set of conditions, one can talk about a design principle for the system under those conditions. [sensu engineering]

39 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

40 First step, define the alternatives Gene Regulator + Gene Regulator _ X0X1X2X3

41 First step, define the alternatives X0X1X2X3 X3 t How strong should the feedback be?

42 Then, create models for each alternative Gene Regulator + Gene Regulator _

43 Finally: Compare the dynamic behavior of the models for the two or more alternatives with respect to physiologically relevant criteria.

44 Then, create models for each alternative X0X1X2X3

45 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

46 The demand theory for gene expression Are there situations where positive regulation of gene expression outperforms negative regulation of gene expression and vice versa? Gene Regulator + Gene Regulator _

47 Regulating gene expression has principles Positive regulator: – More effective when gene product in demand for large fraction of life cycle. – Less noise sensitive if signal is low. Negative regulator: – More effective when gene product in demand for small fraction of life cycle. – Less noise sensitive if signal is high. Gene Regulator + Gene Regulator _ Genetics 149:1665; PNAS 103:3999; PNAS 104:7151;Nature 405: 590

48 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

49 Negative overall feedback is a design principle in metabolic biosynthesis X0X1X2X3 Negative overall feedback: – More effective in coupling production to demand. – More robust to fluctuations. Bioinformatics 16:786; Biophysical J. 79:2290

50 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

51 Bifunctional sensors can be a design principle in signal transduction Bifunctional sensor: – Performs best against cross talk Independent deactivator: – Better integrator of signals Mol. Microbiol. 48:25; Mol. Microbiol. 68: 1196 Signal Sensor Efector Deactivator Effect

52 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

53 Gene Regulator _ Design principles in development Gene Regulator + High demand, low signal Signal + High demand, high signal Low demand, high signal Low demand, low signal Signal _ Genetics 149:1665; PNAS 103:3999; PNAS 104:7151;Nature 405: 590

54 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

55 Biological design principles are good to understand why biology works as it does Biological design principles may connect molecular determinants to functional effectiveness. Heat shock Expression of important genes time Growth rate time BMC Bioinformatics 7:184

56 Underlying assumption Evolution of molecular networks can be treated as modules. Work in the group of Uri Alon suggests that – networks evolving to meet simultaneous goals evolve in a modular fashion – Networks evolving to meet a single goal evolve globally Modularity seems like a reasonable first assumption PNAS 102:13773; PLOS Comp Biol 4:e1000206;BMC Evol biol 7: 169

57 The good news about function Sometimes, you get stuff for free!!! For example: – networks that are responsive to signals, just because they are responsive may have inbuilt buffering of noise. – Functions that are associated with marginally stable proteins are favored because due to the large dimensions of sequence space most randomly selected sequences have a structure that is marginally stable. PNAS 100:14463; PNAS 103:6435; Proteins 46:105

58 How can biological design principles be applied? Design of molecular circuits with specific behaviors!! Stable Systems Unstable systems Oscilations Bistable systems Cell 113: 597; PLoS Comput Biol. 5:e1000319; PNAS 106: 6435

59 Index of talk How to identify design principles Design principles in: – Gene expression – Metabolic networks – Signal transduction – Development Design principles, what are they good for? Summary

60 Design principles can be found in molecular networks. Such principles can sometimes be connected to selection for function effectiveness. Even in the absence of such a connection, if they are valid they can be used to build biological circuits with specific behaviors.


Download ppt "Evolution & Design Principles in Biology: a consequence of evolution and natural selection Rui Alves University of Lleida Course Website:http://web.udl.es/usuaris/pg193845/Bioinformatics_2009/"

Similar presentations


Ads by Google