MOLECULAR EVOLUTION Molecular evolution examines DNA and proteins, addressing two types of questions: How do DNA and proteins evolve? How are genes and.

Slides:



Advertisements
Similar presentations
Evolution of genomes.
Advertisements

Genetica per Scienze Naturali a.a prof S. Presciuttini Homologous genes Genes with similar functions can be found in a diverse range of living things.
Classification of Living Things. 2 Taxonomy: Distinguishing Species Distinguishing species on the basis of structure can be difficult  Members of the.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
14 Molecular Evolution and Population Genetics
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Chapter 17 From Gene to Protein. Gene Expression The process by which DNA directs the synthesis of proteins 2 stages: transcription and translation Detailed.
BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG
How is the amino acid sequence determined?
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
From Gene To Protein Chapter 17. The Connection Between Genes and Proteins Proteins - link between genotype (what DNA says) and phenotype (physical expression)
Molecular Clock. Rate of evolution of DNA is constant over time and across lineages Resolve history of species –Timing of events –Relationship of species.
Chapter 17 From Gene to Protein.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
5. Point mutations can affect protein structure and function
Mutations.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Copyright © 2007 Pearson Education Inc., publishing as Pearson Benjamin Cummings Bio11 – Announcements TODAY – Genetics (review) and quiz (CP #4) Structure.
The Biology and Genetic Base of Cancer. 2 (Mutation)
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Chapter 17 From Gene to Protein. Gene Expression DNA leads to specific traits by synthesizing proteins Gene expression – the process by which DNA directs.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Spliceosome attachs to hnRNA and begins to snip out non-coding introns mRNA strand composed of exons is free to leave the nucleus.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Mutations are changes in the genetic material of a cell or virus
Chapter 9 From DNA to Protein.
Ch. 17 From Gene to Protein. Genes specify proteins via transcription and translation DNA controls metabolism by directing cells to make specific enzymes.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Fig b6 Template strand RNA primer Okazaki fragment Overall direction of replication.
GENE EXPRESSION What is a gene? Mendel –Unit of inheritance conferring a phenotype Modern definition –Unit of DNA directing the synthesis of a polypeptide.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
NEW TOPIC: MOLECULAR EVOLUTION.
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
DNA  RNA  protein Copyright © 2008 Pearson Education Inc., publishing as Pearson Benjamin Cummings.
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
RNA processing and Translation. Eukaryotic cells modify RNA after transcription (RNA processing) During RNA processing, both ends of the primary transcript.
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Overview: The Flow of Genetic Information The information content of DNA is in.
PROTEIN SYNTHESIS. CENTRAL DOGMA OF MOLECULAR BIOLOGY: DNA is used as the blueprint to direct the production of certain proteins.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Lesson Four Structure of a Gene.
Causes of Variation in Substitution Rates
The neutral theory of molecular evolution
Pipelines for Computational Analysis (Bioinformatics)
Chapter 25 Using the Genetic Code
In-Text Art, Ch. 16, p. 316 (1).
Transfer of RNA molecules serve as interpreters during translation
Genomes and Their Evolution
Distances.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
What are the Patterns Of Nucleotide Substitution Within Coding and
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Chapter 17 From Gene to Protein.
Chapter 17 From Gene to Protein.
KEY CONCEPT Molecular clocks provide clues to evolutionary history.
Phylogenics & Molecular Clocks
CHAPTER 17 FROM GENE TO PROTEIN.
Chapter 6 Clusters and Repeats.
Translation and Mutation
Section 20.4 Mutations and Genetic Variation
Presentation transcript:

MOLECULAR EVOLUTION Molecular evolution examines DNA and proteins, addressing two types of questions: How do DNA and proteins evolve? How are genes and organisms evolutionarily related?

Applications Reveal dynamics of evolutionary processes. Indicate chronology of change. Identify phylogenetic relationships.

Alignment of two sequences Number of aligned positions = 23

Sequence Alignments Matching nucleotides are interpreted as unchanged since a common ancestor. Substitutions, insertions, and deletions can be identified. Gaps inserted to maximize the similarity between aligned sequences indicate occurrence of insertions and deletions (indels).

Optimal alignment Many alignments are possible between sequences, and algorithms typically maximize the matching number of amino acids or nucleotides, invoking the smallest possible number of indel events.

Substitutions When DNA sequences diverge, they begin to collect mutations. The number of substitutions (P) found in an alignment is widely used in molecular evolution analysis.

An exemplary alignment Number of aligned positions = 23 Number of different positions (P) = 8

Number of substitutions If the alignment shows few substitutions, a simple count is used. If many substitutions occurred, it is likely that a simple count will underestimate the substitution events, due to the probability of multiple changes at the same site.

Jukes and Cantor Model They assumed that each nucleotide is equally likely to change into any other nucleotide, and created a mathematical model to describe multiple base substitutions. What other models could be developed?

Jukes and Cantor model K= -(3/4)*ln (1-(4/3)*P) P= observed number of substitutions over the total number of sites. K=distance between sequence x and sequence y expressed as the number of changes per site corrected for multiple substitutions at the same site natural log (ln) corrects for the underestimation of substitutions). ¾ and 4/3 are terms reflect that there are four types of nucleotides and three ways in which a second nucleotide may be substituted with.

P = 8/23 = K = -(3/4)*ln(1-4/3*P) = Observed distance P = increases when Jukes Cantor Model is used to correct for the multiple substitutions. Calculation of distance (K) between sequences

Correction for multiple substitutions If two sequences are 95% identical, then P = 0.05; and K = = If two sequences are only 50% identical, then P = 0.5; and K = – 0.5 = 0.324

Rates of nucleotide substitutions Substitutions accumulate independently and simultaneously in different sequences. Substitution rate, R, can be calculated by dividing the distance (K) between two homologous sequences by 2T, where T is the divergence time. R = K/(2T).

Example The following sequences represent an optimum alignment of the first 50 nucleotides of human and sheep preproinsulin genes, which last shared a common ancestor 80 million years ago: Human: ATGGCCTGT GGATGCGCCT CCTGCCCCTG CTGGCGCTGC TGGCCCTCTG Sheep: ATGGCCTGT GGACACGCCT GGTGCCCCTG CTGGCCCTGC TGGCACTCTG

Example Human: ATGGCCTGT GGATGCGCCT CCTGCCCCTG CTGGCGCTGC TGGCCCTCTG Sheep: ATGGCCTGT GGACACGCCT GGTGCCCCTG CTGGCCCTGC TGGCACTCTG P = 6/50 = 0.12 (observed) K = -(3/4)ln(1-(4/3)(0.12)) = Estimated number of substitutions = 50 x = 6.56 R = K/(2T) = /(2 x 80 x 10 6 ) = x /year

Degenerate Code Codons are degenerate. Of 20 amino acids, 18 are encoded by more than one codon. Met (AUG) and Trp (UGG) are the exceptions; all other correspond to a set of two or more codons. Codon sets often show a pattern in their sequences; variation at the third position is most common.

Degenerate Code The code has start and stop signals. AUG, the start signal for protein synthesis. Stop codons have no corresponding tRNA (UAG, amber; UAA, ochre; UGA, opal). Wobble occurs in the anticodon. The 3 rd base in the codon is able to base-pair less specifically, because it is less constrained three dimensionally.

Patterns and Modes of Substitutions Patterns of variation within homologous genes show that some amino acid substitutions are found more frequently than others.

Patterns and Modes of Substitutions Substitutions often involve amino acids with similar chemical characteristics, supporting two evolutionary principles: Mutations are rare events Most dramatic changes are removed by natural selection.

Patterns and Modes of Substitutions Chemically similar amino acids tend to have similar codons, and so may result from a single mutation. Natural selection acting on this variation produces proteins optimized for role and environment. More substantial alterations of protein structure are likely to be deleterious and removed from gene pool.

Synonymous and non- synonymous sites Synonymous changes, which do not alter the amino acids in the protein, are found five times more often than non-synonymous changes.

Synonymous and non- synonymous sites –Both types of change are equally likely to occur, but non-synonymous changes are usually detrimental to fitness, and are eliminated by natural selection. Mutations are changes in nucleotide sequences due to errors in replication or repair. Substitutions are mutations that have passed through the filter of selection.

Mutations vs. substitutions Would the mutation rate would be greater or less than the observed substitution rate, for example x is for preproinsulin gene. YES Why?

Variation in evolutionary rates within genes Studies show that different regions of genes evolve at different rates. Distinctions are seen between and within coding and non-coding regions. Examples of non-coding regions include introns, leaders, non-transcribed flanking regions, pseudogenes.

Relative rates of evolutionary change in mammals SequenceR (x ) Functional genes 5’ flanking region2.36 CDS, synonymous4.65 CDS, nonsynonymous0.88 Intron3.70 3’ flanking region4.46 Pseudogenes4.85

Flanking regions and introns Changes in 3’ sequences have no known effect on the amino acid sequence; so most substitutions are tolerated. Rate of substitutions are high in introns but not as high as in synonymous of CDS. 5’ untranslated regions have low rates: they contain regulatory regions for transcription.

Pseudogenes Highest rate of evolution is that of nonfunctional pseudogenes, which no longer code for proteins. What advantage pseudogenes provide for evolution of multiple gene families?

Coding sequences with high rates of nonsynonymous substitution Major histocompatibility complex (MHC) in mammals If there is evolutionary pressure for diversity, substitutions become advantageous. MHC is involved in immune function where diversity favors fewer individuals vulnerable to an infection by any single virus. Viruses utilize error-prone replication coupled with diversifying selection. Both viruses and MHC complex rapidly evolves due to natural selection for diversification.

Ribosomal RNAs Sequences of rRNA regions that interact and provide for ribosomal function by pairing will be subject to mutation at the same rates as sequences that do not pair. However, mutations that disrupt pairing will be selected against, since such mutations will alter ribosomal function and become detrimental to fitness.

Mitochondrial DNA (mtDNA) Mammalian mitochondrial genome contains a circular, double-stranded mtDNA about bp long (1/10000 of the nuclear genome, encoding 2 rRNAs, 22 tRNAs, and 13 proteins). The average synonymous substitution rate in mammalian mitochondria is 5.7 x 10- 8/site/year, 10 times higher than the synonymous substitutions in nuclear genes.

Mitochondrial DNA (mtDNA) The higher rates of mutation in mtDNA are likely to be due to: The higher error rate during mtDNA replication and repair. mtDNA polymerases have no proofreading ability. Higher concentrations of mutagens such as free radicals resulting from metabolic processes. Less selective pressure because there are many of them within the cell; changes are less detrimental.

Maternal transmission Clonal inheritance from mother, when the mother’s egg contributes to the zygote. So no meiosis occurs, all offspring will have the same mtDNA from the same mother. Study matriarchal lineages can be traced allowing examination of family structure. Example: geographic variation in mtDNA sequences of pocket gophers in south eastern USA.

Peter J. Russell, iGenetics: Copyright © Pearson Education, Inc., publishing as Benjamin Cummings. Letters are different mtDNA types grouped according to similarity, and are superimposed on a geographic map of the collection sites. The tick marks across connecting lines are the number of mutations. Lineage relationships among mtDNA types in pocket gophers

mtDNA or nuclear DNA Suppose you are studying human migrational patterns? Would you use mtDNA or nuclear genes to estimate how long ago humans moved from a particular place to another?

mtDNA or nuclear DNA? Since the time scale is on the order of tens of thousands years, and mtDNA accumulate more mutations than nuclear DNA, mtDNA will provide more information about the differences between human populations geographically separated. What about if you want to study the phylogenetic relationships of mammalian species that diverged 80 million years ago? (HINT: multiple substitutions)

Molecular Clock Suggests that rates of molecular evolution for loci with similar functional constraints are uniform during the time period after divergence from a common ancestor (Fossil record).

The molecular clock for alpha- globin

Rates of amino acid replacement in proteins

Peter J. Russell, iGenetics: Copyright © Pearson Education, Inc., publishing as Benjamin Cummings. Fig The molecular clock runs at different rates in different proteins

Molecular Phylogeny Organisms are similar at the molecular level are expected to be more closely related than dissimilar organisms. Phylogenetic relationships among living things are inferred from molecular similarity.