1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.

Slides:



Advertisements
Similar presentations
How do we know if a population is evolving?
Advertisements

PHYLOGENETIC TREES Bulent Moller CSE March 2004.
Alleles = A, a Genotypes = AA, Aa, aa
A method of quantifying stability and change in a population.
Chapter 2: Hardy-Weinberg Gene frequency Genotype frequency Gene counting method Square root method Hardy-Weinberg low Sex-linked inheritance Linkage and.
THE EVOLUTION OF POPULATIONS
Population Genetics. Mendelain populations and the gene pool Inheritance and maintenance of alleles and genes within a population of randomly breeding.
Sickle Cell Anemia.
14 Molecular Evolution and Population Genetics
Variation in Natural Populations. Overview of Evolutionary Change Natural Selection: variation among individuals in heritable traits lead to variation.
WABI 2005 Algorithms for Imperfect Phylogeny Haplotyping (IPPH) with a Single Homoplasy or Recombnation Event Yun S. Song, Yufeng Wu and Dan Gusfield University.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
Haplotyping via Perfect Phylogeny Conceptual Framework and Efficient (almost linear-time) Solutions Dan Gusfield U.C. Davis RECOMB 02, April 2002.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
CSE182-L18 Population Genetics. Perfect Phylogeny Assume an evolutionary model in which no recombination takes place, only mutation. The evolutionary.
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
CSE182-L17 Clustering Population Genetics: Basics.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
PROCESS OF EVOLUTION I (Genetic Context). Since the Time of Darwin  Darwin did not explain how variation originates or passed on  The genetic principles.
Brachydactyly and evolutionary change
Population Genetics.
Hardy Weinberg: Population Genetics
Chapter 23 Population Genetics © John Wiley & Sons, Inc.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
What evolutionary forces alter
Population Genetics Learning Objectives
Evolution of Populations
Maintaining Genetic Variation (Population Equilibrium) Populations have TWO competing factors: Remaining stable (not evolving) vs Changing (evolving)
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings D.4 High Level Only D.4 The Hardy-Weinberg Principle – D.4.1 Explain how the Hardy-Weinberg.
How do we know if a population is evolving?
Chapter 7 Population Genetics. Introduction Genes act on individuals and flow through families. The forces that determine gene frequencies act at the.
Evolution Chapters Evolution is both Factual and the basis of broader theory What does this mean? What are some factual examples of evolution?
E QUILIBRIA IN POPULATIONS CSE280Vineet Bafna Population data Recall that we often study a population in the form of a SNP matrix – Rows.
CSE280Vineet Bafna CSE280a: Algorithmic topics in bioinformatics Vineet Bafna.
Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.
E QUILIBRIA IN POPULATIONS CSE280Vineet Bafna Population data Recall that we often study a population in the form of a SNP matrix – Rows.
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Association mapping for mendelian, and complex disorders January 16Bafna, BfB.
Population Genetics & Evolution. Population Genetics The study of evolution from a genetic point of view.
CSE280Vineet Bafna In a ‘stable’ population, the distribution of alleles obeys certain laws – Not really, and the deviations are interesting HW Equilibrium.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
The Hardy-Weinberg theorem describes the gene pool of a nonevolving population. This theorem states that the frequencies of alleles and genotypes in a.
Evolution of Populations. Individual organisms do not evolve. This is a misconception. While natural selection acts on individuals, evolution is only.
Evolution of Populations
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Please feel free to chat amongst yourselves until we begin at the top of the hour.
1,3, ,
Population Genetics Measuring Evolutionary Change Over Time.
Equilibria in populations
Measuring Evolutionary Change Over Time
Evolution and Populations –Essential Questions p
Population Genetics.
Hardy-Weinberg Theorem
Evolution as Genetic Change
Modes of Natural Selection
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Evolution and Classification
Linkage, Recombination, and Eukaryotic Gene Mapping
Basic concepts on population genetics
Hardy -- Weinberg.
Population Genetics & Hardy - Weinberg
Process of Evolution.
The Evolution of Populations
Microevolution: Population Genetics and Hardy-Weinberg Equilibrium
Outline Cancer Progression Models
11.1 Genetic Variation within Popln
Presentation transcript:

1 Population Genetics Basics

2 Terminology review Allele Locus Diploid SNP

3 Single Nucleotide Polymorphisms Infinite Sites Assumption: Each site mutates at most once

4 What causes variation in a population? Mutations (may lead to SNPs) Recombinations Other genetic events (gene conversion) Structural Polymorphisms

5 Recombination

6 Gene Conversion Gene Conversion versus crossover – Hard to distinguish in a population

7 Structural polymorphisms Large scale structural changes (deletions/insertions/inversions) may occur in a population.

8 Topic 1: Basic Principles In a ‘stable’ population, the distribution of alleles obeys certain laws – Not really, and the deviations are interesting HW Equilibrium – (due to mixing in a population) Linkage (dis)-equilibrium – Due to recombination

9 Hardy Weinberg equilibrium Consider a locus with 2 alleles, A, a p (respectively, q) is the frequency of A (resp. a) in the population 3 Genotypes: AA, Aa, aa Q: What is the frequency of each genotype If various assumptions are satisfied, (such as random mating, no natural selection), Then P AA =p 2 P Aa =2pq P aa =q 2

10 Hardy Weinberg: why? Assumptions: – Diploid – Sexual reproduction – Random mating – Bi-allelic sites – Large population size, … Why? Each individual randomly picks his two chromosomes. Therefore, Prob. (Aa) = pq+qp = 2pq, and so on.

11 Hardy Weinberg: Generalizations Multiple alleles with frequencies – By HW, Multiple loci?

12 Hardy Weinberg: Implications The allele frequency does not change from generation to generation. Why? It is observed that 1 in 10,000 caucasians have the disease phenylketonuria. The disease mutation(s) are all recessive. What fraction of the population carries the mutation? Males are 100 times more likely to have the “red’ type of color blindness than females. Why? Conclusion: While the HW assumptions are rarely satisfied, the principle is still important as a baseline assumption, and significant deviations are interesting.

13 Recombination

14 What if there were no recombinations? Life would be simpler Each individual sequence would have a single parent (even for higher ploidy) The relationship is expressed as a tree.

15 The Infinite Sites Assumption The different sites are linked. A 1 in position 8 implies 0 in position 5, and vice versa. Some phenotypes could be linked to the polymorphisms Some of the linkage is “destroyed” by recombination

16 Infinite sites assumption and Perfect Phylogeny Each site is mutated at most once in the history. All descendants must carry the mutated value, and all others must carry the ancestral value i 1 in position i 0 in position i

17 Perfect Phylogeny Assume an evolutionary model in which no recombination takes place, only mutation. The evolutionary history is explained by a tree in which every mutation is on an edge of the tree. All the species in one sub-tree contain a 0, and all species in the other contain a 1. Such a tree is called a perfect phylogeny.

18 The 4-gamete condition A column i partitions the set of species into two sets i 0, and i 1 A column is homogeneous w.r.t a set of species, if it has the same value for all species. Otherwise, it is heterogenous. EX: i is heterogenous w.r.t {A,D,E} i A 0 B 0 C 0 D 1 E 1 F 1 i0i0 i1i1

19 4 Gamete Condition – There exists a perfect phylogeny if and only if for all pair of columns (i,j), either j is not heterogenous w.r.t i 0, or i 1. – Equivalent to – There exists a perfect phylogeny if and only if for all pairs of columns (i,j), the following 4 rows do not exist (0,0), (0,1), (1,0), (1,1)

20 4-gamete condition: proof Depending on which edge the mutation j occurs, either i 0, or i 1 should be homogenous. (only if) Every perfect phylogeny satisfies the 4- gamete condition (if) If the 4-gamete condition is satisfied, does a prefect phylogeny exist? i0i0 i1i1 i

21 An algorithm for constructing a perfect phylogeny We will consider the case where 0 is the ancestral state, and 1 is the mutated state. This will be fixed later. In any tree, each node (except the root) has a single parent. – It is sufficient to construct a parent for every node. In each step, we add a column and refine some of the nodes containing multiple children. Stop if all columns have been considered.

22 Inclusion Property For any pair of columns i,j – i < j if and only if i 1  j 1 Note that if i<j then the edge containing i is an ancestor of the edge containing i i j

23 Example A B C D E r A BCDE Initially, there is a single clade r, and each node has r as its parent

24 Sort columns Sort columns according to the inclusion property (note that the columns are already sorted here). This can be achieved by considering the columns as binary representations of numbers (most significant bit in row 1) and sorting in decreasing order A B C D E

25 Add first column In adding column i – Check each edge and decide which side you belong. – Finally add a node if you can resolve a clade r A B C D E A B C D E u

26 Adding other columns Add other columns on edges using the ordering property r E B C D A A B C D E

27 Unrooted case Switch the values in each column, so that 0 is the majority element. Apply the algorithm for the rooted case

28 Handling recombination A tree is not sufficient as a sequence may have 2 parents Recombination leads to loss of correlation between columns

29 Linkage (Dis)-equilibrium (LD) Consider sites A &B Case 1: No recombination – Pr[A,B=0,1] = 0.25 Linkage disequilibrium Case 2:Extensive recombination – Pr[A,B=(0,1)=0.125 Linkage equilibrium AB AB

30 Handling recombination A tree is not sufficient as a sequence may have 2 parents Recombination leads to loss of correlation between columns

31 Recombination, and populations Think of a population of N individual chromosomes. The population remains stable from generation to generation. Without recombination, each individual has exactly one parent chromosome from the previous generation. With recombinations, each individual is derived from one or two parents. We will formalize this notion later in the context of coalescent theory.