Are we still evolving? Mapping sites of selection in the human genome Simon Myers.

Slides:



Advertisements
Similar presentations
The Five Factors of Evolution
Advertisements

After 13 years of scientist work predominatly in USA & UK the DNA sequence of the human genome was completed in 2003 Any ideas how they did it? What would.
Note that the genetic map is different for men and women Recombination frequency is higher in meiosis in women.
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Gene Expression Levels Are a Target of Recent Natural Selection in the Human Genome Mol. Biol. Evol. 26(3):649– Journal Club
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Signatures of Selection
Outline to SNP bioinformatics lecture
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
14 Molecular Evolution and Population Genetics
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
October 2, 2002Daryl Thomas. October 2, 2002Daryl Thomas Molecular Evolution of FOXP2 Human Language Abilities Highlighted by Comparative Genomics CMPE.
Human Evolution: Searching for Selection Andrew Shah Algorithms in Biology 374 Spring 2008.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Population genetics, comparative genomics, and natural selection Simon Myers.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Simple Nucleotide.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
 Archaeology – “the scientific study of material remains (as fossil relics, artifacts, and monuments) of past human life and activities”  Studies.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
SNPing Lactose By: Mandy Butler, Ying-Tsu Loh and Cheryl Ann Peterson.
Host genetic diversity Genome-wide approaches. Affected sib analysis Take full sibs, preferably of the same sex should share many environmental variables.
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
AP Biology Evolution of Populations AP Biology Populations evolve  Natural selection acts on individuals  differential survival  “survival.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Detection of positive selection in humane genome.
Selectionist view: allele substitution and polymorphism
Evolution of Populations
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
The Interactions of Selection With Genetic Drift Can Be Complicated Because the Changes in p Induced By Drift are Random and Ever-Changing Three Important.
NEW TOPIC: MOLECULAR EVOLUTION.
Molecular evolution Part I: The evolution of macromolecules.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Genomics of Adaptation
A genetic polymorphism in the Drosophila insulin receptor suggests adaptation to climate variation across continents Annalise Paaby a, Mark Blacket b,
Can genes help explain our evolution? - What type of changes (regulatory or structural mutations?) - How many genes are involved?
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Signals of natural selection in the HapMap project data The International HapMap Consortium Gil McVean Department of Statistics, Oxford University.
Katherine S. Pollard Gladstone Institutes, Institute for Human Genetics and Division of Biostatistics - UCSF What makes us human?
11.1 Genetic Variation Within Population KEY CONCEPT A population shares a common gene pool.
EVALUATING EVOLUTIONARY EXPLANATIONS THE SCHOOL NEWSPAPER HAS DECIDED TO INCLUDE A SPECIAL SECTION ON EVOLUTION AND MEDICINE. WE NEED TO HELP THE EDITOR.
The Haplotype Blocks Problems Wu Ling-Yun
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
The evolution of lactose tolerance
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Signatures of Selection
Detection of the footprint of natural selection in the genome
Type 2 Diabetes With type 2 diabetes, your body either resists the effects of insulin — a hormone that regulates the movement of sugar into your cells.
Detection of the footprint of natural selection in the genome
Identifying Recent Adaptations in Large-Scale Genomic Data
Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes  Matthieu Deschamps, Guillaume Laval,
GWAS-eQTL signal colocalisation methods
Reminder The AP Exam registration is open in Naviance. The Exam is on Monday, May 13. I’ll let you know when the next test/homework will be.
A population shares a common gene pool.
Presentation transcript:

Are we still evolving? Mapping sites of selection in the human genome Simon Myers

Targets of selection are important Humans Other species What makes us human? (FOXP2, gene loss) Resistance to pesticides Understand how we adapt to our environment Diet (Lactase, amylase) Mating success Physical environment (SLC24A5, EDAR…) Disease (LARGE, Duffy,…) ?? Pathogen evolution What parts of our genome are functional? (Genes, regulatory regions, siRNAs,….)

Adaptive evolution Time Advantagous mutations arise by chance Once arisen, carriers have more offspring “Positive selection” On average, higher rate of change towards advantageous mutations

Looking for positive selection Direct approach is very difficult –Need to observe trait for long time –Need very strong selection In many cases, need a more indirect approach –Compare genomes among closely related species –Look for “accelerated evolution” –Current day patterns of diversity –Look for “signature of selection”

FOXP2 Gene coding for a transcription factor Mutations in this gene cause speech impairment and other problems (Lai et al., Nature 2001) –Mutation in FOXP2 co-segregates with a disorder in a family in which half of the members have severe speech, linguistic and grammatical difficulties –Translocation in same gene in unrelated individual with similar disorder Are changes in this gene associated with human language development?

Yellow: human lineage mutations (since chimpanzee-human split) Blue: mutations on all other lineages Very conserved gene (top 5% of 1,880 genes) Only 3 non-repeat amino acid changes in 130 million years between human and mouse 2 occurred on human lineage in last 5-6 million years FOXP2 (Enard et al., Nature 2002)

156 synonymous changes, 0 on human lineage 4 non-synonymous changes 2 on human lineage (p= by Fishers exact test) FOXP2 (Enard et al., Nature 2002)

Gene loss CMAH: Loss of enzymes that transform sialic acid –Sugar on cell surface that mediates a variety of recognition events involving pathogenic microbes and toxins Myosin heavy chain –Reduces masticatory muscles? –Associated with gracilization KRTHAP1: –Hair keratin Wang et al (2006)

Is this the answer? Comparative genomics has disadvantages –Need repeated mutations to give power –Tells little about the timescale –Recent research suggests Neanderthals may share FOXP2 mutations with humans (Krause et al., Current Biology 2007) How do we find out if, and where, we’re currently evolving?

Looking for positive selection Direct approach is difficult –Need to observe trait for long time In many cases, need a more indirect approach –Compare genomes among closely related species –Look for “accelerated evolution” –Current day patterns of diversity –Look for “signature of selection”

Variation data and selection Revolution in population genetics Genome-wide datasets –HapMap project –Many unrelated individuals (60 CEU, 60 YRI, 45 JPT and 45 CHB) –Typed at ~4,000,000 loci that vary within population Allow systematic searches for selection –Comparison of interesting regions to genome –Identification of novel candidates for selection

Neutral alleles III III Neutral allele arises Neutral variation Recombination scrambles variation over time e.g. HapMap

The signature of positive selection III III Advantageous allele arises Neutral variation Spreads (sweeps) rapidly through population Recombination has much less time to scramble variation on selected background

The signature of positive selection Neutral mutation at 50% Selected mutation at 50% SelSim (Spencer and Coop, Bioinformatics 2004)

EHH Several authors have developed tests based on similar idea –Sabeti et al. (Nature 2002) –Focus on potentially selected mutation –Measure proportion of haplotypes identical, as a function of distance on either side –Compare selected/nonselected types –Look for signal of “extended haplotype homozygosity” (EHH)

Simulation results (Voight et al.,PloS Biology 2006)

Lactase gene –70% of all humans are lactose intolerant –In Europe, 95% lactose tolerance

Lactase gene DNA variant C/T kb upstream of Lactase gene Predicts lactose persistance (Enattah et al., Nature Genetics 2002) Mutation enhances promoter activity, so probably causal (Olds et al. Hum. Mol. Genet. 2003) Other mutations exist in some groups

EHH around Lactase From Bersaglieri et al. (AJHG, 2004)

EHH around Lactase 5’: p=.012 3’: p<0.0004

From the HapMap paper (Nature, 2005) Human evolution in action Infection by Lassa virus Malaria resistance

A complimentary approach SNPs that are at highly different frequencies across populations are excellent candidates for selection –EDAR (hair follicle development, HapMap paper, Sabeti et al. Nature 2007) –SLC24A5, SLC45A2 (HapMap paper, Lamason et al. Science 2005) –Explored in practical Non-synonymous SNP in FY gene

Conclusions Population genetics provides diverse information about molecular evolution Combining population genetics with knowledge of genomic sequence –New insights into adaptive evolution –Evolution is ongoing, and influenced by local environment –Limited power means we will probably never find all sites of selection Avalanche of variation data being gathered –Will bring many more insights –Presents major challenges in utilising vast and highly informative datasets, whilst keeping analyses computationally tractable

Purifying selection Much of the work of selection is removing disadvantageous alleles Regions performing some useful function (e.g. genes!) evolve more slowly Once again, comparative genomics can help! –Look for regions that are conserved between distantly related species Maladaptive mutation Fewer offspringMutation lost

Identifying conserved regions 5% of genome is “conserved” – but only 1.5% exonic sequence

SNP frequency “spectrum” in CNC’s SNPs are at lower frequencies in CNC’s (p=3x ) Signal is weak – not all CNCs selected? –Stronger near genes –Strongest at very highly conserved elements (Katzman et al., Science 2007) Drake et al. (Nature Genetics, 2005)

Conclusions Population genetics provides diverse information about molecular evolution Combining population genetics with knowledge of genomic sequence –New insights into adaptive evolution –Evolution is ongoing, and influenced by local environment –Limited power means we will probably never find all sites of selection Avalanche of variation data being gathered –Will bring many more insights –Presents major challenges in utilising vast and highly informative datasets, whilst keeping analyses computationally tractable