DUF1220 Domains & the Search for the Genes that Made Us Human James M. Sikela, Ph.D. Human Medical Genetics, Neuroscience, & Comparative Genomics Programs,

Slides:



Advertisements
Similar presentations
Human Evolutionary Genomics: Lessons from DUF1220 Protein Domains, Cognitive Disease and Human Brain Evolution James M. Sikela, Ph.D. Department of Biochemistry.
Advertisements

Whole Genome Duplications (Polyploidy) Made famous by S. Ohno, who suggested WGD can be a route to evolutionary innovation (focusing on neofunctionalization)
Number of mitoses in females = 22 Number of mitoses in males is age dependent.
KEY CONCEPT Humans appeared late in Earth’s history.
Primate and Human evolution
Molecular Clock I. Evolutionary rate Xuhua Xia
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
02_13.jpg Human chromosome 4 02_15.jpg 02_15_2.jpg.
An Update in Genetics of Epilepsy
Introduction to Biological Sequences. Background: What is DNA? Deoxyribonucleic acid Blueprint that carries genetic information from one generation to.
How is the amino acid sequence determined?
Topics covered Overview of similarities between the genetic makeup of humans and chimpanzees. Comparison of brain and speech genes between humans and.
Pharmacogenomics and personalized medicines Jean-Marie Boeynaems
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
Dr Katie Snape Specialist Registrar in Genetics St Georges Hospital
Large-Scale Copy Number Polymorphism in the Human Genome J. Sebat et al. Science, 305:525 Luana Ávila MedG 505 Feb. 24 th /24.
Manifestation of Novel Social Challenges of the European Union in the Teaching Material of Medical Biotechnology Master’s Programmes at the University.
Molecular Clock. Rate of evolution of DNA is constant over time and across lineages Resolve history of species –Timing of events –Relationship of species.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Evolution: Fact and Theory  Fact: Species change over time.  Theory: Species arise from common descent through natural selection  Random mutations lead.
Large Scale Variation Among Human and Great Ape Genomes Determined by Array Comparative Genomic Hybridization Devin P. Locke, Richard Segraves, Lucia Carbone,
Lessons from DUF1220 Protein Domains, Cognitive Disease and Human Brain Evolution II James M. Sikela, Ph.D. Department of Biochemistry & Molecular Genetics.
Principles of evolution, our heritage and the Origins of Life What was life like a long time ago? How did we come into being? How do we know this? What.
CS177 Lecture 10 SNPs and Human Genetic Variation
Questions. 09_12_Mutation.jpg Gene Evolution Pages
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Nature Genetics Vol.36 Sept 2004 Detection of Large-scale Variation In the Human Genome Iafrate, Feuk, Rivera, Listewnik, Donahoe, Qi, Scherer, Lee any.
PRIMATE EVOLUTION DC Biology Bill Palmer.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Human Evolution Part I - Primates. “To understand the story of evolution, we must understand both our ancestors and our relationships to our closest living.
SHI Meng. Abstract Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses.
Identification of Copy Number Variants using Genome Graphs
Hominid Evolution Human Evolution. Objectives Identify the characteristics that all primates share. Describe the major evolutionary groups of primates.
Developmental Constraints, Genetic Correlations & Natural Selection.
Vertebrates Hair Mammary Glands Amniotic Egg Endothermy Four Limbs
Evolution is change over time The scientific theory of evolution explains how living things descended from earlier organisms. What is Evolution? Think.
Genomes & The Tree of Life
Darwin’s Discovery and Chimps and Humans.   BI.7 a. Students know why natural selection acts on the phenotype rather than the genotype of an organism.
Human Evolution Darwin’s Discovery and Chimps and Humans.
Can genes help explain our evolution? - What type of changes (regulatory or structural mutations?) - How many genes are involved?
Chapter 2 Genetic Variations. Introduction The human genome contains variations in base sequence from one individual to another. Some sequence variants.
Vertebrates Hagfish, Lampreys Cartilagenous Fish Bony Fish Amphibians Mammals Birds Reptiles Skull Backbone Paired Fins Jaws Bony Skeleton Swim Bladder.
Primate Evolution Section 16.1 Primates. Daily Objective Understand that Primates share several behavioral and biological characteristics, which indicates.
Major Primate Groups Lemurs Tarsiiformes New World Monkeys Old World Monkeys Gibbons Orangutan Gorilla Chimp Human Family Hominidae 25 mya 12 mya 6 mya.
Darwin’s Discovery and Chimps and Humans. BI.7 a. Students know why natural selection acts on the phenotype rather than the genotype of an organism. BI.
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
Katherine S. Pollard Gladstone Institutes, Institute for Human Genetics and Division of Biostatistics - UCSF What makes us human?
Mus musculus 85% homology with human DNA Genome of appx. 3 million base pairs.
5.4 Cladistics The images above are both cladograms. They show the statistical similarities between species based on their DNA/RNA. The cladogram on the.
Published primate genome sequences - I Published primate genome sequences - II.
Primates and Human Origins Ch PRIMATES/PRIMATA (order) means FIRST Increased ability to use eyes and front limbs to perform tasks Binocular vision,
Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. By Chris Paine
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are.
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are both.
Darwin’s Discovery and Chimps and Humans
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are both.
Adaptation: from genes to traits
Evidence for Evolution.
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
Genomes and Their Evolution
Biological Bases of Behavior Northwest High School
Gene duplications: evolutionary role
Bellringer Answer on your bellringer sheet:
Lamarck V. Darwin Theories of Evolution
Laura J. Dumas, Majesta S. O’Bleness, Jonathan M. Davis, C
First Draft of Chimpanzee Genome
Laura J. Dumas, Majesta S. O’Bleness, Jonathan M. Davis, C
5.4 Cladistics Essential idea: The ancestry of groups of species can be deduced by comparing their base or amino acid sequences. The images above are both.
Presentation transcript:

DUF1220 Domains & the Search for the Genes that Made Us Human James M. Sikela, Ph.D. Human Medical Genetics, Neuroscience, & Comparative Genomics Programs, Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine Genomics Course February 28, 2012 James M. Sikela, Ph.D. Human Medical Genetics, Neuroscience, & Comparative Genomics Programs, Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine Genomics Course February 28, 2012

Key Points First gene-based and first genome-wide study of lineage-specific gene duplication and loss in human and primate evolution Dramatic human-specific increase in copy number of DUF1220 protein domains DUF1220 copy number linked to evolution of brain size Selection of evolutionarily adaptive genome sequences may be driving disease, e.g. 1q21.1 First gene-based and first genome-wide study of lineage-specific gene duplication and loss in human and primate evolution Dramatic human-specific increase in copy number of DUF1220 protein domains DUF1220 copy number linked to evolution of brain size Selection of evolutionarily adaptive genome sequences may be driving disease, e.g. 1q21.1

Primate Evolution New World Monkeys (e.g. squirrel monkey,spider monkey) Old World Monkeys (e.g. baboon, rhesus, etc.) Gibbons Orangutan Gorilla Human Chimp Bonobo B/C = ~ 2 C/H = ~ 5 HC/G = ~ 8 HCG/O = ~ 13 HCG/O/Gib = ~20 Hom/OWM = ~ 25 HomOWM/NW = ~ MYA 25 MYA 20 MYA 13 MYA 8 MYA 5 MYA 2 MYA

Chimpanzee Gorilla Bonobo Orangutan

More Primates! ---- something has changed!

Human Characteristics Body shape and thorax Cranial properties (brain case and face) Small canine teeth Skull balanced upright on vertebral column Reduced hair cover Enhanced sweating Dimensions of the pelvis Elongated thumb and shortened fingers Relative limb length Body shape and thorax Cranial properties (brain case and face) Small canine teeth Skull balanced upright on vertebral column Reduced hair cover Enhanced sweating Dimensions of the pelvis Elongated thumb and shortened fingers Relative limb length Neocortex expansion Enhanced language & cognition Advanced tool making Neocortex expansion Enhanced language & cognition Advanced tool making modified from S. Carroll, Nature, 2005

Reports of “human-specific” genes FOXP2 –Mutated in family with language disability ASPM/MCPH –Mutated in individuals with microcephaly HAR1F –Gene sequence highly changed in humans DUF1220 protein domains –Highly increased in copy number in humans; expressed in important brain regions FOXP2 –Mutated in family with language disability ASPM/MCPH –Mutated in individuals with microcephaly HAR1F –Gene sequence highly changed in humans DUF1220 protein domains –Highly increased in copy number in humans; expressed in important brain regions

Molecular Mechanisms Underlying Genome Evolution Single nucleotide substitutions - change gene expression & structure Genome rearrangements Gene duplication - copy number change: gene dosage - redundancy as a facilitator of innovation Single nucleotide substitutions - change gene expression & structure Genome rearrangements Gene duplication - copy number change: gene dosage - redundancy as a facilitator of innovation

Gene Duplication & Evolutionary Change “There is now ample evidence that gene duplication is the most important mechanism for generating new genes and new biochemical processes that have facilitated the evolution of complex organisms from primitive ones.” - W. H. Li in Molecular Evolution, 1997 “Exceptional duplicated regions underlie exceptional biology” - Evan Eichler, Genome Research 11: , 2001

Fig 1. Measuring genomic DNA copy number alteration using cDNA microarrays (array CGH). Fluorescence ratios are depicted in a pseudocolor scale, such that red indicates increased, and green decreased, gene copy number in the test (right) compared to reference sample (left). Interhominoid cDNA Array-Based Comparative Genomic Hybridization (aCGH)

Experimental Design Carry out pairwise cDNA aCGH comparisons between human and other hominoid species Use a >39,000 cDNA microarray representing >29,000 human genes Hybridize human genomic DNA (reference sequence: cy3/green) and other hominoid genomic DNAs (test sequence: cy5/red) simultaneously to the microarray Visualize aCGH signals “gene-by-gene” along each chromosome across five species: human (n=5), bonobo (n=3), chimpanzee (n=4), gorilla (n=3) and orangutan (n=3) Carry out pairwise cDNA aCGH comparisons between human and other hominoid species Use a >39,000 cDNA microarray representing >29,000 human genes Hybridize human genomic DNA (reference sequence: cy3/green) and other hominoid genomic DNAs (test sequence: cy5/red) simultaneously to the microarray Visualize aCGH signals “gene-by-gene” along each chromosome across five species: human (n=5), bonobo (n=3), chimpanzee (n=4), gorilla (n=3) and orangutan (n=3)

Whole Genome Caryoscope Image of Interhominoid aCGH Data

Human & Great Ape Genes Showing Lineage-Specific Copy Number Gain/Loss Fortna, et al, PLoS Biol. 2004

Summary of Human/Primate ArrayCGH Results First genome-wide and first gene-based aCGH comparison of human and nonhuman primate gene copy number variation (Fortna, et al 2004) 1,004 (4,159) genes identified that showed lineage- specific changes in copy number Time machine of evolutionary copy number change Gene candidates to underlie lineage-specific traits Genes identified represent most of major lineage- specific gene duplications and losses over the last 60 million years of human and primate evolution (Dumas, et al 2007) First genome-wide and first gene-based aCGH comparison of human and nonhuman primate gene copy number variation (Fortna, et al 2004) 1,004 (4,159) genes identified that showed lineage- specific changes in copy number Time machine of evolutionary copy number change Gene candidates to underlie lineage-specific traits Genes identified represent most of major lineage- specific gene duplications and losses over the last 60 million years of human and primate evolution (Dumas, et al 2007)

Human & Great Ape Genes Showing Lineage-Specific Copy Number Gain/Loss Fortna, et al, PLoS Biol. 2004

“This (Fortna, et al, 2004) is the first time that copy number changes among apes have been assayed for the vast majority of human genes, and we can expect that the biological consequences of the 140 human- specific copy number changes identified in this study will be heavily investigated over the coming years. “ ---M. Hurles, PLoS Biol M. Hurles, PLoS Biol. 2004

DUF1220 Repeat Unit Popesco, et al, Science 2006

InterPro-predicted DUF1220-containing proteins (NBPF family*) * Vandepoule, et al, Mol. Biol. & Evol, 2005

Q-PCR Predicted Copy Number Copy Number of DUF1220 (Q8IX62/17-33) Sequences in Primate Species Human BonoboChimp Gorilla Orangutan GibbonMacaqueBaboon

Summary of aCGH, Q-PCR and BLAT results: DUF1220 domains are highly amplified in human, reduced in great apes, further reduced in Old & New World monkeys, single or low copy non-primate mammals and absent in non-mammals

DUF1220 copy number in Animal Genomes GenomePDE4DIP DUF1220 Total DUF1220 NBPF genes Human Chimp Gorilla39915 Orangutan49211 Macaque13510 Marmoset13010 Rabbit183 Mouse110 Rat110 Guinea Pig110 GenomePDE4DIP DUF1220 Total DUF1220 NBPF genes Cow162 Pig131 Horse183 Dog131 Panda121 Opposum110Euarchotanglines Laurasiatheria Afrotheria Metatheria Elephant111Prototheria Platypus110 Chicken000 Lizard000 Frog000 Zebrafish000 Other Vertebrates A total of 40 genomes were searched, but only the 22 with 4X coverage or higher are displayed.

DUF1220 Copy Number Statistics in hg19 build DUF1220 Copies Total in Human Genome272 Total amplified HLS DUF1220 Triplets129 Total DUF1220 in Last Common Ancestor of Homo/Pan102 Total of Newly Added Copies in Human Lineage167 Total Copies Added via Domain Amplification146 Total Copies Added via Gene Duplication21 Average Number Added to Human Lineage every million years28 This table shows the unprecedented DUF1220 copy number increase in the human lineage. The primary mechanism for this expansion was domain amplification via hyper-amplification of the HLS DUF1220 triplet.

Sequences encoding DUF1220 domains Show a major copy number burst in primates Are increasingly amplified generally as a function of a species evolutionary proximity to humans, where the greatest number of copies (270) is found Show signs of positive selection Are highly expressed in brain regions associated with higher cognitive function In brain show neuron-specific expression preferentially in cell bodies and dendrites Show a major copy number burst in primates Are increasingly amplified generally as a function of a species evolutionary proximity to humans, where the greatest number of copies (270) is found Show signs of positive selection Are highly expressed in brain regions associated with higher cognitive function In brain show neuron-specific expression preferentially in cell bodies and dendrites Popesco, et al, Science 2006

Recurrent Reciprocal 1q21.1 Deletions and Duplications Associated with Microcephaly or Macrocephaly and Developmental and Behavioral Abnormalities Brunetti-Pierri, et al, Nature Genetics 2008 Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes Mefford, et al, N. Engl. J. Med Recurrent Reciprocal 1q21.1 Deletions and Duplications Associated with Microcephaly or Macrocephaly and Developmental and Behavioral Abnormalities Brunetti-Pierri, et al, Nature Genetics 2008 Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes Mefford, et al, N. Engl. J. Med q21.1 Deletions* Linked to Microcephaly 1q21.1 Duplications* Linked to Macrocephaly 1q21.1 Deletions* Linked to Microcephaly 1q21.1 Duplications* Linked to Macrocephaly We note that these CNVs encompass or are immediately flanked by DUF1220 sequences (Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009) *Implies human brain size directly related to the dosage of one or more genes in these 1q21.1 CNVs

DUF1220/NBPF Sequences & Recurrent Disease-associated 1q21.1 CNVs

Association (p<0.0001) of human head circumference (FOC Z-score) & DUF1220 copy number

Copy number of genes in the 1q21.1-q21.2 region versus brain size 46 1q21.1 genes compared along with brain size across 5 primate species DUF1220 shows the most dramatic human-specific copy number increase. The evolutionary increase in DUF1220 copy number parallels the increase in brain size. HumanChimpOrangutanMacaqueMamoset Brain Size (g) Copy # DUF PPIAL LOC FAM72D20000 SRGAP10000 PDE4DIP33411 SEC22B11111 NOTCH2NL11111 HFE TXNIP11111 POLR ANKRD ANKRD LIX1L11111 RBM8A11111 GNRHR PEX11B11111 ITGA NUDT RNF CD PDZK GPR PRKAB PDIA3P11111 FMO CHD1L11111 BCL ACP GJA GJA LOC FCGR SV2A11111 BOLA MTMR OTUD7B11110 SF3B VPS PLEKHO ANP32E11111 PRPF C1orf MRPS CA C1orf APH1A11111

DUF1220 Copy Number Versus Brain Size * Neandertal DUF1220 copy number is estimate based on sequence read depth from the Neandertal genome (Green et al 2010). -but correlation is not causation

Factors that must be reconciled with model linking 1q21.1 instability, evolutionary adaptation & recurrent disease Evolutionarily rapid DUF1220 copy number increase –Estimate, on average, 28 more DUF1220 domains added to human genome every 1 million years since Homo/Pan split Underlying mechanism must account for continued, recurrent DUF1220 increases Underlying mechanism must account for excess of 1q21.1 disease-associated CNVs containing dosage- sensitive genes Evolutionarily rapid DUF1220 copy number increase –Estimate, on average, 28 more DUF1220 domains added to human genome every 1 million years since Homo/Pan split Underlying mechanism must account for continued, recurrent DUF1220 increases Underlying mechanism must account for excess of 1q21.1 disease-associated CNVs containing dosage- sensitive genes

Increased 1q21.1 Instability Increase in DUF1220 Copy Number Evolutionary Advantage (Increase in Brain Size?) Proposed Mechanism Linking DUF1220, Brain Evolution and Disease 1q21.1 duplications Macrocephaly; Autism* 1q21.1 duplications Macrocephaly; Autism* 1q21.1 deletions Microcephaly; Schizophrenia* 1q21.1 deletions Microcephaly; Schizophrenia* *Diseases proposed as “Diametric Opposites” (including brain size), Crespi, Stead & Elliot, PNAS, 2009

DUF1220 Model* DUF1220 model proposes that: 1) DUF1220 copy number is directly involved in influencing human brain size, and 2) the evolutionary advantage of rapidly increasing DUF1220 copy number in the human lineage has resulted in favoring retention of the high genomic instability of the 1q21.1 region which, in turn, has precipitated a spectrum of recurrent human brain and developmental disorders DUF1220 model proposes that: 1) DUF1220 copy number is directly involved in influencing human brain size, and 2) the evolutionary advantage of rapidly increasing DUF1220 copy number in the human lineage has resulted in favoring retention of the high genomic instability of the 1q21.1 region which, in turn, has precipitated a spectrum of recurrent human brain and developmental disorders *Dumas & Sikela, Cold Spring Harbor Symposium Quant. Biol., 2009

Concluding Thoughts DUF1220 domains shows the largest HLS protein coding copy number increase in the genomeDUF1220 domains shows the largest HLS protein coding copy number increase in the genome –But no one gene made us human –DUF1220 genotyping challenges We know more about our genome than everWe know more about our genome than ever –But there are vast areas of our genome about which we know virtually nothing –No mammalian genome has been completely sequenced

Acknowledgements Sikela Lab Laura Dumas Majesta O’Bleness Maggie Popesco Erik MacLaren Andy Fortna Jan Hopkins Jonathon Keeney Jack Davis Jay Jackson Megan Sikela Michael Cox Kriste Marshall Matt Brenton Sonya Burgers Raquel Hink Erin Dorning Park McNair Sikela Lab Laura Dumas Majesta O’Bleness Maggie Popesco Erik MacLaren Andy Fortna Jan Hopkins Jonathon Keeney Jack Davis Jay Jackson Megan Sikela Michael Cox Kriste Marshall Matt Brenton Sonya Burgers Raquel Hink Erin Dorning Park McNair Collaborators Stanford –Jon Pollack –Young Kim Univ. of Kansas - Gerald Wyckoff Univ of Utah –Lynn Jorde Baylor College –Pawel Stankiewicz –Sau Wai Cheng UCSOM –Epidemiology Tasha Fingerlin –Preventive Medicine & Biometrics Anis Karimpour-Fard –Neuroscience Program Rock Levinson John Caldwell Collaborators Stanford –Jon Pollack –Young Kim Univ. of Kansas - Gerald Wyckoff Univ of Utah –Lynn Jorde Baylor College –Pawel Stankiewicz –Sau Wai Cheng UCSOM –Epidemiology Tasha Fingerlin –Preventive Medicine & Biometrics Anis Karimpour-Fard –Neuroscience Program Rock Levinson John Caldwell

A Walk Through Our Genome --All regions of the genome are not created equal