Nucleotide variation in the human genome

Slides:



Advertisements
Similar presentations
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Advertisements

SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Major insights from the HGP on Nature (2001) 15 th Feb Vol 409 special issue; pgs 814 & )Gene content 2)Proteome content 3)SNP identification.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Human non-synonymous SNP: molecular function, evolution and disease Shamil Sunyaev Genetics Division, Brigham & Women’s Hospital Harvard Medical School.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Dr Katie Snape Specialist Registrar in Genetics St Georges Hospital
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Next-Generation Sequencing
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
A Genome-wide association study of Copy number variation in schizophrenia Andrés Ingason CNS Division, deCODE Genetics. Research Institute of Biological.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Your genome: What does your DNA say about you? Personal Genetics Education Project (pgEd) Harvard Medical School personal genetics education.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
HW2: exome sequencing and complex disease Jacquemin Jonathan de Bournonville Sébastien.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Single Nucleotide Polymorphisms (SNPs) By Amira Jhelum Rahul Shweta.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Personal DNA Testing Melanie Wark Mackenzie Steen.
The Haplotype Blocks Problems Wu Ling-Yun
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
Chapter 8: Genetic Epidemiology. Introduction Scientific advancement in genetics – Mendel’s laws of inheritance – Charles Darwin’s observations – Discovery.
Interpreting exomes and genomes: a beginner’s guide
Single Nucleotide Polymorphisms (SNPs
SNPs and complex traits: where is the hidden heritability?
Genomic Analysis: GWAS
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Press report 13/10/ publications selected.
Common variation, GWAS & PLINK
Genetic Linkage.
Disease risk prediction
Interpretation Next Generation Sequencing (Bench Clinic)
Introduction to Direct-to-Consumer Genetic Testing
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Pharmacogenetic effects on pharmacokinetics and pharmacodynamics
What should be in your SNPedia write-up?
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Genetic Linkage.
Detection of the footprint of natural selection in the genome
Beyond GWAS Erik Fransen.
Linking Genetic Variation to Important Phenotypes
VWF sequence variants: innocent until proven guilty
Genomic alterations in breast cancer cell line MDA-MB-231.
Group A1 Caroline Kissel, Meg Sabourin, Kaylee Isaacs, Alex Maeder
Chapter 7 Multifactorial Traits
Genetic Drift, followed by selection can cause linkage disequilibrium
Steven A. McCarroll, Steven E. Hyman  Neuron 
Nic’s genome contains 16,124 variants,
Characteristics of Neutral and Deleterious Protein-Coding Variation among Individuals and Populations  Wenqing Fu, Rachel M. Gittelman, Michael J. Bamshad,
Evolutionary Change Without Natural Selection
Amélie Bonnefond, Philippe Froguel  Cell Metabolism 
The Content of the Genome
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
SNPs and CNPs By: David Wendel.
The principles of genetic association
Analysis of protein-coding genetic variation in 60,706 humans
Minor variants expand the range of neonatal HSV-2 coding diversity.
Presentation transcript:

Nucleotide variation in the human genome Nieves García Gisbert

How variable is the human genome? We are identical in 99,5 % of our genome sequence Human genome has about 10 million polymorphisms Recent projects studying variability in humans: Single Nucleotide Polymorphism (SNP): variation in a single nucleotide that occurs at a specific position in the genome

7,000,000,000 5,000,000 How many humans are we now in the world? How many humans were 10,000 years ago? 5,000,000

Recent history of human population Human global population has grown from a 5 million people to 7 billion, in only 10.000 years x3 increase in 400 generations Recent explosive human population growth has resulted in an excess of rare genetic variants Keinan, Alon, et al. "Recent explosive human population growth has resulted in an excess of rare genetic variants." (2012):

Previous studies Recent studies limited sample size of these studies (at most 60 individuals) only allowed capturing variants of frequency as low as ~1% in the sample limited view of rare variants in the population only focused in common variants higher number of individuals are studied, having a bigger sample size allowed to detect all kind of variants, including the less frequent It has been observed an excess of rare variants compared to what was expected explosive human population growth

Identifying rare variants Singletons: rare variants, which are only found in one individual in the population sample. Doubletons are found twice. Variants considered rare in population require sequencing of a large sample of individuals. Now it is economically possible, but there are other problems: How to distinguish false positives from rare SNP variants? False positives: are errors in sequencing that may be considered as variants For example: (20,000 times the sequence) Gene 10,000 individuals 2 errors in each bp Sequencing method: error rate: 1 error per 10,000 mb

Identifying rare variants Singletons: rare variants, which are only found in one individual in the population sample. Doubletons are found twice. Variants considered rare in population require sequencing of a larger sample of individuals. Now it is economically possible, but there are other problems: How to distinguish false positives from rare SNP variants? Rare variants have such a low frequency that they are considered not reliable second validation of rare variants with other sequencing methods

Recent variants are mostly rare variants Due to the explosion in population growth in the last years, there is a high presence of new and rare variants A larger sample allow identification of rarer polymorphisms that are due to more recent mutations. An estimation is: - SNPs with >5% frequency have 10,000-100,000 years - SNPs with 5% frequency only 8% have 10,000 years less than 1% have 2,500 years Rare variants add information about changes that have occur during this recent epochs of accelerated explosive growth

Presence of rare variants Coventry, Alex, et al., 2010. In collaboration with the Atherosclerosis Risk community, they resequenced KCNJII and HHEX genes in 10,422 individuals 10,422 individuals Is this just because this genes are more likely to have this rare variants? two genes: KCNJII, HHEX Of all the SNVs that were found, 50% were singletons (only present in one of the individuals)

Presence of rare variants Tennessen, Jacob A., et al., 2012. They try to identify and understand variants related with complex diseases 2440 exomes where sequenced 503,481 SNV were identified in 15,583 genes 82% SNVs were novel, previously unknown. 48,9% of this are nonsynonymous 75% SNVs were singletons 110 genes had a higher proportion of rare variants Including six histone related genes, highly conserved and with great selective constraint

Presence of rare variants Tennessen, Jacob A., et al., 2012. They analyzed found SNVs using function predictors Combined software: Gives a score to a concrete aa change in the protein to predict if its function or conformation is altered 95.7% variants with functional importance are rare SNVs

Rare variants and disease it is not clear which is the medical implication of an excess of rare genetic variation as these variants are so recent, they appear as novel mutation, with little time for natural selection to operate rare variants appear in high conserved places, and sometimes are nonsynonymous, they may have a role in disease risk for more complex disease may be due to this rapid expansion of rare variants

Rare variants and disease Rare diseases When affecting to certain genes Rare variants For example, autism, mental retardation, epilepsy and schizophrenia have been shown to be influenced by rare variants that affect genes Studying common variants has not been enough to determine the cause of many common diseases Common diseases

Conclusions Studying this rare variants require large sample size studies As these variants are so recent, they appear as novel mutation, with little time for natural selection to operate. Majority of protein coding variation is evolutionary recent, rare, and probably deleterious Rare variants are contributing to human phenotypic variation and disease susceptibility, but it is not known how. It is not clear which is the medical implication of an excess of rare genetic variation Evaluation of rare variants found is needed to understand disease

Bibliography Keinan, Alon, and Andrew G. Clark. "Recent explosive human population growth has resulted in an excess of rare genetic variants." science 336.6082 (2012): 740-743. Goldstein, D.B., and Cavalleri, G.L. 2005. Genomics: Understanding human diversity. Nature 437:1241-1242. Tennessen, Jacob A., et al. "Evolution and functional impact of rare coding variation from deep sequencing of human exomes." science 337.6090 (2012): 64-69. Sachidanandam, Ravi, et al. "A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms." Nature 409.6822 (2001): 928-933. Cirulli, Elizabeth T., and David B. Goldstein. "Uncovering the roles of rare variants in common disease through whole-genome sequencing." Nature Reviews Genetics 11.6 (2010): 415-425.