“10,001 Dalmatians” research programme: Discovery of genetic variants that control human quantitative traits and predispose to diseases Igor Rudan, Mladen.

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

The genetic dissection of complex traits
Genetic Approaches to Thinking, Moving and Feeling
What is an association study? Define linkage disequilibrium
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Missing Heritability Lipika Ray 4th June Heritability: Phenotype (P) = genotype (G) + environmental factors (E) (observed) (unobserved) (unobserved)
Genetic Analysis in Human Disease
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
CARe: The NHLBI’s Candidate Gene Association Resource.
Mapping Genes for SLE: A Paradigm for Human Disease? Stephen S. Rich, Ph.D. Department of Public Health Sciences Wake Forest University School of Medicine.
Cardiovascular Continuum Sampling from Extremes Padmanabhan S et al. PLoS Genet 2011.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Update on Genetics in Hypertension Christian Delles BHF Glasgow Cardiovascular Research Centre Institute of Cardiovascular and Medical Sciences University.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Understanding Genetics of Schizophrenia
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Rare and common variants: twenty arguments G.Gibson Homework 3 Mylène Champs Marine Flechet Mathieu Stifkens 1 Bioinformatics - GBIO K.Van Steen.
Strong Heart Family Study Phase VI Genetics Center Aims October 8, 2009.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Advances in Genomics Since the Publication of the Human Genome Greg Feero, M.D., Ph.D. Faculty, Maine-Dartmouth Family Practice Residency Program, Fairfield,
Quantitative genetics: traits controlled by alleles at many loci
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
A Genome-wide association study of Copy number variation in schizophrenia Andrés Ingason CNS Division, deCODE Genetics. Research Institute of Biological.
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Genome-Wide Association Study (GWAS)
A PPROACHING THE G ENOME - G ENETIC M ARKERS, L INKAGE AND A SSOCIATION G ENETICS 202 Jon Bernstein Department of Pediatrics October 8, 2015.
An quick overview of human genetic linkage analysis
The HapMap Project and Haploview
The International Consortium. The International HapMap Project.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Admixture Mapping Controlled Crosses Are Often Used to Determine the Genetic Basis of Differences Between Populations. When controlled crosses are not.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Genome-Wides Association Studies (GWAS) Veryan Codd.
Next generation genomics: translation into clinically useful applications in health care Prof.dr. Martina Cornel
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
OMICS Journals are welcoming Submissions
SNPs and complex traits: where is the hidden heritability?
Genetic Testing for the Clinician
upstream vs. ORF binding and gene expression?
Recombination (Crossing Over)
High level GWAS analysis
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Genome-wide Associations
Linking Genetic Variation to Important Phenotypes
Chapter 7 Multifactorial Traits
Medical genomics BI420 Department of Biology, Boston College
Perspectives from Human Studies and Low Density Chip
Medical genomics BI420 Department of Biology, Boston College
Genomics, genetic epidemiology, and genomic medicine
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
An Expanded View of Complex Traits: From Polygenic to Omnigenic
Presentation transcript:

“10,001 Dalmatians” research programme: Discovery of genetic variants that control human quantitative traits and predispose to diseases Igor Rudan, Mladen Boban, Tatijana Zemunik, Gordan Lauc, Zoran Đogaš, Stipan Janković, Ivica Grković, Ana Marušić, Janoš Terzić, Rosanda Mulić, Vjekoslav Krželj, Lina Zgaga, Zrinka Biloglav, Ivana Kolčić, Marina Pehlić, Grgo Gunjača, Danijela Budimir, Ozren Polašek

2001. – human genome sequence was published Main expectation (general public, investors, researchers, pharma and biotech industries): Linking genes with diseases and development of new treatments and “personalized medicine” – the race towards this goal begun (each group with its own approach)

Main idea: 1)Find “markers” in the genome and “tag” the whole genome as densely as possible; 2)Find consistent associations between some of those markers and disease phenotypes 3)Find genes in proximity of implicated markers – they are “disease genes”

CASES (“affected”) CONTROLS (“unaffected”) STR MARKER A STR MARKER B,C… DISEASE GENE (MUTATION) DISEASE GENE (WILD TYPE)

Short tandem repeats (STR) – e.g. (TA)x4 or (CTG)x7 – hundreds of STRs across the genome

-STR marker maps were not dense, but they were still very useful to “pick” genes that caused monogenic (Mendelian) diseases

Problems with genome-wide linkage analyses using genome-wide STR maps: 1)STR markers and diseases were not always 100% linked because of incomplete penetrance of causing mutations or genetic heterogeneity of the disease: low study power 2)STR markers and disease genes were not always 100% linked because of recombination (crossing over) between them: low study power

CASES (“affected”) CONTROLS (“unaffected”) STR MARKER A STR MARKER B,C… DISEASE GENE (MUTATION) DISEASE GENE (WILD TYPE)

Problems with genome-wide linkage analyses using genome-wide STR maps: 3) Even when a marker closest to disease gene was found with nearly 100% certainty, it still took years to find all candidate genes in regions up to 10 megabases (or more) and sequence them all to find exact causal mutation 4) Good ideas: -Choose to study phenotypes that are precisely measurable and in good correlation with genotypes -Use populations with large linkage disequilibrium

Strategy (1): Our group proposed to rely on isolated populations (for increased LD) and pedigree-based approach (adds information) in 1999 Nat Genet 1999; 23:

Strategy (2): Our group proposed a highly polygenic model for complex traits and diseases in 2003 Genetics 2003; 163: Trends Genet 2003; 19:

HIGHLY POLYGENIC GENETIC BASIS (FEW RARE VARIANTS WITH LARGE EFFECTS AND MANY COMMON WITH SMALL EFFECTS) “-OMICS” LEVEL (PROTEOMICS, LIPIDOMICS, GLYCOMICS, METABOLOMICS) QUANTITATIVE TRAIT LEVEL (e.g. CHOLESTEROL, BLOOD PRESSURE) Our understanding of complex traits and diseases: COMPLEX DISEASE PHENOTYPE ENVIRONMENT GWAS: MOST POWER & FUNCTIONAL RELEVANCE

Strategy (3): Our group proposed to measure large number of QTs (closer to genes - power, more chance, later - networks)

Quantitative traits: More than 100 selected initially

 : The British Council  : The Wellcome Trust  : Medical Research Council UK (1/3)  : Ministry of Science and Technology, Croatia  : The Royal Society, UK  : National Institutes of Health, USA  : Medical Research Council UK (2/3)  : EU fp6 EUROSPAN  : Medical Research Council UK (3/3)  : Ministry of S & T, Croatia (The Croatian Biobank) Grants awarded (£ 4.0 M) Strategy (4): Finding money to start a large cohort

“Susak-10”: served to choose the most appropriate population ( ) COHORT 1. (1001 examinee)

2003: The choice of further populations was based on demography data and population genetic studies

2003: The populations were extremely differentiated (based on analysis of 26 STR markers below); LD studies conducted using 8 STR markers on Xq13-12

COHORT 2. (1024 examinees) “Vis”: genotyped with (i) 800 STRs and (ii) Illumina 317 k ( )

COHORT 3. (969 examinees) “Korcula”: genotyped with Illumina 370 k CNV ( )

COHORT 4. (1001 examinees) “Split”: outbred population genotyped with Illumina 370 k CNV ( )

Year 2005: BAD YEAR We used 800 STR marker scan and analysed the data using genome-wide linkage analysis. What did we find? ABSOLUTELY NOTHING. Other approaches (e.g. candidate genes and case-control studies)? NO REPLICATIONS FOR ANY OF THE THOUSANDS OF REPORTED ASSOCIATIONS (…OK, MAYBE 4-5 MAX.)

The HapMap project Tried to define “blocks” of genome between “recombination hotspots” and tag each one of them with one of more than 10 million predicted SNPs: new GWAS based on SNPs

Year 2006: TECHNOLOGICAL BREAKTHROUGH! Affymetrix Inc. and Illumina Inc.: Dense genome-wide scans using hundreds of thousands of SNP markers (from HapMap project – “tagging SNPs)

Year 2007: THE “BRAVE NEW WORLD” STUDY (WTCCC, Nature, June 07, 2007)

First analyses of data using SNP

2008: uric acid & gout 2009: lipid levels & coronary heart disease 2010: fasting glucose & diabetes type 2 Nat Genet : FVC, FEC & chr. lung disease Nat Genet : creatinine & chr. kidney disease Nat Genet 2010 (2011: blood pressure & stroke) JAMA 2011 ? (2011: CFH & age-related mac. degeneration) Lancet ? Nat Genet 2009; 41: 47-55Nat Genet 2008; 40: Results of GWAS of QTs with “disease risk” studied

2009: smoking initiation and intensity Nat Genet : clotting factors VII, VIII & vWF Circulation 2010 (2010: sleep duration and latency) Nat Genet 2010 ? (2010: human height, weight, WHR) 3 x Nat Genet 2010 ? (2010: global lipids) Nature 2010 ? (2010: cognitive traits) 2 x Nat Genet 2010 ? (2010: ECG, urine, CRP, HbA1c, ABPI, P, cortisol…) PLoS Genet 2009; 5: e PLoS Genet 2009; 5: e Results of GWAS of QTs without disease risk links

Strategy (5): Next moves (plan for ) 1. GWAS of -OMICS (“1 level down from QT”) & functional follow-up & systems biology / pathways 2. Development of novel methods for analysis of the effect of CNV and rare variants on human QTs 3. Expand the number of phenotypes measured in plasma in at least 3,000 examinees (e.g. ILs, etc.) 4. Whole-genome sequence for 1,000 examinees & the new round of consortia participation

Forthcoming (2010): GWAS of 132 circulating phospholipids (PLoS Genetics) PLoS Genet 2009; 163: Results of GWAS of LIPIDOMICS traits Further interest of our group: GWAS of glycomics, proteomics, other metabolomics and functional follow-up

Progress in GLYCOMICS: dependent of measurement Nature 2009; 457: High-performance liquid chromatography (HPLC): - Glycoproteins immobilized Rudd PM et al. (Natl. Inst. Bioprocessing Res. Train.): refined chromatography approaches for analysis of glycosylation - Glycans released - Fluorescent labels attached - Labelled sugars run on a normal phase HPLC column - Resulting peaks correlated to a pre-run dextran ladder

“GlycoBioGen”: A consortium led by collaboration of Scottish, Croatian & Irish institutions CROATIAN CENTRE FOR GLOBAL HEALTH

Separation of plasma N-glycans in 16 chromatographic peaks using HPLC method (GP1-GP16): area under peak measured as a QT Quantitation of glycans in human plasma: Unusual biological variability at population level Significant effects of age, gender, environmental factors Highly varying heritabilities Striking correlations with other biochemical QTs J Proteome Res 2009; 8:

FUT8: associated with GP1 in 1,000 subjects (p=5.09 x x ) Results of GWAS study (Vis island, Croatia):

Strategy (5): Next moves (plan for ) 1. GWAS of -OMICS (“1 level down from QT”) & functional follow-up & systems biology / pathways 2. Development of novel methods for analysis of the effect of CNV and rare variants on human QTs 3. Expand the number of phenotypes measured in plasma in at least 3,000 examinees (e.g. ILs, etc.) 4. Whole-genome sequence for 1,000 examinees & the new round of consortia participation

CNVs (copy number variants): Nature (April 2010) – WTCCC – didn’t find any associations with disease at all; Rare variants: “Moving frames” method (by Eleftheria Zeggini at Sanger, Hinxton, Cambridge): MAGIC, DIAGRAM & SPIROMETA “Exome sequencing” (4-10x) “Deep whole-genome sequencing” (48x) “Missing heritability”:

Strategy (5): Next moves (plan for ) 1. GWAS of -OMICS (“1 level down from QT”) & functional follow-up & systems biology / pathways 2. Development of novel methods for analysis of the effect of CNV and rare variants on human QTs 3. Expand the number of phenotypes measured in plasma in at least 3,000 examinees (e.g. ILs, etc.) 4. Whole-genome sequence for 1,000 examinees & the new round of consortia participation

Gordan: N-glycans Zoran: CRD series Tatijana i Vesela: T4, TSH Mladen: markers of oxidative stress? Janoš: proteomics? Rosanda: anti-HBV antigens? Ana: interleukins, CD4? “Expand phenotypes”:

Strategy (5): Next moves (plan for ) 1. GWAS of -OMICS (“1 level down from QT”) & functional follow-up & systems biology / pathways 2. Development of novel methods for analysis of the effect of CNV and rare variants on human QTs 3. Expand the number of phenotypes measured in plasma in at least 3,000 examinees (e.g. ILs, etc.) 4. Whole-genome sequence for 1,000 examinees & the new round of consortia participation

Wellcome Trust Sanger Institute, Hinxton, Cambridge: agreement that 400 / 2500 first examinees with WGS will be Croatians (Korcula) Why? – genealogies (expanding the number through “imputation”) and dense phenotyping (hundreds of QTs) Project will start: end of 2011 Value for us: GBP 4 million at present time; should get us into the “next wave” of consortia work; needs Vesna Boraska etc. “Whole-genome sequence era”: