Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.

Slides:



Advertisements
Similar presentations
Imputation for GWAS 6 December 2012.
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
Genetics 101/Clinical Significance Camp Sunshine July 22, 2013 Diamond Blackfan Anemia Foundation Diamond Blackfan Anemia Canada.
Are you ready for the genomic age? An introduction to human genomics Jacques Fellay EPFL School of Life Sciences Swiss Institute of Bioinformatics Lausanne,
Next–generation DNA sequencing technologies – theory & practice
Ruibin Xi Peking University School of Mathematical Sciences
Efficient Algorithms for Genome-wide TagSNP Selection across Populations via the Linkage Disequilibrium Criterion Authors: Lan Liu, Yonghui Wu, Stefano.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Genetic Association Analysis --- impact of NGS 1.
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Variant discovery Different approaches: With or without a reference? With a reference – Limiting factors are CPU time and memory required – Crossbow –
The 1000 Genomes Project Gil McVean Department of Statistics, Oxford.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
An Update in Genetics of Epilepsy
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Two loci AA/AA x BB/BB AB/AB (F1) AA AA long chrom short chrom locus 1 locus 2 Parent A BB BB long chrom short chrom locus 1 locus 2 Parent B Locus1/Locus.
Course Overview Personalized Medicine: Understanding Your Own Genome Fall 2014.
Department of Biomedical Informatics Bioinformatics and Genetics Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared Resource.
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Dr Katie Snape Specialist Registrar in Genetics St Georges Hospital
Whole Exome Sequencing for Variant Discovery and Prioritisation
Bernard Keavney Institute of Human Genetics University of Newcastle, UK. Recent developments in genetic epidemiology relevant to PURE.
HapMap: application in the design and interpretation of association studies Mark J. Daly, PhD on behalf of The International HapMap Consortium.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Bioinformatics SNPs and haplotypes Kristel Van Steen, PhD, ScD Université de Liege - Institut Montefiore
Genetics-multistep tumorigenesis genomic integrity & cancer Sections from Weinberg’s ‘the biology of Cancer’ Cancer genetics and genomics Selected.
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Next-Generation Sequencing
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Allele. Alternate form of a gene gene variant autosome.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Molecular & Genetic Epi 217 Association Studies
CS177 Lecture 10 SNPs and Human Genetic Variation
1 of 32 Sequence Variation in Ensembl. 2 of 32 Outline SNPs SNPs in Ensembl Haplotypes & Linkage Disequilibrium SNPs in BioMart HapMap project Strain-specific.
The 1000 Genomes Project Gil McVean Department of Statistics, Oxford.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Association mapping: finding genetic variants for common traits & diseases Manuel Ferreira Queensland Institute of Medical Research Brisbane Genetic Epidemiology.
Molecular & Genetic Epi 217 Association Studies: Indirect John Witte.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
E XOME SEQUENCING AND COMPLEX DISEASE : practical aspects of rare variant association studies Alice Bouchoms Amaury Vanvinckenroye Maxime Legrand 1.
ABC for the AEA Basic biological concepts for genetic epidemiology Martin Kennedy Department of Pathology Christchurch School of Medicine.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
HW2: exome sequencing and complex disease Jacquemin Jonathan de Bournonville Sébastien.
Lecture-3 EXOME SEQUENCING Huseyin Tombuloglu, Phd GBE423 Genomics & Proteomics.
The International Consortium. The International HapMap Project.
Single nucleotide polymorphisms and Large scale variation
Motivations to study human genetic variation
Copyright OpenHelix. No use or reproduction without express written consent1.
Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Analysis of Next Generation Sequence Data BIOST /06/2015.
Global Variation in Copy Number in the Human Genome Speaker: Yao-Ting Huang Nature, Genome Research, Genome Research, 2006.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Interpreting exomes and genomes: a beginner’s guide
Genomic Analysis: GWAS
Gil McVean Department of Statistics
Interpretation Next Generation Sequencing (Bench Clinic)
Week 5 Theory and application for setting up an RNA-Seq pipeline
High level GWAS analysis
Beyond GWAS Erik Fransen.
Genetics of Human Cardiovascular Disease
Catarina D. Campbell, Nick Sampas, Anya Tsalenko, Peter H
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Analysis of protein-coding genetic variation in 60,706 humans
KDM4A SNP-A482 (rs586339) correlates with worse outcome in patients with NSCLC. A, schematic of the human KDM4A protein is shown with both the protein.
Presentation transcript:

Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood lipids, proinsulin or similar biomarkers Limitations – Sufficient sample size – The massive number of statistical tests performed presents an unprecedented potential for the positive results – Search the entire genome-->not worth the expenditure For each of SNPs, allele frequency alters?Odds ratio Proportion of the same alleleProportion of a specific allele genotyped for the majority of common known SNPs Healthy control groupCase group

Advantage of Exome Sequecing Whole genome sequencing – Redundant raw data(6 Gb in each human diploid genome ) Exome sequecing(targeted exome capture) – Exons are short and 180,000 exons constitute 1% of the human genome – The goal is to identify the functional variation that is responsible for both mendelian and common diseases

Significance Exome sequencing can be used to identify causal variants of rare disorders The first reported study that used exome sequencing as an approach to identify an unknown causal gene for a rare mendelian disorder

The Shendure Lab Next-generation human genetics – A multiplex approach to genome sequencing – Targeted sequence enrichment Protocols relying on molecular inversion probe Hybrid capture – Novel analytical strategies to identify the genetic basis of Mendelian disorders by exome sequecing Autosomal recessive disorders such as Miller syndrome Autosomal dominant disorders such as Kabuki syndrome

Hapmap project Focuse on common SNPs(at least 1% of the population) Samples: 4 populations – (30*3 YRI, 30*3 CEU, 45 JPT, 45 CHB) Data: – SNP frequencies, genotypes

Work flow Direct identification of the causal gene for FSS Read mapping and variant analysis DNA samples, targeted capture and massively parallel sequencing

a. PCR-based approach b. Molecular inversion probe(MIP)- based approach c. Hybrid capture-based approach Mamanova et al. Nat Method 7(2): Target enrichment Methods

Mamanova et al. Nat Method 7(2):

Figure. ① Probe list of array2 ② Probe list of array1 ③ Exome on 1-22, X and Y chromosomes

Work flow Direct identification of the causal gene for FSS Read mapping and variant analysis DNA samples, targeted capture and massively parallel sequencing

Coming… Direct identification of the causal gene for FSS Comparison of sequence calls to array genotypes, dbSNP and whole genome sequencing

Method Calculation of genome-wide estimates Variant annotation Comparison of sequence calls to array genotypes, dbSNP and whole genome sequencing Variant calling Target Masking Read mapping Sequencing Targeted capture by hybridization of DNA microarrays Design of exon capture array Shotgun library construction Oligonucleotides and adaptors Genomic DNA samples

Method

Method 2:MIP and resequencing

Method 3: Whole genome sequencing

Method 4:

Figure. Table of cSNPs of 8 HapMap individuals

Figure. Table of Splice Site Variants of 8 HapMap individuals

Figure. Table of Coding Indels of 8 HapMap individuals

Figure. Table of coverage of 8 HapMap individuals and 4 FSS individual

Figure. Intervals that were exclued….

Figure. ① Probe list of array2 ② Probe list of array1 ③ Exome on 1-22, X and Y chromosomes

YRI: Nigeria - Yoruba people of Ibadan CHB: China - Beijing JPT: Japan - Tokyo CEU: Centre d'Etude du Polymorphisme Humain (CEPH) Eur: European–American ancestry

About mendelian disease

Traditional situation

Current situation

Considerations Causal genes may be shared by case group. Control group may not contain that mutation. Common mutation may not be causal. Causal mutation should cause animo acid change.

Result

Further application Typical single gene disorder. Disorder caused by single but not uniform gene. Multiple gene disorder. Complex disease. Cancer.