Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project

Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project. Paul L. Auer, Alex P. Reiner, Gao Wang, Hyun Min Kang, Goncalo R. Abecasis, David Altshuler, Michael J. Bamshad, Deborah A. Nickerson, Russell P. Tracy, Stephen S. Rich, Suzanne M. Leal. The American Journal of Human Genetics, Volume 99, Issue 4, Pages 791-801 (October 2016). DOI: 10.1016/j.ajhg.2016.08.012. Copyright © 2016 American Society of Human Genetics.

Figure 1. Schematic of the Work Flow for Sample Selection and Data Analysis in ESP. Primary traits were selected from large, population-based studies with widely available data on secondary traits. Both European American and African American samples were selected for sequencing. Association analyses were conducted using both genes and single variants as units of analysis.

Figure 2. Coding Variants Observed in the NHLBI-ESP. (A) The average number of missense, synonymous, nonsense, and splice site variants per study subject for 2,307 African Americans, 4,392 European Americans, and all study subjects (n = 6,699) for the intersection of all four targets. The vertical lines display the smallest and largest number of variants of each type observed per person. (B) The number of missense, synonymous, nonsense, and splice site variants observed for NHLBI-ESP study subjects (n = 6,699). Each pie chart shows the number of singletons, doubletons, and variant sites with a MAF of ≤1%, >1%–5%, and >5%. (C) The average number of unique missense, synonymous, nonsense, and splice site variants per individual. These variants are exclusive to the NHLBI-ESP and are also not observed in either dbSNP or 1000 Genomes. (D) Comparison of the number of coding variant sites observed in African Americans (AAs) and European Americans (EAs). Shown are the numbers of missense, synonymous, nonsense, and splice site variants that are unique to each population, that are observed in both populations, and that have a MAF of ≥1%. The numbers displayed are exclusive to one category. To compare the number of variant sites fairly, equal numbers of African Americans (n = 2,312) and European Americans (n = 2,312) were studied.
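The frequency classes in panel (B) can be illustrated with a short sketch. It assumes, as one reading of the caption rather than something the source states, that singletons and doubletons are sites with a minor allele count of 1 and 2 and that they are set aside before the MAF bins apply. The function name `frequency_class` and the example counts are hypothetical and are not ESP data.

```python
from collections import Counter


def frequency_class(minor_allele_count: int, n_chromosomes: int) -> str:
    """Assign one variant site to a single, exclusive frequency class."""
    if n_chromosomes <= 0 or not 1 <= minor_allele_count <= n_chromosomes / 2:
        raise ValueError("need 1 <= minor allele count <= n_chromosomes / 2")
    # Assumption: count-based classes take priority over the MAF bins.
    if minor_allele_count == 1:
        return "singleton"
    if minor_allele_count == 2:
        return "doubleton"
    maf = minor_allele_count / n_chromosomes
    if maf <= 0.01:
        return "MAF <=1%"
    if maf <= 0.05:
        return "MAF >1%-5%"
    return "MAF >5%"


# 6,699 diploid subjects carry 2 * 6,699 = 13,398 autosomal chromosomes.
N_CHROMOSOMES = 2 * 6699

# Hypothetical minor allele counts for six variant sites (not real ESP data).
example_counts = [1, 1, 2, 100, 400, 1000]
class_counts = Counter(frequency_class(c, N_CHROMOSOMES) for c in example_counts)
print(class_counts)
```

Because every site falls into exactly one class, the per-class counts sum to the total number of sites, which is what a pie chart requires.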

Figure 3. Triglyceride Rare Variant Association Analysis and Association of Rare Variants in APOC3. (A) QQ plot of the meta-analysis for African Americans and European Americans of rare variant burden analysis of triglyceride levels. The −log10 values of the observed p values are displayed versus their expected values. Rare variant association analysis was performed separately for African Americans (n = 1,654) and European Americans (n = 2,074) using the CMC, analyzing those variant sites with a MAF ≤ 0.01. (B) Distribution of triglyceride levels for NHLBI-ESP study subjects and triglyceride levels for individuals with an APOC3 variant. Shown is the quantitative trait distribution of triglycerides after natural log transformation for African Americans and European Americans who are study subjects in the NHLBI-ESP. For the 27 individuals (8 African American and 19 European American) who are heterozygous for one of the 7 coding variants (3 splice, 1 stop-gain, and 3 missense), a tick represents their triglyceride levels after natural log transformation. For each variant site, a diamond (red for African Americans and blue for European Americans) represents the average triglyceride level for carriers of that variant. (C) Distribution of triglyceride levels for study subjects from the Women’s Health Initiative (WHI) and triglyceride levels for individuals with an APOC3 variant. Shown is the quantitative trait distribution of triglycerides after natural log transformation for African Americans (n = 1,820) and European Americans (n = 1,643) who are study subjects from the WHI. The DNA samples from the study subjects were genotyped on the exome chip. Of the seven variants that were observed in NHLBI-ESP, four were represented on the exome chip.
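The axes of the QQ plot in panel (A) can be computed as in the sketch below. It assumes the common plotting position rank / (n + 1) for the expected p values under the null hypothesis; the source does not say which convention was used. The function name `qq_points` and the example p values are hypothetical.

```python
import math


def qq_points(p_values):
    """Return (expected, observed) -log10 p value pairs, largest values first."""
    n = len(p_values)
    if n == 0 or any(not 0 < p <= 1 for p in p_values):
        raise ValueError("p values must lie in (0, 1]")
    points = []
    # Sorting ascending puts the smallest p value (largest -log10) at rank 1.
    for rank, p in enumerate(sorted(p_values), start=1):
        expected_p = rank / (n + 1)  # assumed plotting position under the null
        points.append((-math.log10(expected_p), -math.log10(p)))
    return points


# Hypothetical p values for three genes (not results from the study).
example_p = [0.9, 0.01, 0.5]
for expected, observed in qq_points(example_p):
    print(f"expected={expected:.3f} observed={observed:.3f}")
```

Points that follow the diagonal are consistent with the null hypothesis, while points that rise above it at the upper end indicate p values smaller than expected by chance.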

Figure 4. An Analysis of Statistical Power to Detect Associations across the Exome. (A) Sample sizes necessary to detect associations for a binary trait across the exome. (B) Sample sizes necessary to detect associations for a quantitative trait. Results from the SKAT, CMC, and BRV rare-variant association tests are shown in blue, green, and red, respectively.
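To give a feel for how a required sample size arises for a quantitative trait, the sketch below uses a generic normal approximation for a single one-degree-of-freedom test. It is not the power analysis behind this figure and does not model SKAT, CMC, or BRV. The function name `required_sample_size`, the exome-wide threshold of 2.5e-6 (0.05 divided by an assumed 20,000 genes), and the 80% power target are all assumptions made for illustration.

```python
import math
from statistics import NormalDist


def required_sample_size(variance_explained, alpha=2.5e-6, power=0.8):
    """Approximate n for a two-sided 1-df test of a quantitative trait.

    variance_explained is the fraction of trait variance attributed to the
    tested gene; alpha and power defaults are illustrative assumptions.
    """
    if not 0 < variance_explained < 1:
        raise ValueError("variance_explained must lie in (0, 1)")
    if not 0 < alpha < 1 or not 0 < power < 1:
        raise ValueError("alpha and power must lie in (0, 1)")
    standard_normal = NormalDist()
    z_alpha = standard_normal.inv_cdf(1 - alpha / 2)
    z_power = standard_normal.inv_cdf(power)
    # Noncentrality needed so that the test reaches the requested power.
    noncentrality = (z_alpha + z_power) ** 2
    # Solve n * r2 / (1 - r2) = noncentrality for n.
    return math.ceil(noncentrality * (1 - variance_explained) / variance_explained)


for r2 in (0.01, 0.005, 0.001):
    print(f"variance explained {r2}: n >= {required_sample_size(r2)}")
```

Under this approximation the required sample size grows roughly in inverse proportion to the variance explained, which is why genes with small aggregate effects demand very large cohorts.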