The Structure of Common Genetic Variation in United States Populations

Slides:



Advertisements
Similar presentations
A Haplotype at STAT2 Introgressed from Neanderthals and Serves as a Candidate of Positive Selection in Papua New Guinea  Fernando L. Mendez, Joseph C.
Advertisements

Copyright © 2005 American Medical Association. All rights reserved.
Katarzyna Bryc, Eric Y. Durand, J
Itsik Pe’er, Yves R. Chretien, Paul I. W. de Bakker, Jeffrey C
Genetic Landscape of Eurasia and “Admixture” in Uyghurs
Performance of Common Analysis Methods for Detecting Low-Frequency Single Nucleotide Variants in Targeted Next-Generation Sequence Data  David H. Spencer,
CYP3A Variation and the Evolution of Salt-Sensitivity Variants
Pathogenic Variants for Mendelian and Complex Traits in Exomes of 6,517 European and African Americans: Implications for the Return of Incidental Results 
Katarzyna Bryc, Eric Y. Durand, J
Population Genetic Structure of the People of Qatar
Volume 26, Issue 7, Pages (April 2016)
Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors  Michael Dannemann, Aida M.
Model-free Estimation of Recent Genetic Relatedness
Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits  Nicholas Mancuso, Huwenbo Shi, Pagé.
The Genetic Legacy of Zoroastrianism in Iran and India: Insights into Population Structure, Gene Flow, and Selection  Saioa López, Mark G. Thomas, Lucy.
An Extensive Analysis of Y-Chromosomal Microsatellite Haplotypes in Globally Dispersed Human Populations  Manfred Kayser, Michael Krawczak, Laurent Excoffier,
Haplotype Estimation Using Sequencing Reads
A Combined Linkage-Physical Map of the Human Genome
Association Mapping in Structured Populations
Proportioning Whole-Genome Single-Nucleotide–Polymorphism Diversity for the Identification of Geographic Population Structure and Genetic Ancestry  Oscar.
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
Vincent B. McGinty, Antonio Rangel, William T. Newsome  Neuron 
Alternative Splicing QTLs in European and African Populations
Gene-Expression Variation Within and Among Human Populations
John Wakeley, Rasmus Nielsen, Shau Neen Liu-Cordero, Kristin Ardlie 
Reduction of Sample Heterogeneity through Use of Population Substructure: An Example from a Population of African American Families with Sarcoidosis 
Emily C. Walsh, Kristie A. Mather, Stephen F
Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes  Matthieu Deschamps, Guillaume Laval,
Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data  Gao T. Wang, Bo Peng, Suzanne M. Leal  The.
Ida Moltke, Matteo Fumagalli, Thorfinn S. Korneliussen, Jacob E
Towfique Raj, Manik Kuchroo, Joseph M
A Comparison of Genotype-Phenotype Maps for RNA and Proteins
Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project  Paul L. Auer, Alex.
Japanese Population Structure, Based on SNP Genotypes from 7003 Individuals Compared to Other Ethnic Groups: Effects on Population-Based Association Studies 
CYP3A Variation and the Evolution of Salt-Sensitivity Variants
Xin Li, Alexis Battle, Konrad J. Karczewski, Zach Zappala, David A
Sequencing the IL4 locus in African Americans implicates rare noncoding variants in asthma susceptibility  Gabe Haller, BA, Dara G. Torgerson, PhD, Carole.
Selection and Reduced Population Size Cannot Explain Higher Amounts of Neandertal Ancestry in East Asian than in European Human Populations  Bernard Y.
Ivan P. Gorlov, Olga Y. Gorlova, Shamil R. Sunyaev, Margaret R
Mamoru Kato, Yusuke Nakamura, Tatsuhiko Tsunoda 
Matthieu Foll, Oscar E. Gaggiotti, Josephine T
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Brian P. McEvoy, Joanne M. Lind, Eric T. Wang, Robert K
E. Wang, Y. -C. Ding, P. Flodman, J. R. Kidd, K. K. Kidd, D. L
Volume 25, Issue 15, Pages (August 2015)
Shusuke Numata, Tianzhang Ye, Thomas M
Natural Selection and Population History in the Human Angiotensinogen Gene (AGT): 736 Complete AGT Sequences in Chromosomes from Around the World  Toshiaki.
Highly Punctuated Patterns of Population Structure on the X Chromosome and Implications for African Evolutionary History  Charla A. Lambert, Caitlin F.
Shuhua Xu, Wei Huang, Ji Qian, Li Jin 
Brian P. McEvoy, Joanne M. Lind, Eric T. Wang, Robert K
An Efficient Multiple-Testing Adjustment for eQTL Studies that Accounts for Linkage Disequilibrium between Variants  Joe R. Davis, Laure Fresard, David A.
Nonpaternity in Linkage Studies of Extremely Discordant Sib Pairs
Haplotype Diversity across 100 Candidate Genes for Inflammation, Lipid Metabolism, and Blood Pressure Regulation in Two Populations  Dana C. Crawford,
Human Population Genetic Structure and Inference of Group Membership
Jared R. Kohler, David J. Cutler 
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
The Power of Genomic Control
Volume 78, Issue 7, Pages (October 2010)
Identifying Darwinian Selection Acting on Different Human APOL1 Variants among Diverse African Populations  Wen-Ya Ko, Prianka Rajan, Felicia Gomez, Laura.
Stephen Wooding, Un-kyung Kim, Michael J
Complex History of Admixture between Modern Humans and Neandertals
Leslie S. Emery, Kevin M. Magnaye, Abigail W. Bigham, Joshua M
Genetic and Epigenetic Regulation of Human lincRNA Gene Expression
Markers for Mapping by Admixture Linkage Disequilibrium in African American and Hispanic Populations  Michael W. Smith, James A. Lautenberger, Hyoung.
Abraham's Children in the Genome Era: Major Jewish Diaspora Populations Comprise Distinct Genetic Clusters with Shared Middle Eastern Ancestry  Gil Atzmon,
Genotype-Imputation Accuracy across Worldwide Human Populations
A Haplotype at STAT2 Introgressed from Neanderthals and Serves as a Candidate of Positive Selection in Papua New Guinea  Fernando L. Mendez, Joseph C.
Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors  Michael Dannemann, Aida M.
Brian C. Verrelli, Sarah A. Tishkoff 
Population Genetic Structure of the People of Qatar
Presentation transcript:

The Structure of Common Genetic Variation in United States Populations Stephen L. Guthery, Benjamin A. Salisbury, Manish S. Pungliya, J. Claiborne Stephens, Michael Bamshad  The American Journal of Human Genetics  Volume 81, Issue 6, Pages 1221-1231 (December 2007) DOI: 10.1086/522239 Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 1 Summary statistics for data-reduction methods used to estimate population structure. A, Eigenvalues versus the number of principal components. B, Number of clusters plotted as a function of the pseudo F statistic obtained from the UPGMA algorithm. C, Number of clusters plotted as a function of LnP(D) from STRUCTURE. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 2 Site-frequency distributions for SNP data from 3,873 genes. A, SNP site–frequency distribution for the total sample. Of a total 63,127 SNPs (black bars) in the data set, 39% (n=24,982) were singletons (red bar). B, Number and distribution of private SNPs in each population determined from the site-frequency distribution of the total sample. The majority of private SNPs were observed in seven or fewer chromosomes, illustrated by cumulative frequency (gray line). C and D, SNP site–frequency distribution for each population. E and F, SNP site–frequency distributions for African Americans versus non–African Americans. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 3 Distribution of common SNPs among Latino/Hispanic, African, Asian, and European Americans. A and B, The percentage of SNPs that are common (i.e., ⩾10%) in at least one population but are found in both populations (black bars) is high overall but varies from ∼74% to 96%. A modest percentage of common SNPs that are common in at least one population are absent in the other populations (gray bars). C and D, The percentage of common SNPs common in both populations (black bars) compared with SNPs common in only each population compared: African Americans (AfA) (blue bars), Asian Americans (AsA) (red bars), European Americans (EA) (green bars), and Latino/Hispanic Americans (HA) (orange bars). Overall, only a modest percentage (44%–72%) of SNPs common in at least one population are common in both populations. A substantial proportion of common SNPs in African Americans are common only in African Americans. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 4 Contour plot of minor-SNP frequencies between pairs of populations. Plots compare frequencies of SNPs (n=38,145) excluding singletons. Each plot represents a scatterplot with minor-SNP frequency from a given population on each axis. Plots are divided into 3,600 grids (60×60 grids), and the number of data points within each grid is color coded. For example, purple represents 0.01 data points per grid, and red represents 100 data points per grid (see legend in the upper right-hand corner). The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 5 Measures of SNP sharing among Latino/Hispanic (HA), African (AfA), Asian (AsA), and European (EA) Americans. For all figures, the X-axis represents overlapping bins (i.e., >0.05 represents all SNPs with MAF >0.05), and MAF is calculated across all 152 chromosomes. When two populations are compared, MAF is calculated separately for each population. A, Pairwise comparisons of the proportion of SNPs shared between populations. B, Mean differences of pairwise comparisons of MAF between SNPs. C, Spearman rank correlation coefficients among pairwise comparisons of MAF between SNPs. D, Pairwise FST estimates between SNPs. The solid black line in each figure represents the mean value, and the dotted lines indicate the CI of values estimated from 1,000 data sets in which individuals were randomly distributed into pairs of populations (see text for details). ns = nonsingletons. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 6 Site-frequency distribution of synonymous (syn) (gray bars) and nonsynonymous (nonsyn) (black bars) SNPs for the total sample and for African Americans (AfA) versus non–African Americans. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions

Figure 7 Estimation of population structure in GRP samples. AfA = African American; AsA = Asian American; EA = European American; HA = Latino/Hispanic American. A, Phylogenetic network based on genetic distances with the use of UPGMA. B, Plot of principal components (PCs) estimated from a genetic-distance matrix. C, Stacked bar chart with inferences from results of a model-based cluster analysis with the use of STRUCTURE 2.0. Each bar represents an individual, and each bar is divided according to the fraction of cluster membership. D, Triangle plot illustrating the percentage of African, Asian, and European American ancestry of each individual (indicated by colored shapes, as given in panel B) estimated from STRUCTURE 2.0. The American Journal of Human Genetics 2007 81, 1221-1231DOI: (10.1086/522239) Copyright © 2007 The American Society of Human Genetics Terms and Conditions