Presentation is loading. Please wait.

Presentation is loading. Please wait.

PGC Worldwide Lab Call Details DATE: Friday, April 12 th, 2013 PRESENTER: Alkes Price, Harvard University TITLE: “GWAS in multiple ancestries: heritability,

Similar presentations


Presentation on theme: "PGC Worldwide Lab Call Details DATE: Friday, April 12 th, 2013 PRESENTER: Alkes Price, Harvard University TITLE: “GWAS in multiple ancestries: heritability,"— Presentation transcript:

1 PGC Worldwide Lab Call Details DATE: Friday, April 12 th, 2013 PRESENTER: Alkes Price, Harvard University TITLE: “GWAS in multiple ancestries: heritability, association, and replication” START: We will begin promptly on the hour. 1000 EDT - US East Coast 0700 PDT - US West Coast 1500 BST - UK 1600 CET - Central Europe 0000 AEDT – Australia (Saturday, April 13th, 2013) DURATION: 1 hour TELEPHONE: - US Toll free: 1 866 515.2912 - International direct: +1 617 399.5126 - Toll-free number? See http://www.btconferencing.com/globalaccess/?bid=75_publichttp://www.btconferencing.com/globalaccess/?bid=75_public - Operators will be on standby to assist with technical issues. “*0” will get you assistance. - This conference line can handle up to 300 participants. PASSCODE: 275 694 38

2 Lines are Muted NOW Lines have been automatically muted by operators as it is possible for just one person to ruin the call for everyone due to background noise, electronic feedback, crying children, wind, typing, etc. Operators announce callers one at a time during question and answer sessions. Dial *1 if you would like to ask a question of the presenter. Presenter will respond to calls as time allows. Dial *0 if you need operator assistance at any time during the duration of the call.

3 UPCOMING PGC Worldwide Lab DATE: Friday, May 10th, 2013 PRESENTER: To Be Announced TITLE: To Be Announced START: We will begin promptly on the hour. 1000 EDT - US East Coast 0700 PDT - US West Coast 1500 BST - UK 1600 CEST - Central Europe 0000 AEST – Australia (Saturday, May 11 th, 2013) DURATION: 1 hour TELEPHONE: - US Toll free: 1 866 515.2912 - International direct: +1 617 399.5126 - Toll-free number? See http://www.btconferencing.com/globalaccess/?bid=75_publichttp://www.btconferencing.com/globalaccess/?bid=75_public - Operators will be on standby to assist with technical issues. “*0” will get you assistance. - This conference line can handle up to 300 participants. PASSCODE: 275 694 38

4 GWAS in multiple ancestries: heritability, association, and replication Alkes L. Price Harvard School of Public Health April 12, 2013

5 Should East Asian (or African-American) samples be included in a predominantly European GWAS? Advantages: More samples => more power if effect sizes are similar. Disadvantages: Diverse samples => less power if effect sizes are different. ( Requires careful treatment of population stratification.) ( Analysis is more complex.)

6 Should East Asian (or African-American) samples be included in a predominantly European GWAS? Advantages: More samples => more power if effect sizes are similar. Disadvantages: Diverse samples => less power if effect sizes are different. ( Requires careful treatment of population stratification.) ( Analysis is more complex.) Are effect sizes similar or different in diverse samples? How can we quantify this?

7 What does “effect sizes” mean in “are effect sizes similar or different”? Causal SNPs at genome-wide significant loci or Top associated SNP at genome-wide significant loci or All genotyped SNPs at genome-wide significant loci or All genotyped SNPs in the genome

8 What does “effect sizes” mean in “are effect sizes similar or different”? Causal SNPs at genome-wide significant loci -- Hypothesis: same effect sizes at causal SNPs Top associated SNP at genome-wide significant loci or All genotyped SNPs at genome-wide significant loci or All genotyped SNPs in the genome

9 What does “effect sizes” mean in “are effect sizes similar or different”? Causal SNPs at genome-wide significant loci -- Hypothesis: same effect sizes at causal SNPs Top associated SNP at genome-wide significant loci -- This is what matters most in multi-ethnic GWAS All genotyped SNPs at genome-wide significant loci or All genotyped SNPs in the genome

10 What does “effect sizes” mean in “are effect sizes similar or different”? Causal SNPs at genome-wide significant loci -- Hypothesis: same effect sizes at causal SNPs Top associated SNP at genome-wide significant loci -- This is what matters most in multi-ethnic GWAS All genotyped SNPs at genome-wide significant loci -- Less similar effect sizes due to different LD patterns All genotyped SNPs in the genome -- Again, less similar effect sizes due to different LD patterns

11 Outline 1. Cross-ethnic replication: are effect sizes similar or different? 2. GWAS in multiple ancestries: should people do it? 3. GWAS in multiple ancestries: how should people do it? 4. Heritability: another approach to similar vs. different effect sizes.

12 Things I am not planning to talk about today Use of local ancestry information in admixed populations. (Pasaniuc et al. 2011 PLoS Genet; Shriner et al. 2011 PLoS Comp Biol; reviewed in Seldin et al. 2011 Nat Rev Genet) Fine-mapping using multiple ancestries. (Zaitlen et al. 2010 Am J Hum Genet; Franceschini et al. 2012 Am J Hum Genet; Peters et al. 2013 PLoS Genet; Wu et al. 2013 PLoS Genet) Polygenic risk prediction in Europeans or multiple ancestries. (Purcell et al. 2009 Nature; Ripke et al. 2011 Nat Genet; Chatterjee et al. 2013 Nat Genet)

13 Outline 1. Cross-ethnic replication: are effect sizes similar or different? 2. GWAS in multiple ancestries: should people do it? 3. GWAS in multiple ancestries: how should people do it? 4. Heritability: another approach to similar vs. different effect sizes.

14 Cross-ethnic replication: let’s look at replication in Europeans first log(odds ratio) in training sample Disclaimer: all results preliminary Slope of log(OR) regression for replication in Europeans: SNPs at P < 5 x10 -8 in training sample log(odds ratio) in test sample (Umea)

15 Cross-ethnic replication: let’s look at replication in Europeans first log(odds ratio) in training sample Disclaimer: all results preliminary Slope of log(OR) regression for replication in Europeans: SNPs at P < 5 x10 -8 in training sample log(odds ratio) in test sample (Bulgarian trios)

16 Cross-ethnic replication: let’s look at replication in Europeans first Slope of log(OR) regression for replication in Europeans, as a function of training sample P-value threshold: P –val thresh10 -7 5 x 10 -8 10 -8 10 -9 10 -10 10 -11 Umea Slope0.740.810.680.720.770.61 BUTR Slope0.640.620.670.700.570.76 Ice Slope0.670.730.750.771.030.97 Noice Slope0.640.620.670.74 0.75 Disclaimer: all results preliminary

17 Cross-ethnic replication: let’s look at replication in Europeans first Slope of log(OR) regression for replication in Europeans, as a function of training sample P-value threshold: (Note: slope of log(OR) regression is not affected by replication sample size.) P –val thresh10 -7 5 x 10 -8 10 -8 10 -9 10 -10 10 -11 AVG Slope0.670.69 0.730.780.77 Disclaimer: all results preliminary

18 Cross-ethnic replication: let’s look at replication in Europeans first Slope of log(OR) regression for replication in Europeans, as a function of training sample P-value threshold: (Note: slope of log(OR) regression is not affected by replication sample size.) Why is slope < 1? Winner’s curse (effect sizes overestimated in training sample) (Zollner & Pritchard 2007 Am J Hum Genet; Zhong & Prentice 2010 Genet Epidemiol) True heterogeneity across cohorts (Lee et al. 2012 Nat Genet: genetic correlations ~0.85) P –val thresh10 -7 5 x 10 -8 10 -8 10 -9 10 -10 10 -11 AVG Slope0.670.69 0.730.780.77 Disclaimer: all results preliminary

19 Cross-ethnic replication: replication in East Asians log(odds ratio) in training sample Disclaimer: all results preliminary log(odds ratio) in test sample (East Asians) Slope of log(OR) regression for replication in Europeans: SNPs at P < 5 x10 -8 in training sample

20 Cross-ethnic replication: replication in East Asians log(odds ratio) in training sample Disclaimer: all results preliminary log(odds ratio) in test sample (East Asians) Slope of log(OR) regression for replication in Europeans: SNPs at P < 5 x10 -8 in training sample ???

21 Cross-ethnic replication: replication in East Asians log(odds ratio) in training sample Disclaimer: all results preliminary log(odds ratio) in test sample (East Asians) Slope of log(OR) regression for replication in Europeans: SNPs at P < 5 x10 -8 in training sample Not an error. Low MAF in East Asians.

22 Cross-ethnic replication: replication in East Asians Slope of log(OR) regression for replication in East Asians, as a function of training sample P-value threshold: (Note: slope of log(OR) regression is not affected by replication sample size.) P –val thresh10 -7 5 x 10 -8 10 -8 10 -9 10 -10 10 -11 Asian Slope0.40 0.410.640.580.52 Disclaimer: all results preliminary

23 Cross-ethnic replication: replication in African Americans Work in progress …

24 Outline 1. Cross-ethnic replication: are effect sizes similar or different? 2. GWAS in multiple ancestries: should people do it? 3. GWAS in multiple ancestries: how should people do it? 4. Heritability: another approach to similar vs. different effect sizes.

25 GWAS in multiple ancestries: should people do it? Of course GWAS should be conducted in diverse populations, since some loci may not be discovered in Europeans. (Unoki et al. 2008 Nat Genet; Yasua et al. 2008 Nat Genet: KCNQ1 T2D locus. Also see Rosenberg et al. 2010 Nat Rev Genet; Bustamante et al. 2011 Nature)

26 GWAS in multiple ancestries: should people do multi-ethnic meta-analysis? Of course GWAS should be conducted in diverse populations, since some loci may not be discovered in Europeans. (Unoki et al. 2008 Nat Genet; Yasua et al. 2008 Nat Genet: KCNQ1 T2D locus. Also see Rosenberg et al. 2010 Nat Rev Genet; Bustamante et al. 2011 Nature) The real question is whether to meta-analyze GWAS of multiple ancestries at the discovery stage, before fine-mapping. (More samples => more power if effect sizes are similar. Diverse samples => less power if effect sizes are different.) Many previous authors have chosen multi-ethnic meta-analysis (Morris et al. 2012 Nat Genet; Estrada et al. 2012 Nat Genet; Franceschini et al. 2012 Am J Hum Genet; Lu et al. 2013 Nat Genet; Fritsche et al. 2013 Nat Genet)

27 GWAS in multiple ancestries: should people do multi-ethnic meta-analysis? Many previous studies have chosen multi-ethnic meta-analysis (Morris et al. 2012 Nat Genet; Estrada et al. 2012 Nat Genet; Franceschini et al. 2012 Am J Hum Genet; Lu et al. 2013 Nat Genet; Fritsche et al. 2013 Nat Genet) Caveat: reports of similar effect sizes across populations may not represent a random sample if those associated SNPs were discovered in a multi-ethnic meta-analysis, which is primarily well-powered to detect associated SNPs with similar effect sizes across populations.

28 GWAS in multiple ancestries: should people do multi-ethnic meta-analysis? Many previous studies have chosen multi-ethnic meta-analysis (Morris et al. 2012 Nat Genet; Estrada et al. 2012 Nat Genet; Franceschini et al. 2012 Am J Hum Genet; Lu et al. 2013 Nat Genet; Fritsche et al. 2013 Nat Genet) Caveat: reports of similar effect sizes across populations may not represent a random sample if those associated SNPs were discovered in a multi-ethnic meta-analysis, which is primarily well-powered to detect associated SNPs with similar effect sizes across populations. But many studies have also reported partial replication in multi-ethnic samples of variants discovered in Europeans. (e.g. Waters et al. 2010 PLoS Genet, N’Diaye et al. 2011 PLoS Genet)

29 GWAS in multiple ancestries: should people do multi-ethnic meta-analysis? Of course GWAS should be conducted in diverse populations, since some loci may not be discovered in Europeans. (Unoki et al. 2008 Nat Genet; Yasua et al. 2008 Nat Genet: KCNQ1 T2D locus. Also see Rosenberg et al. 2010 Nat Rev Genet; Bustamante et al. 2011 Nature) The real question is whether to meta-analyze GWAS of multiple ancestries at the discovery stage, before fine-mapping. If slope of cross-ethnic log(odds ratio) regression is high: YES If slope of cross-ethnic log(odds ratio) regression is low: NO

30 GWAS in multiple ancestries: should people do multi-ethnic meta-analysis? Of course GWAS should be conducted in diverse populations, since some loci may not be discovered in Europeans. (Unoki et al. 2008 Nat Genet; Yasua et al. 2008 Nat Genet: KCNQ1 T2D locus. Also see Rosenberg et al. 2010 Nat Rev Genet; Bustamante et al. 2011 Nature) The real question is whether to meta-analyze GWAS of multiple ancestries at the discovery stage, before fine-mapping. If slope of cross-ethnic log(odds ratio) regression is high: YES If slope of cross-ethnic log(odds ratio) regression is low: NO

31 Should people do multi-ethnic meta-analysis? An example Example: 45K European samples, 5K non-European samples, slope * of cross-ethnic log(odds ratio) = 0.6 (*: corrected for winner’s curse) By including 5K non-European samples in meta-analysis: Sample size increases by a factor of 1.111. log(odds ratio) decreases by a factor of 0.960. Sample size * [log(odds ratio)] 2 (which determines power) increases by a factor of 1.111 * [0.960] 2 = 1.024. (“worst-case” computation for SNPs associated in Europeans)

32 Should people do multi-ethnic meta-analysis? Yes if slope of cross-ethnic log(odds ratio) ≥ 0.5 In general: N 1 European samples, N 2 non-European samples, slope * of cross-ethnic log(odds ratio) = s (*: corrected for winner’s curse) By including N 2 non-European samples in meta-analysis: Sample size increases by a factor of (N 1 +N 2 )/N 1. log(odds ratio) decreases by a factor of (N 1 + sN 2 )/(N 1 +N 2 ). Sample size * [log(odds ratio)] 2 (which determines power) increases by a factor of (N 1 +sN 2 ) 2 /[N 1 (N 1 +N 2 )]. Conclusion: power will increase if and only if (≈ 0.5 when N 1 >> N 2 ). (“worst-case” computation for SNPs associated in Europeans)

33 Outline 1. Cross-ethnic replication: are effect sizes similar or different? 2. GWAS in multiple ancestries: should people do it? 3. GWAS in multiple ancestries: how should people do it? 4. Heritability: another approach to similar vs. different effect sizes.

34 GWAS in multiple ancestries: how should people do it? Option #1: Single mega-analysis with union of all samples. Option #2: Analyze each ethnicity separately, then do meta-analysis using weighted z-scores. Option #3: Analyze each ethnicity separately, then do meta-analysis using inverse-variance weighting.

35 GWAS in multiple ancestries: minimizing dangers of population stratification Option #1: Single mega-analysis with union of all samples.  Subtle within-continent ancestries harder to distinguish in PCA of samples with diverse continental ancestries. Option #2: Analyze each ethnicity separately, then do meta-analysis using weighted z-scores. OK for minimizing population stratification. Option #3: Analyze each ethnicity separately, then do meta-analysis using inverse-variance weighting. OK for minimizing population stratification.

36 GWAS in multiple ancestries: maximizing power Option #1: Single mega-analysis with union of all samples.  Lose power if effect sizes same, allele freq different. Option #2: Analyze each ethnicity separately, then do meta-analysis using weighted z-scores.  Lose power if effect sizes same, allele freq different. Option #3: Analyze each ethnicity separately, then do meta-analysis using inverse-variance weighting. OK for maximizing power.

37 GWAS in multiple ancestries: maximizing power: an example Example: 20K European cases: p = 0.015 (odds ratio = 1.5) 20K European controls: p = 0.01 2K non-European cases: p = 0.6 (odds ratio = 1.5) 2K non-European controls: p = 0.5

38 GWAS in multiple ancestries: maximizing power: an example Example: 20K European cases: p = 0.015 (odds ratio = 1.5) 20K European controls: p = 0.01 2K non-European cases: p = 0.6 (odds ratio = 1.5) 2K non-European controls: p = 0.5 Combined sample: 22K cases: p = 0.0682 (odds ratio = 1.27) 22K controls: p = 0.0545

39 GWAS in multiple ancestries: maximizing power: an example Example: 20K European cases: p = 0.015 (odds ratio = 1.5) 20K European controls: p = 0.01 2K non-European cases: p = 0.6 (odds ratio = 1.5) 2K non-European controls: p = 0.5  Option #1 (Mega-analysis): χ 2 = 35.49, P-value =3 x 10 -9  Option #2 (Meta-analysis, weighted z-scores): χ 2 = 31.49, P-value = 2 x 10 -8 Option #3 (Meta-analysis, inverse-variance weighting): χ 2 = 44.69, P-value = 2 x 10 -11

40 GWAS in multiple ancestries: how should people do it?  Option #1: Single mega-analysis with union of all samples.  Option #2: Analyze each ethnicity separately, then do meta-analysis using weighted z-scores. Option #3: Analyze each ethnicity separately, then do meta-analysis using inverse-variance weighting.

41 Outline 1. Cross-ethnic replication: are effect sizes similar or different? 2. GWAS in multiple ancestries: should people do it? 3. GWAS in multiple ancestries: how should people do it? 4. Heritability: another approach to similar vs. different effect sizes.

42 What does “effect sizes” mean in “are effect sizes similar or different”? Causal SNPs at genome-wide significant loci -- Hypothesis: same effect sizes at causal SNPs Top associated SNP at genome-wide significant loci -- This is what matters most in multi-ethnic GWAS All genotyped SNPs at genome-wide significant loci -- Less similar effect sizes due to different LD patterns All genotyped SNPs in the genome -- Again, less similar effect sizes due to different LD patterns

43 Heritability explained by genotyped SNPs (h g 2 ) Yang et al. 2010 Nat Genet; also see Purcell et al. 2009 Nature

44 Cross-trait h g 2 Lee et al. 2012 Nat Genet; Lee et al. 2012 Bioinformatics

45 Cross-population h g 2 Employ same methods as Lee et al. 2012 papers. A few thousand samples from each population is sufficient. (Yang et al. 2010 Nat Genet; Lee et al. 2012 Nat Genet; Lee et al. 2012 Bioinformatics) Quantifying genome-wide sharing of effect sizes across multi-ethnic populations will yield important clues about disease architecture.

46 Acknowledgements Bjarni Vilhjalmsson, HSPH Stephan Ripke, MGH PGC Consortium PGC Consortium members who have contributed to recent discussions on GWAS in multiple ancestries.


Download ppt "PGC Worldwide Lab Call Details DATE: Friday, April 12 th, 2013 PRESENTER: Alkes Price, Harvard University TITLE: “GWAS in multiple ancestries: heritability,"

Similar presentations


Ads by Google