Kaitlyn Cook Carleton College Northfield Undergraduate Mathematics Symposium October 7, 2014 A METHOD FOR COMBINING FAMILY-BASED RARE VARIANT TESTS OF.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Linkage and Genetic Mapping
Lecture 39 Prof Duncan Shaw. Meiosis and Recombination Chromosomes pair upDNA replication Chiasmata form Recombination 1st cell division 2nd cell divisionGametes.
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
What is Heredity?.
HEREDITY CHAPTER 4. You have Characteristics or traits. Acquired Traits —Reading Skills Inherited Traits —eye color.
MALD Mapping by Admixture Linkage Disequilibrium.
Introduction to Bioinformatics Yana Kortsarts Bob Morris.
More Powerful Genome-wide Association Methods for Case-control Data Robert C. Elston, PhD Case Western Reserve University Cleveland Ohio.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Deoxyribonucleic Acid
Robust and powerful sibpair test for rare variant association
Supervisor: Yihong Jennifer Tan Eric Gähwiler Karim Hamidi
Introducing genes Genetics is the study of inherited traits and their variations. Genetics is not genealogy! Genealogy is the study of family relationships.
Chuanyu Sun Paul VanRaden National Association of Animal breeders, USA Animal Improvement Programs Laboratory, USA Increasing long term response by selecting.
DNA What is it? And what does it do?. What two things did you not know, had forgotten or felt were important?
IUMSP Institut universitaire de médecine sociale et préventive, Lausanne Exploring the association of the CYP1A1- CYP1A2 locus with blood pressure in CoLaus.
Inquiry: How is DNA used to store and transmit cell information?
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
GENETICS AND HEREDITY Chapter 5. Genetics and Heredity Heredity- the passing of traits from parents to offspring Genetics- the study of how traits are.
Generalized Linear Mixed Model (GLMM) & Weighted Sum Test (WST) Detecting Association between Rare Variants and Complex Traits Qunyuan Zhang, Ingrid Borecki,
Genetics Ms Mahoney MCAS Biology. Central Concepts: Genes allow for the storage and transmission of genetic information. They are a set of instructions.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
DNA and Modern Genetics Chapter 5C. D eoxyribo N ucleic A cid DNA is a molecule that stores information that a cell needs to function, grow, & divide.
DNA Bases. Adenine: Adenine: (A) pairs with Thymine (T) only.
1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
Heredity Notes. DNA  DNA- (Deoxyribonucleic acid)- genetic material that carries the information about an organism and is passed from parent to offspring.
DNA Replication Section 12-2 in Textbook. ANIMAL CELL.
DNA and Modern Genetics Chapter 5. Chapter 5 Section 1 NOTES Page 135.
DNA Review & Chromosomes
DNA & Chromosomes. DNA Model DNA can be compared to a ladder that is twisted into a continuous spiral shape known as a double helix Each step contains.
CATALYST Recall and Review: – What are chromosomes? – What are genes? – What are alleles? How do these terms relate to DNA? How do these terms relate to.
Genes and Behaviour By Mr Daniel Hansson. “The Jim Twins” - Bouchard (1979)  Twins were separated at birth and reunited at age of 39  Both drove Chevrolets.
Genetics DNA and Chromosomes. DNA Deoxyribonucleic Acid –D–Double Helix Shape –(–(Twisted Ladder) Genetic code – –I–Instruction manual.
An quick overview of human genetic linkage analysis
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Association mapping for mendelian, and complex disorders January 16Bafna, BfB.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Fast test for multiple locus mapping By Yi Wen Nisha Rajagopal.
Genetic disorders can be due to any of the following factors: A. Monogenetic Disorders: Caused by a mutation in a single gene 1. Autosomal recessive alleles:
DESIGNER GENES GENETICS REVIEW. QUESTIONS? 1) WHAT IS GENETICS? 2) WHAT ARE GENES? 3)WHAT ARE CHROMOSOMES? 4)WHAT IS DNA? 5)HOW DOES A BABY GET HER OR.
DNA/Genetic Disorder Quiz Review. Any change in a gene or chromosome is a:  Pedigree  Mutation  Karyotype  Genome.
 passing on of characteristics from parents to offspring.
DNA: The Code of Life Biology I. What is DNA? DNA stores our genetic code in the nucleus the information is passed from one generation to the next (from.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
 passing on of characteristics from parents to offspring.
Analysis of Next Generation Sequence Data BIOST /06/2015.
Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 統計論文 奈良原.
DNA Structure DNA consists of two molecules that are arranged into a ladder-like structure called a Double Helix. A molecule of DNA is made up of millions.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Genetics Heredity Genes DNA Chapter Introduction.
SNPs and complex traits: where is the hidden heritability?
Introduction to Genetic Algorithms
How traits are passed from parents to offspring.
DNA Structure.
DNA Structure.
AIM: How are DNA molecules structured
DNA Chromosome Allele The nucleus of a cell contains chromosomes which carry instructions for the growth and development of an organism. The chromosomes.
BIOLOGY! – GENETICS!.
Unraveling DNA youtube. com/watch
Chapter 7 Multifactorial Traits
= DNA Nucleotide Phosphate Nitrogen Base Pairs:
Heredity Notes.
Sex-linked and Polygenic Traits
Learning Intentions Understanding the structure of the:
DNA Structure and Function Notes
DNA Structure.
Notes – Genetics 1.
Presentation transcript:

Kaitlyn Cook Carleton College Northfield Undergraduate Mathematics Symposium October 7, 2014 A METHOD FOR COMBINING FAMILY-BASED RARE VARIANT TESTS OF ASSOCIATION

 Genetic information stored in DNA  Coded as a sequence of Adenines, Thymines, Guanines, and Cytosines  Organized into chromosomes  Gene…a collection of nucleotides  Allele…a version of a gene AN INTRODUCTION TO GENETICS A A T C G G A T T C T G G A C C C C G C G C G A T T A A T C G G A T T C T G G A C A C C T C G C G A T T Gene 1Gene 2

 Genetic information stored in DNA  Coded as a sequence of Adenines, Thymines, Guanines, and Cytosines  Organized into chromosomes  Homozygous…having two copies of the same allele  Heterozygous…having copies of two different alleles AN INTRODUCTION TO GENETICS A A T C G G A T T C T G G A C C C C G C G C G A T T A A T C G G A T T C T G G A C A C C T C G C G A T T HomozygousHeterozygous

 Genetic information stored in DNA  Coded as a sequence of Adenines, Thymines, Guanines, and Cytosines  Organized into chromosomes  Single Nucleotide Polymorphism (SNP)  Minor allele  Single Nucleotide Variant (SNV) AN INTRODUCTION TO GENETICS A A T C G G A T T C T G G A C C C C G C G C G A T T A A T C G G A T T C T G G A C A C C T C G C G A T T

 Possible that these rare variants are associated with complex diseases  Either risk-increasing or risk-reducing  Want to be able to detect these sorts of associations  One problem: POWER  Two solutions:  Gene-based tests of association  Use family-based study designs RARE VARIANTS

 Rather than analyzing individual SNVs, we can collapse them on a gene level  Instead of testing whether one SNV is associated with a disease phenotype, we test whether at least one SNV in a given gene is associated  Aggregates the signal and increases power  Two ways of collapsing  Burden tests  Variance component tests GENE-BASED ASSOCIATION TESTS

 Tests perform optimally in different circumstances  Burden tests: all the SNVs are risk-increasing or risk-reducing  Variance component tests: mixture of risk-increasing or risk-reducing  Power depends on type of test and genetic architecture  Drawback: picking the “right” test requires knowing genetic architecture beforehand  Picking the “wrong” test could lead to substantial drop in power GENE-BASED ASSOCIATION TESTS

 Recruit families into the study  Genotype all possible members of each family  Construct pedigrees  Genetic inheritance means that rare variants are aggregated in the families  Increased prevalence means increased power FAMILY-BASED STUDY DESIGN

 If different tests perform better under different scenarios, can we somehow combine information from these tests to improve power?  Develop a flexible method that:  Can combine any number and type of family-based tests of association  Increase statistical power and robustness  Maintains type I error OUR FOCUS

 Generate a vector of test statistics, Q(0) = (Q 1,…, Q k )  Permute the phenotype m times, finding Q(1), …, Q(m)  Convert test statistic vectors Q(i) into p-value vectors p(i) by determining each test statistic’s relative rank among the m permutations  Combine p-values within each permutation into a single summary statistic S(i)  Compute the significance level of S(0) by finding the percentage of S(i) which are greater than S(0), out of m. COMBINATION METHOD

 Typical permutation approaches do not work in family setting  Fit the data to a random effects model  Fixed effects for covariates  Random effects for kinship  Permuted the residuals PERMUTATION STRATEGY

 Genetic Analysis Workshop 19  Human sequence data  Sequence information for 959 individuals in 20 families  Phenotype of interest: Systolic and diastolic blood pressure  Real and simulated phenotypes provided  200 replicates of the simulated phenotype  Used to calculate power/type I error DATA USED

STATISTICAL TESTS USED

 Type I error conserved across both individual and combined family-based tests  Power was generally low for all tests used  Combined test had empirical power in the same range as the individual tests  Choice of variant weighting system had substantial impact on empirical power  Combined test more robust to differences in the characteristics of individual genes RESULTS

 Warm colors: SKAT weights  Cool colors: WSS weights  Black: combined test approach

 Working to determine the optimal type or number of tests to include in the combined test statistic  Reducing correlation between the tests being combined  Incorporation of further biological information into the test statistic NEXT STEPS

 NIH/NHGRI: R15HG  NSF/MCB:  Derkach A, Lawless J, Sun L (2012) Robust and Powerful Tests of Rare Variants Using Fisher’s Method to Combine Evidence of Association From Two or More Complementary Tests. Genet Epidemiol 00:1-12.  Greco B, Hainline A, Liu K, Zawistowski M, Tintle N General approaches for combining multiple rare variant association tests provide improved power across a wider range of genetic architecture (in progress) ACKNOWLEDGEMENTS & REFERENCES