Introduction to QTL mapping

Slides:



Advertisements
Similar presentations
15 The Genetic Basis of Complex Inheritance
Advertisements

Unit B Understanding Animal Body Systems
DNA Structure and Function
Statistical methods for genetic association studies
Linkage and Genetic Mapping
Genetic Linkage and Recombination
The next generation Chapters 9, 10, 17 in the course textbook, especially pages , ,
A. Novelletto, F. De Rango Dept. Cell Biology, University of Calabria GENOTYPING CONCORDANT / DISCORDANT COUSIN PAIRS.
The genetic dissection of complex traits
Bacteria cells reproduce differently from other single celled organisms. What form of asexual reproduction do they use? a. binary fission b. budding c.
Planning breeding programs for impact
Software for Incorporating Marker Data in Genetic Evaluations Kathy Hanford U.S. Meat Animal Research Center Agricultural Research Service U.S. Department.
An Introduction to the application of Molecular Markers
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Genetics notes For makeup. A gene is a piece of DNA that directs a cell to make a certain protein. –Homozygous describes two alleles that are the same.
HEREDITY CHAPTER 4. You have Characteristics or traits. Acquired Traits —Reading Skills Inherited Traits —eye color.
Basics of Linkage Analysis
. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage.
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Linkage analysis: basic principles Manuel Ferreira & Pak Sham Boulder Advanced Course 2005.
GGAW - Oct, 2001M-W LIN Study Design for Linkage, Association and TDT Studies 林明薇 Ming-Wei Lin, PhD 陽明大學醫學系家庭醫學科 台北榮民總醫院教學研究部.
Genetics.
Genetic Theory Manuel AR Ferreira Egmond, 2007 Massachusetts General Hospital Harvard Medical School Boston.
Genetics All Your Hopes and All Your Fears. Genetics Classical Genetics –Mendelian genetics Fundamental principles underlying transmission of genetic.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Introduction to Linkage
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
Reconstructing Genealogies: a Bayesian approach Dario Gasbarra Matti Pirinen Mikko Sillanpää Elja Arjas Department of Mathematics and Statistics
Biometrical genetics Pak C Sham The University of Hong Kong Manuel AR Ferreira Queensland Institute for Medical Research 23 rd International Workshop on.
Linkage Analysis in Merlin
REPORTING CATEGORY 2. #21-PROTEIN SYNTHESIS CHANGES TO DNA CODE.
Stern - Introductory Plant Biology: 9th Ed. - All Rights Reserved - McGraw Hill Companies Genetics Chapter 13 Copyright © McGraw-Hill Companies Permission.
Copy the folder… Faculty/Sarah/Tues_merlin to the C Drive C:/Tues_merlin.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Unit 4 Vocabulary Review. Nucleic Acids Organic molecules that serve as the blueprint for proteins and, through the action of proteins, for all cellular.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Introduction to QTL analysis Peter Visscher University of Edinburgh
Broad-Sense Heritability Index
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Introduction to Genetics
Non-Mendelian Genetics
Genetic Theory Manuel AR Ferreira Boulder, 2007 Massachusetts General Hospital Harvard Medical School Boston.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Life Science “The Molecular Basis of Heredity”. Amino Acid Any of the organic acids that are the chief component of proteins, either manufactured by cells.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
Regression-Based Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Recombination and Linkage
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Using Merlin in Rheumatoid Arthritis Analyses Wei V. Chen 05/05/2004.
Genetic principles for linkage and association analyses Manuel Ferreira & Pak Sham Boulder, 2009.
Biometrical genetics Manuel AR Ferreira Boulder, 2008 Massachusetts General Hospital Harvard Medical School Boston.
GENOME ORGANIZATION AS REVEALED BY GENOME MAPPING WHY MAP GENOMES? HOW TO MAP GENOMES?
Genetic Theory Manuel AR Ferreira Boulder, 2007 Massachusetts General Hospital Harvard Medical School Boston.
Power in QTL linkage analysis
Chapter 8 DNA & RNA.
GENETICS.
Regression Models for Linkage: Merlin Regress
UNIT FOUR – CONTINUITY & CHANGE AREA OF STUDY 0NE - HEREDITY
The Study of Heredity Chapter 2.
Biomedical Technology I
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
GENETICS.
A Brief History What is molecular biology?
Biometrical model and introduction to genetic analysis
Lecture 9: QTL Mapping II: Outbred Populations
Presentation transcript:

Introduction to QTL mapping Manuel Ferreira Boulder Introductory Course 2006

Outline 1. Aim 2. The Human Genome 3. Principles of Linkage Analysis 4. Parametric Linkage Analysis 5. Nonparametric Linkage Analysis

1. Aim

QTL mapping LOCALIZE and then IDENTIFY a locus that regulates a trait (QTL) Nucleotide or sequence of nucleotides with variation in the population, with different variants associated with different trait levels.

For a heritable trait... Linkage: localize region of the genome where a QTL that regulates the trait is likely to be harboured Family-specific phenomenon: Affected individuals in a family share the same ancestral predisposing DNA segment at a given QTL Association: identify a QTL that regulates the trait Population-specific phenomenon: Affected individuals in a population share the same ancestral predisposing DNA segment at a given QTL

2. Human Genome

DNA structure A DNA molecule is a linear backbone of alternating sugar residues and phosphate groups Attached to carbon atom 1’ of each sugar is a nitrogenous base: A, C, G or T Two DNA molecules are held together in anti-parallel fashion by hydrogen bonds between bases [Watson-Crick rules] Antiparallel double helix A gene is a segment of DNA which is transcribed to give a protein or RNA product Only one strand is read during gene transcription Nucleotide: 1 phosphate group + 1 sugar + 1 base

DNA polymorphisms Microsatellites SNPs >100,000 Many alleles, (CA)n, very informative, even, easily automated SNPs 10,054,521 (25 Jan ‘05) 10,430,753 (11 Mar ‘06) Most with 2 alleles (up to 4), not very informative, even, easily automated A B

Diploid zygote >1 cell DNA organization 22 + 1 2 (22 + 1) 2 (22 + 1) 2 (22 + 1) ♂ ♂ ♁ A - A - A - ♁ B - ♂ ♁ ♂ ♁ Mitosis B - B - chr1 A - A - A - - A A - - A ♁ ♂ ♁ B - B - B - - B B - - B A - - A - A B - chr1 - B - B G1 phase S phase M phase Haploid gametes Diploid zygote 1 cell Diploid zygote >1 cell

DNA recombination ♁ NR R R Diploid gamete precursor cell NR 22 + 1 22 + 1 A - NR (♂) B - A - - A chr1 2 (22 + 1) 2 (22 + 1) B - - B ♁ Meiosis - A R chr1 ♂ (♂) (♁) ♁ - B A - - A A - - A chr1 B - - B B - - B A - R chr1 chr1 chr1 chr1 (♁) A - - A B - chr1 Diploid gamete precursor cell B - - B - A chr1 NR - B Haploid gamete precursors chr1 Hap. gametes

DNA recombination between linked loci 22 + 1 A - B - NR (♂) A - - A B - - B 2 (22 + 1) ♁ Meiosis - A NR - B ♂ (♂) (♁) ♁ A - - A A - - A B - - B B - - B A - B - NR (♁) A - - A B - - B Diploid gamete precursor - A - B NR Haploid gamete precursors Hap. gametes

Human Genome - summary DNA is a linear sequence of nucleotides partitioned into 23 chromosomes Two copies of each chromosome (2x22 autosomes + XY), from paternal and maternal origins. During meiosis in gamete precursors, recombination can occur between maternal and paternal homologs Recombination fraction between loci A and B (θ) Proportion of gametes produced that are recombinant for A and B If A and B are very far apart: 50%R:50%NR - θ = 0.5 If A and B are very close together: <50%R - 0 ≤ θ < 0.5 Recombination fraction (θ) can be converted to genetic distance (cM) Haldane: eg. θ=0.17, cM=20.8 Kosambi: eg. θ=0.17, cM=17.7

3. Principles of Linkage Analysis

Linkage Analysis requires genetic markers Mn θ 0.5 0.5 .4 .3 .3 .4 0.5 .15 M1 M2 Mn θ 0.5 .35 .22 .26 .35 0.5 0.5 .4 .3 .1 .3 .4 M1 M2 Mn

Linkage Analysis: Parametric vs. Nonparametric Gene Chromosome Recombination Genetic factors M Q A Mode of inheritance Correlation D Phe C Environmental factors E Adapted from Weiss & Terwilliger 2000

4. Parametric Linkage Analysis

Autosomal dominant, Q1 predisposing allele Linkage with informative phase known meiosis Gene ♂ Chromosome ♁ M1..6 Q1,2 Autosomal dominant, Q1 predisposing allele M2M5Q2Q2 M1M6Q1Q? M1 Q1 Informative Phase known M1M2Q1Q2 M1Q1/M2Q2 M3M4Q2Q2 M2 Q2 M1Q1/M3Q2 M2Q2/M3Q2 M1Q1/M4Q2 M1Q1/M4Q2 M2Q2/M4Q2 M2Q1/M3Q2 NR: M1Q1 NR: M2Q2 θMQ = 1/6 = 0.17 (~20.8 cM) R: M1Q2 R: M2Q1

Linkage with informative phase unknown meiosis Q1 M1 Q2 Q2Q2 Q1Q? M2 Q2 M2 Q1 Informative Phase unknown M1M2Q1Q2 M1Q2/M2Q1 M1Q1/M2Q2 M3M4Q2Q2 M1Q1/M3Q2 M2Q2/M3Q2 M1Q1/M4Q2 M1Q1/M4Q2 M2Q2/M4Q2 M2Q1/M3Q2 M1Q1/M2Q2 P N M1Q2/M2Q1 P N NR: M1Q1 ½(1-θ) 3 R: M1Q1 ½θ 3 NR: M2Q2 ½(1-θ) 2 R: M2Q2 ½θ 2 R: M1Q2 ½θ NR: M1Q2 ½(1-θ) R: M2Q1 ½θ 1 NR: M2Q1 ½(1-θ) 1 + +

Parametric LOD score calculation Overall LOD score for a given θ is the sum of all family LOD scores at θ eg. LOD=3 for θ=0.28

Parametric Linkage Analysis - summary Q M1 M2 Mn θ 0.5 0.5 .4 .3 .3 .4 0.5 .1 For each marker, estimate the θ that yields highest LOD score across all families This θ (and the LOD) will depend upon the mode of inheritance assumed MOI determines the genotype at the trait locus Q and thus determines the number of meiosis which are recombinant or nonrecombinant. Limited to Mendelian diseases. Markers with a significant parametric LOD score (>3) are said to be linked to the trait locus with recombination fraction θ

Outline 1. Aim 2. The Human Genome 3. Principles of Linkage Analysis 4. Parametric Linkage Analysis 5. Nonparametric Linkage Analysis

5. Nonparametric Linkage Analysis

Approach Parametric: genotype marker locus & genotype trait locus (latter inferred from phenotype according to a specific disease model) Parameter of interest: θ between marker and trait loci Nonparametric: genotype marker locus & phenotype If a trait locus truly regulates the expression of a phenotype, then two relatives with similar phenotypes should have similar genotypes at a marker in the vicinity of the trait locus, and vice-versa. Interest: correlation between phenotypic similarity and marker genotypic similarity No need to specify mode of inheritance, allele frequencies, etc...

Phenotypic similarity between relatives Squared trait differences Squared trait sums Trait cross-product Trait variance-covariance matrix Affection concordance T2 T1

Inheritance vector (M) Genotypic similarity between relatives IBS Alleles shared Identical By State “look the same”, may have the same DNA sequence but they are not necessarily derived from a known common ancestor M1 M2 M3 M3 IBD Alleles shared Identical By Descent are a copy of the same ancestor allele Q1 Q2 Q3 Q4 M1 M2 M3 M3 Q1 Q2 Q3 Q4 IBS IBD M1 M3 M1 M3 2 1 Q1 Q3 Q1 Q4 1 Inheritance vector (M) 1

Genotypic similarity between relatives - Inheritance vector (M) Number of alleles shared IBD Proportion of alleles shared IBD - M1 M3 M2 M3 1 1 Q1 Q3 Q2 Q4 M1 M3 M1 M3 1 1 0.5 Q1 Q3 Q1 Q4 M1 M3 M1 M3 2 1 Q1 Q3 Q1 Q3

Genotypic similarity between relatives - D 22n

H:\manuel - Copy folder “Linkage” to C:\ Practical Aim (1) Estimate IBD with MERLIN; (2) IBD estimation can be influenced by genotyped individuals and allele frequencies; (3) compute H:\manuel - Copy folder “Linkage” to C:\ 1. Open with Notepad: pr1.ped pr1.dat pr1.map pr1.freq 2. Start>Run>C:/Linkage/pfe32.exe 3. Run Command Prompt 4. Keep a File Explorer window open Exercice1 (1) Estimate IBD for pedigrees A, B and C in the previous slide (2) Change allele frequencies (pr1.freq) from 0.25 0.25 0.25 0.25 to (i) 0.45 0.25 0.25 0.05 and (ii) 0.05 0.25 0.25 0.45

Practical E F G A1A2 A3A4 A1A3 A1A3 A2A4 Exercice 2 A1A2 A1A3 A1A3 (1) Modify pr1.ped and estimate IBD probabilities and between twin 1 and twin 2 for pedigrees E, F and G: E F G A1A2 A1A3 A1A3 A1A3 A1A3 A1A3 A1A3 A2A4 P(IBD=0) 0.08 0.00 0.00 P(IBD=1) 0.31 0.20 0.00 P(IBD=2) 0.61 0.80 1.00 0.77 0.90 1.00 Allele frequencies on pr1.freq: 0.25 0.25 0.25 0.25

IBD at a marker Singlepoint IBD IBD at a ‘grid’ Multipoint IBD 5 cM M1 Mn IBD at a marker Singlepoint IBD 5 cM M1 M2 Mn IBD at a ‘grid’ Multipoint IBD

Statistics that incorporate both phenotypic and genotypic similarities Phenotypic similarity 0.5 1 Genotypic similarity ( )

Phenotypic dissimilarity Haseman-Elston regression – Quantitative traits 0.5 1 Phenotypic dissimilarity = b × Genotypic similarity + c

VC ML – Quantitative & Categorical traits method 0.5 1 H1: H0: e.g. LOD=3

Genome-wide linkage analysis (e.g. VC) Individual LOD scores can be expressed as P values (Pointwise) LOD Chi-sq (n-df) P value 2.1 9.67 0.0009 (x4.6) True positive k Theoretical (Lander & Kruglyak 1995) LOD LOD = 3.6, Chi-sq = 16.7, P = 0.000022 Type I error

Nonparametric Linkage Analysis - summary No need to specify mode of inheritance Models phenotypic and genotypic similarity of relatives Expression of phenotypic similarity, calculation of IBD HE and VC are the most popular statistics used for linkage of quantitative traits Other statistics available, specially for affection traits