Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel

Slides:

Advertisements

Similar presentations

Parental value: Combining ability estimates from Line x tester analysis for yield components in potato genotypes NEERAJ SHARMA.

Advertisements

Test of (µ 1 – µ 2 ),  1 =  2, Populations Normal Test Statistic and df = n 1 + n 2 – 2 2– )1– 2 ( 2 1 )1– 1 ( 2 where ] 2 – 1 [–

Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.

Combined Analysis of Experiments Basic Research –Researcher makes hypothesis and conducts a single experiment to test it –The hypothesis is modified and.

Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 15 Analysis of Data from Fractional Factorials and Other Unbalanced.

Factorial Models Random Effects Random Effects Gauge R&R studies (Repeatability and Reproducibility) have been an expanding area of application Gauge R&R.

CHAPTER 25: One-Way Analysis of Variance Comparing Several Means

Hypothesis Testing Steps in Hypothesis Testing:

Lecture 4: Basic Designs for Estimation of Genetic Parameters

Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture #19 Analysis of Designs with Random Factor Levels.

PBG 650 Advanced Plant Breeding

PBG 650 Advanced Plant Breeding Module 9: Best Linear Unbiased Prediction – Purelines – Single-crosses.

Psychology 202b Advanced Psychological Statistics, II February 1, 2011.

Lecture 4: Heritability. Heritability Narrow vs. board sense Narrow sense: h 2 = V A /V P Board sense: H 2 = V G /V P Slope of midparent-offspring regression.

Lecture 4: Basic Designs for Estimation of Genetic Parameters.

ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.

Lecture 9: One Way ANOVA Between Subjects

= == Critical Value = 1.64 X = 177  = 170 S = 16 N = 25 Z =

Bootstrapping LING 572 Fei Xia 1/31/06.

Today Concepts underlying inferential statistics

Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.

Linear Regression/Correlation

Two-Way Balanced Independent Samples ANOVA Computations Contrasts Confidence Intervals.

1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,

PBG 650 Advanced Plant Breeding

Objectives of Multiple Regression

ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?

Factorial Experiments

Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.

Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.

Fixed vs. Random Effects

Fixed vs. Random Effects Fixed effect –we are interested in the effects of the treatments (or blocks) per se –if the experiment were repeated, the levels.

1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.

Basic Statistical Concepts  M. Burgman & J. Carey 2002.

Effect Size Estimation in Fixed Factors Between-Groups ANOVA

Genetics and Genetic Prediction in Plant Breeding

Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.

Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.

PBG 650 Advanced Plant Breeding

Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.

Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =

The Completely Randomized Design (§8.3)

DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.

Experimental Design and Data Structure Supplement to Lecture 8 Fall

© Copyright McGraw-Hill 2000

1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 8 Analysis of Variance.

Limits to Statistical Theory Bootstrap analysis ESM April 2006.

Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.

1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.

Statistics for Differential Expression Naomi Altman Oct. 06.

Confidence Intervals for a Population Mean, Standard Deviation Unknown.

Chapter 8. Process and Measurement System Capability Analysis

Lecture 22: Quantitative Traits II

Experimental Statistics - week 9

ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.

Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.

Genetics and Genetic Prediction in Plant Breeding.

1 G Lect 13b G Lecture 13b Mixed models Special case: one entry per cell Equal vs. unequal cell n's.

Quantitative methods and R – (2) LING115 December 2, 2009.

THE INHERITANCE OF PLANT HEIGHT IN HEXAPLOID WHEAT (Triticum aestivum L.) Nataša LJUBIČIĆ 1*, Sofija PETROVIĆ 1, Miodrag DIMITRIJEVIĆ 1, Nikola HRISTOV.

Genetics and Genetic Prediction in Plant Breeding.

Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.

Factorial Experiments

PBG 650 Advanced Plant Breeding

Comparing Three or More Means

Genetics and Genetic Prediction in Plant Breeding

Experimental Design Data Normal Distribution

Fixed, Random and Mixed effects

Presentation transcript:

Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel PBG 650 Advanced Plant Breeding Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel

Also called Nested design Two types of families Females North Carolina Design 1 Hierarchical design Two types of families Half sibs (male groups) Full-sibs (females/males) Females Males 1 2 3 4 1 5 6 7 8 2 . m . f

Nested design – one location Linear Model Yijk=  + Bi + Mj + Fk(j) + eijk Source df MS Expected Mean Square Blocks r-1 MSR Males m-1 MSM Females/males m(f-1) MSF Error (r-1)(mf-1) MSE Might also have sets and multiple environments See Bernardo, pg 164, for ANOVA with sets and environments

Variance components from the nested design (if the parents are not inbred) Design III – primary use is to estimate average level of dominance in F2 populations, and to determine possible bias in VA and VD due to linkage (if the parents are not inbred)

Expected Mean Squares in SAS Random statement generates expected mean squares Test option obtains appropriate F tests for the model specified In the example below, cultivars are fixed, all other effects are random Proc GLM; Class Loc Rep Cultivar; Model Yield=Loc Rep(Loc) Cultivar Loc*Cultivar; Random Loc Rep(Loc) Loc*Cultivar/Test; Run; controversial (could be dropped) Source Type III Expected Mean Square Loc Var(Error) + 3 Var(Loc*Cultivar) + 7 Var(Rep(Loc)) + 21 Var(Loc) Rep(Loc) Var(Error) + 7 Var(Rep(Loc)) Cultivar Var(Error) + 3 Var(Loc*Cultivar) + Q(Cultivar) Loc*Cultivar Var(Error) + 3 Var(Loc*Cultivar) fixed effect Proc Mixed may give better estimates of variance components

Combining Ability General combining ability (GCA)– the average of all F1 crosses from a line (or genotype), expressed as a deviation from the population mean The expected value of a cross is the sum of the combining ability of its two parents Specific combining ability (SCA)– the deviation of a cross from its expected value Where X is the performance of the cross

Estimation of combining ability GCA polycross method - allow all lines to intermate naturally top crossing - a line is crossed to a random sample of plants from a reference population GCA and SCA Factorial design (NC Design II) – a group of ‘male’ parents is crossed to a group of ‘female’ parents requires mxf crosses (e.g. 5x5=25) can be applied to two heterotic populations Diallel – all possible crosses among a set of parents n(n-1)/2 possible crosses without parents or reciprocals (e.g. 10x9/2=45)

Variations on the Diallel Type of cross-classified design With or without the parents With or without reciprocal crosses bulk seed from both parents if maternal effects are not important Genotypes may be random or fixed For random model, need many parents to adequately sample the population Large number of crosses! Can be divided into sets Partial diallels can be conducted If parents are inbred, can make paired row crosses to obtain more seed Hallauer, Carena, and Miranda (2010) pg 119-138

Griffing’s Methods (Diallels) all possible crosses, including selfs Method 2 no reciprocals Method 3 no parents Method 4 no parents or reciprocals most common, because parents often inbred and less vigorous For each Method, genotypes may be Model I = Fixed Model II = Random

Diallel crossing A B C D ……. a+a a+b a+c a+d a+n a b+b b+c b+d b+n b Parent A B C D ……. N Mean a+a a+b a+c a+d a+n a b+b b+c b+d b+n b c+c c+d c+n c d+d d+n d n+n n ….. …..

Diallel analysis

Random model Usually does not include parents and reciprocals Can be divided into sets Source df MS Expected Mean Square Blocks r-1 Crosses [n(n-1)/2] -1 MS2 GCA n-1 MS21 SCA n(n-3)/2 MS22 Error (r-1){[n(n-1)/2] -1} MS1 Griffing (1956) is classic reference

Genetic variances from random model General form for variance of a variance component k=coefficient of MS fg=df of the mean square

Fixed model GCA effects SCA effects Lattice designs are useful Advantage: first order effects (means) are estimated with greater precision than variances

Diallel analysis with parents Gardner-Eberhart Analysis II Source df Blocks r-1 Entries [n(n+1)/2]-1 Parents n-1 Parents vs crosses 1 Crosses [n(n-1)/2]-1 GCA SCA n(n-3)/2 Error (r-1){[n(n+1)/2] -1} Source df Blocks r-1 Entries [n(n+1)/2]-1 Varieties n-1 Heterosis n(n-1)/2 Average 1 Variety n -1 Specific n(n-3)/2 Error (r-1){[n(n+1)/2] -1} Gardner-Eberhart partitioning of Sums of Squares is non-orthogonal Fit model sequentially

Factorial Mating Design Diallel Factorial (Design II) Parents Diallel Factorial 4 6 15 9 10 45 25 20 190 100 4950 2500 n n(n-1)/2 n2/4

General formula for covariance of relatives A B C D X Y r = 2XY  = ACBD + ADBC Extended to include epistasis:

Epistatic Variance Often assumed to be absent, but could bias estimates of A2 and D2 upwards Estimation requires more complex mating designs Expected to be smaller than A2 and D2, so larger experiments are needed for adequate precision Coefficients are correlated with those for A2 and D2, which leads to multicollinearity problems For most crops, experimental estimates of epistatic variance have been small

Example of mating design to estimate epistatic variance Design I experiment from ‘Jarvis’ and ‘Indian Chief’ maize populations Obtained random inbred lines from each population, which were used as parents in a Design II experiment A comparison of these values can be made to estimate epistatic variances Eberhart et al., 1966

Precision of variance components Minimum of 50-100 progeny to adequately sample population (Bernardo’s advice, some would say more!) Large numbers of progeny do not guarantee precise estimates of variance Confidence intervals can be determined for estimates of variance (sets lower and upper bounds) It’s possible in practice to obtain negative estimates of variance components, but they are theoretically impossible large error variance true estimate of genetic variance is close to zero Report as zero? (may lead to bias when results are compiled across many experiments) See Bernardo, pg 166, for further details on confidence intervals

Resampling methods are useful when Confidence interval calculations assume that the underlying distribution is normal. Work best for balanced data. Resampling methods are useful when underlying distributions are unknown or are not normal we don’t know how to estimate the confidence interval Examples Bootstrap – resampling with replacement Jackknife – systematically delete data points Permutation test – data scrambling only works when there are two or more types of families