Presentation is loading. Please wait.

Presentation is loading. Please wait.

VISG – LARGE DATASETS Literature Review Introduction – Genome Wide Selection Aka Genomic Selection Set of Markers 10,000’s - enough to capture most genetic.

Similar presentations


Presentation on theme: "VISG – LARGE DATASETS Literature Review Introduction – Genome Wide Selection Aka Genomic Selection Set of Markers 10,000’s - enough to capture most genetic."— Presentation transcript:

1

2 VISG – LARGE DATASETS Literature Review

3 Introduction – Genome Wide Selection Aka Genomic Selection Set of Markers 10,000’s - enough to capture most genetic information ‘Training set’ of animals phenotyped and genotyped representative of industry Predictor Over-specified – e.g. 10000 variables, 1000 individuals Robust model selection required Application Predict in selection candidates –Maybe no phenotypes –Maybe no pedigrees

4 Introduction – Genome Wide Selection Prediction Methods Stepwise Regression gBLUP –Fit all markers as a random effect –g i ~ N(0,  g 2 ) BayesA –g i ~ N(0,  gi 2 ) –prior :  gi 2 ~ S/  2 (choose S and ) BayesB –similar to BayesA, except –proportion  of effects are zero Most investigations compare these Many variations (sometimes with the same name)

5 Literature Dairy applications review (Hayes et al., 2009) GWS in crops (Heffner, Sorrells, Jannick, 2009) Prediction in unrelateds (Meuwissen, 2009) Marker panels (Habier, 2009) Phenotypes (Harris & Johnson, unpub) +...

6 Issues National evaluations Long term gains LD or relationship tracking Multiple breeds Distance from Training to Application Marker Panels (subsets) Phenotypes (EBV-based) Non-additive effects Computing requirements

7 Methods gBLUP almost as good as Bayes(A) (dairy) Interpretation(?): many genes of small effect Bayes methods better at using real LD (vs relatedness) Bayes(B) advantage greater with Higher marker density Higher Training  Application distance Smaller Training set Mixture of 2 normals ~ BayesB Partial Least Squares Machine Learning Haplotype methods not used in practice yet

8 Marker Panels Evenly spaced panels Track inheritance from parents (both SNP-chipped) Will work with new traits Lasso methods popular Shrinks small effects to zero

9 Other Combining marker and other information Phenotype info, parent info Index methods; ‘blending’ Important for seamless national evaluations Computing strategies Tricks to reduce computation Approximation rather than Iterative (MCMC) methods

10 Online resources Conferences Statistical Genetics of Livestock for the Post-Genomic Era. UW-Madison, May, 2009. http://dysci.wisc.edu/sglpge/index.html http://dysci.wisc.edu/sglpge/index.html QTL/MAS Workshops. 2008: http://www.computationalgenetics.se/QTLMAS08 2009: http://www.qtlmas2009.wur.nl/UK/ http://www.computationalgenetics.se/QTLMAS08http://www.qtlmas2009.wur.nl/UK/ Courses Whole Genome Association and Genomic Selection. September 1-8, 2008, Salzburg, Austria. http://www.nas.boku.ac.at/12100.html?&L=0 http://www.nas.boku.ac.at/12100.html?&L=0 Use of High-density SNP Genotyping for Genetic Improvement of Livestock. Iowa State, June, 2009. http://www.ans.iastate.edu/stud/courses/short/ http://www.ans.iastate.edu/stud/courses/short/

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27 Toy example 5 SNP / 1000 individuals y = mu + SNP1 + e – mu = 10 – SNP1 substitution effect = 10 / p = 0.5 – Var(e) = 1 1 block / 1000 iterations Runs in ~ 5 secs

28

29

30

31


Download ppt "VISG – LARGE DATASETS Literature Review Introduction – Genome Wide Selection Aka Genomic Selection Set of Markers 10,000’s - enough to capture most genetic."

Similar presentations


Ads by Google