ANALYSIS OF SELECTIVE DNA POOLING DATA IN FOX Joanna Szyda, Magdalena Zatoń-Dobrowolska, Heliodor Wierzbicki, Anna Rząsa.


1 ANALYSIS OF SELECTIVE DNA POOLING DATA IN FOX Joanna Szyda, Magdalena Zatoń-Dobrowolska, Heliodor Wierzbicki, Anna Rząsa

2 MAIN OBJECTIVES: ASSESS POLYMORPHISM OF MICROSATELLITES; IDENTIFY MARKER-TRAIT ASSOCIATIONS
METHODOLOGICAL OBJECTIVES: TOOLS FOR THE ANALYSIS OF SPARSE DATA

3 SELECTIVE (INDIVIDUAL) GENOTYPING
[Section bar: MATERIAL | METHODS | RESULTS | CONCLUSIONS]
[Figure labels: qq, QQ]
MORE POWER
STANDARD (LINEAR) MODELS NOT VALID

4 SELECTIVE DNA POOLING
[Figure: qq and QQ pools; markers M1, M2, M3, M4 and a QTL; marker genotypes of pooled individuals: m1 M1, m1 m1, m2 m2, m3 m3, M4 M4, m4 m4 and M1 m1, M1 M1, M2 M2, M3 M3, m4 m4, M4 M4]

5 SELECTIVE DNA POOLING
CHEAP: ~18%-60% more efficient (Barratt et al. 2002)
MORE POWERFUL: ~10%-70% fewer individuals
HIGH TECHNICAL ERROR: DNA pool formation (DNA quantification); DNA amplification (differential amplification, shadow bands)
POOLING POPULATIONS: no relationship information, testing for association
POOLING HALF-SIBS: partial relationship information, testing for linkage

6 ANIMALS
POLAR FOX (Alopex lagopus)
NORWEGIAN TYPE “LARGE”, FINNISH TYPE “SMALL”
Numbers shown on the slide: 63, 77

7 MARKERS
MARKER      DOG GENOME  FOX GENOME  HET.
REN112I02   01          ?           0.76
C02.342     02          ?           0.77
C03.629     03          ?           0.76
FH2732      04          ?           0.86
C05.771     05          ?           0.81
FH2734      06          ?           0.82
C08.410     08          ?           0.79
G06401      09          ?           0.64
REN153O12   12          ?           0.76
REN227M12   13          ?           0.74
FH2763      14          ?           0.70
REN275L19   16          ?           0.82
FH3047      17          ?           0.77
REN100J13   20          ?           0.83
REN128E21   22          ?           0.70
LEI002      27          ?           0.70
REN248F14   30          ?           0.70
REN43H24    31          ?           0.66
REN106I07   36          ?           0.78
REN67C18    37          ?           0.83

8 MARKERS
MARKER SELECTION CRITERIA:
POLYMORPHISM: number of alleles, allele lengths
AMPLIFICATION PROPERTIES: temperature
?

9 MARKER ALLELE FREQUENCY IN POOLS

10 MARKER ALLELE FREQUENCY IN POOLS
LOW POLYMORPHISM WITHIN EACH POOL
“POOL-SPECIFIC” ALLELES
POOR CORRESPONDENCE BETWEEN REPLICATES

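11 BINOMIAL DISTRIBUTION
[Table: rows = pools 1-4, columns = alleles 147, 149, 155, 157, 161; cell boundaries lost in extraction, rows as extracted: pool 1: 0014540; pool 2: 000535; pool 3: 1666000; pool 4: 0062100]
BINOMIAL DISTRIBUTION: Odds Ratio, Logistic Regression
[Table: same layout with symbolic counts n_ij; cells shown: n_12; n_21, n_22; n_31, n_32; n_41, n_42]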

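12 ODDS RATIO
For a 2x2 sub-table with counts $n_{11}, n_{12}, n_{21}, n_{22}$ (standard large-sample forms):
$\ln(\mathrm{OR}) = \ln \dfrac{n_{11}\, n_{22}}{n_{12}\, n_{21}}$
distribution: $\ln(\mathrm{OR}) / \sqrt{\operatorname{var}[\ln(\mathrm{OR})]} \sim N(0,1)$ under no association
variance: $\operatorname{var}[\ln(\mathrm{OR})] = \dfrac{1}{n_{11}} + \dfrac{1}{n_{12}} + \dfrac{1}{n_{21}} + \dfrac{1}{n_{22}}$
confidence intervals: $\ln(\mathrm{OR}) \pm z_{\alpha/2} \sqrt{\operatorname{var}[\ln(\mathrm{OR})]}$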
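
The odds-ratio quantities named on this slide can be sketched in a few lines of Python. This is a minimal illustration using the standard large-sample formulas, not the authors' own code; the function name `log_odds_ratio_ci` and the counts are hypothetical, and the default `z = 2.5758` corresponds to a two-sided 0.01 interval.

```python
import math

def log_odds_ratio_ci(n11, n12, n21, n22, z=2.5758):
    """Return (ln OR, variance, lower, upper) for a 2x2 table of counts.

    Assumes all four counts are positive; z = 2.5758 gives a 0.01 (99%) interval.
    """
    log_or = math.log((n11 * n22) / (n12 * n21))
    variance = 1 / n11 + 1 / n12 + 1 / n21 + 1 / n22
    half_width = z * math.sqrt(variance)
    return log_or, variance, log_or - half_width, log_or + half_width

# Hypothetical counts (not taken from the presentation).
log_or, variance, lower, upper = log_odds_ratio_ci(10, 20, 30, 40)
print(f"ln(OR) = {log_or:.3f}, var = {variance:.3f}, CI = ({lower:.3f}, {upper:.3f})")
```

An interval that contains 0 is consistent with no association between pool and allele for that 2x2 sub-table.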

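13 ODDS RATIO IN SPARSE DATA
$\ln(\mathrm{OR}) = \ln \dfrac{n_{11}\, n_{22}}{n_{12}\, n_{21}}$
[Table: the pool-by-allele table from slide 11]
SPARSE DATA PROBLEM
$\ln(\mathrm{OR}) = \ln \dfrac{(n_{11} + c_{11})(n_{22} + c_{22})}{(n_{12} + c_{12})(n_{21} + c_{21})}$
$c = 0$: standard
$c = 0.5$: Haldane (1956)
$c_{ij} = 2\, n_{i.}\, n_{.j} / n^2$: Bishop et al. (1975)
Agresti (1999): $c = 0.5$ not valid for $\ln(\mathrm{OR}) > 4$; $c_{ij}$ not valid for $\ln(\mathrm{OR}) > 8$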
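
The three choices of the cell correction listed on this slide can be compared with a short sketch. It assumes the correction is added to every cell of the 2x2 sub-table before the log odds ratio is taken; the function names, the method labels and the counts are hypothetical and only serve the illustration.

```python
import math

def corrections(table, method):
    """Return the 2x2 matrix of constants c_ij added to the cell counts."""
    (n11, n12), (n21, n22) = table
    n = n11 + n12 + n21 + n22
    row = (n11 + n12, n21 + n22)   # row totals n_i.
    col = (n11 + n21, n12 + n22)   # column totals n_.j
    if method == "standard":       # c = 0
        return [[0.0, 0.0], [0.0, 0.0]]
    if method == "haldane":        # c = 0.5
        return [[0.5, 0.5], [0.5, 0.5]]
    if method == "bishop":         # c_ij = 2 * n_i. * n_.j / n^2
        return [[2 * row[i] * col[j] / n**2 for j in range(2)] for i in range(2)]
    raise ValueError(f"unknown method: {method}")

def log_odds_ratio(table, method="haldane"):
    """ln(OR) after adding c_ij; None when a zero cell leaves it undefined."""
    c = corrections(table, method)
    a, b = table[0][0] + c[0][0], table[0][1] + c[0][1]
    g, d = table[1][0] + c[1][0], table[1][1] + c[1][1]
    numerator, denominator = a * d, b * g
    if numerator == 0 or denominator == 0:
        return None
    return math.log(numerator / denominator)

# Hypothetical sparse sub-table with one empty cell.
sparse_table = [[0, 14], [5, 35]]
for method in ("standard", "haldane", "bishop"):
    print(method, log_odds_ratio(sparse_table, method))
```

With `c = 0` the empty cell makes the estimate undefined, while both corrections return a finite value. The data-dependent constants always sum to 2 over the four cells, the same total as adding 0.5 to each.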

14 ODDS RATIO: P-values

15 ODDS RATIO - CI
[Figure panels: 0.01 CI FOR “CONCORDANT” POOLS; 0.01 CI FOR “DISCORDANT” POOLS]

16 ODDS RATIO - REMARKS
many 2x2 comparisons (theoretically) possible: from 18 (m4) to 60 (m1, m6)
significance pattern often inconsistent between alleles: sparse data
difficult to summarize ORs with a single value
MARKER      RESULT
C03.629     association
C05.771     association
C08.410     ?
REN227M12   no association
REN275L19   ? (sparse data)
LEI002      ? (sparse data)
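
The range of 18 to 60 possible 2x2 comparisons can be reproduced with a small counting sketch. It assumes that each comparison pairs two of the four pools with two of a marker's alleles; under that assumption, 18 corresponds to a marker with 3 alleles and 60 to a marker with 5 alleles. The allele counts per marker are an inference, not stated in the presentation, and the function name is hypothetical.

```python
from math import comb

def n_2x2_comparisons(n_pools, n_alleles):
    """Number of 2x2 sub-tables: pairs of pools times pairs of alleles."""
    return comb(n_pools, 2) * comb(n_alleles, 2)

# Four pools, as in the pool-by-allele tables of the presentation.
for n_alleles in (3, 4, 5):
    print(n_alleles, "alleles:", n_2x2_comparisons(4, n_alleles), "comparisons")
```

The count grows quadratically with the number of alleles, which is one reason a single summary value per marker is hard to obtain from pairwise odds ratios.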

17 FURTHER WORK
use all table cells
account for sparseness in testing
multivariate logistic models

18 MULTINOMIAL DISTRIBUTION
[Table: the pool-by-allele table from slide 11]
MULTINOMIAL DISTRIBUTION: Multivariate Logistic Regression
[Table: binomial case, cells shown: n_12; n_21, n_22; n_31, n_32; n_41, n_42]
Multinomial case, all cells used:
pool  147   149   155   157   161
1     n_11  n_12  n_13  n_14  n_15
2     n_21  n_22  n_23  n_24  n_25
3     n_31  n_32  n_33  n_34  n_35
4     n_41  n_42  n_43  n_44  n_45

19 MODEL
GENERAL LOGISTIC MODEL
CONSIDERED MODELS FOR ALLELE FREQUENCIES
1 4 8 16

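20 TEST STATISTIC
MODEL SELECTION
POWER DIVERGENCE FAMILY, Cressie, Read (1984):
$D(\lambda) = \dfrac{2}{\lambda(\lambda + 1)} \sum_i n_i \left[ \left( \dfrac{n_i}{\hat{m}_i} \right)^{\lambda} - 1 \right]$
$\lambda = 1$: Pearson’s $X^2$; $\lambda \to 0$: Likelihood Ratio Test
$n_i$: observed frequencies (DATA); $\hat{m}_i$: estimated frequencies (MODEL)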
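
The power divergence family of Cressie and Read (1984) can be sketched as a single function of the index lambda. This is a minimal illustration with hypothetical counts and a hypothetical function name; it assumes the observed and estimated frequencies have the same total, in which case lambda = 1 gives Pearson's X^2 and the limit lambda -> 0 gives the likelihood ratio statistic.

```python
import math

def power_divergence(observed, expected, lam):
    """Cressie-Read statistic D(lambda) for observed and expected cell counts.

    lam = 1 gives Pearson's X^2; lam = 0 is handled as the limiting
    likelihood ratio statistic. Expected counts must be positive.
    """
    if lam == -1:
        raise ValueError("lam = -1 is a limiting case not covered by this sketch")
    if lam == 0:
        # Empty cells contribute 0 to the likelihood ratio statistic.
        return 2 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)
    total = sum(o * ((o / e) ** lam - 1) for o, e in zip(observed, expected))
    return 2 / (lam * (lam + 1)) * total

# Hypothetical observed counts and model-based expected counts.
observed = [10, 20, 30]
expected = [20.0, 20.0, 20.0]
print("Pearson X^2 (lambda = 1):", power_divergence(observed, expected, 1))
print("Likelihood ratio (lambda = 0):", power_divergence(observed, expected, 0))
```

Other values of lambda interpolate between these two statistics. Negative values would need separate handling of empty cells, which this sketch does not attempt.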

21 TEST STATISTIC
NORMALISATION
SPARSE DATA !
INCREASING CELLS ASYMPTOTICS !
?

22 TEST STATISTIC
ANALYTICAL
Osius, Rojek (1989): $D(\lambda = 1)$
Farrington (1996): $D(\lambda = 1)$ + [term lost in extraction]
Copas (1989): $a \cdot D(\lambda = 1)$
EMPIRICAL: Bootstrap, Jackknife
EVALUATION OF REAL DATA
NORMAL PROPERTIES - simulation: two quantities of D marked with "?" [symbols lost in extraction]
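
The empirical route named on this slide can be illustrated with a parametric bootstrap of Pearson's statistic, D(lambda = 1). This is only one possible reading of "Bootstrap", not the authors' procedure. It assumes the cell probabilities under the model are known and positive, so the model is not refitted in each replicate; the function names, the counts and the number of replicates are hypothetical.

```python
import random

def pearson(observed, expected):
    """Pearson's X^2, i.e. the power divergence statistic with lambda = 1."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def bootstrap_p_value(observed, probs, n_rep=2000, seed=1):
    """Parametric bootstrap p-value for Pearson's statistic.

    Tables are resampled from a multinomial with the given cell
    probabilities; the p-value is the share of simulated statistics
    at least as large as the observed one.
    """
    rng = random.Random(seed)
    n = sum(observed)
    expected = [n * p for p in probs]
    d_obs = pearson(observed, expected)
    cells = range(len(probs))
    exceed = 0
    for _ in range(n_rep):
        simulated = [0] * len(probs)
        for cell in rng.choices(cells, weights=probs, k=n):
            simulated[cell] += 1
        if pearson(simulated, expected) >= d_obs:
            exceed += 1
    return d_obs, (exceed + 1) / (n_rep + 1)

# Hypothetical counts tested against equal cell probabilities.
d_obs, p_value = bootstrap_p_value([10, 20, 30], [1 / 3, 1 / 3, 1 / 3])
print("D(lambda = 1) =", round(d_obs, 3), "bootstrap p-value =", round(p_value, 4))
```

Because the reference distribution is simulated, this approach does not rely on the large-sample approximation that sparse tables make questionable. A fuller version would re-estimate the model in every replicate.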

24 LITERATURE
Agresti, A. (1990) Categorical data analysis. New York, Chichester, Brisbane, Toronto, Singapore: John Wiley & Sons.
Agresti, A. (1999) On logit confidence intervals for the odds ratio with small samples. Biometrics 55:597-602.
Barratt, B.J., Payne, F., Rance, H.E., Nutland, S., Todd, J.A., Clayton, D.G. (2002) Identification of the sources of error in allele frequency estimations from pooled DNA indicates an optimal experimental design. Annals of Human Genetics 66:393-405.
Bishop, Y.M.M., Fienberg, S.E., Holland, P. (1975) Discrete multivariate analysis. Cambridge, Massachusetts: MIT Press.
Copas, J.B. (1989) Unweighted sum of squares test for proportions. Applied Statistics 38:71-80.
Cressie, N.A.C., Read, T.R.C. (1984) Multinomial goodness-of-fit tests. Journal of the Royal Statistical Society Ser. B 46:440-464.
Farrington, C.P. (1996) On assessing goodness of fit of generalized linear models to sparse data. Journal of the Royal Statistical Society Ser. B 58:349-360.
Haldane, J.B.S. (1956) The estimation and significance of the logarithm of a ratio of frequencies. Annals of Human Genetics 20:309-311.
Osius, G., Rojek, D. (1989) Normal goodness-of-fit tests for parametric multinomial models with large degrees of freedom. Fachbereich Mathematik/Informatik, Universitaet Bremen. Mathematik Arbeitspapiere 36:

