Presentation on theme: "Class 06 Case: The Roulette Wheel Goodness of Fit Tests EMBS 11.2 07."— Presentation transcript:
Class 06 Case: The Roulette Wheel Goodness of Fit Tests EMBS
What we learned last class Probability Distributions have characteristics Descriptive Statistics are used to estimate those characteristics. – Location (mean, median, mode) – Variability (variance and standard deviation) – Shape (skewness) The MEAN is important. – Sample mean “value” times n is total value. Measures of variability are under-appreciated
Descriptive Statistics Matter
Baseball Statistics A batter came to the plate five times – Got a hit – Struck Out – Walked – Flied Out – Grounded Out WALK is the fault of the pitcher: 1 success in 4 trials WALK is the fault of the batter: 2 successes in 5 trials Batting Average = ¼ = On Base Percentage = 2/5 = 0.400
The Roulette wheel. Surveillance video of 18 hours of play of a roulette wheel in a Reno, Nevada casino – 904 spins of the wheel – 22,527 bets places
OutcomeFrequencyNumber of Bets OutcomeFrequencyNumber of Bets , Total90422,527
First We Examine the Wheel H0: The wheel works properly H0: All 38 outcomes have equal probability of occurring H0: P0=P28=P9=…=P2=1/38 HA: they are not all equal Like before, the Hypothesis is about a parameter of a probability distribution
Goodness-of-fit Test OutcomeObservedExpectedDistance Total Calculate the expected counts under H0 2.Calculate the Distances as (O-E) 2 /E 3.The sum of the distances is the test statistic. We call it the calculated chi- squared. 4.We reject H0 (and say the results are statistically significant) if the calculated chi-squared statistic is too big. The calculated chi-squared statistic The sum of the distances.
We need a p-value! For the lady tasting tea P(X≥8 │ H0) = 1-binomdist(7,10,.5,true) Pvalue = For a GOODNESS OF FIT TEST P(χ 2 ≥ calculated χ 2 │H0) = chidist(calculated χ 2, dof) dof stands for “degrees of freedom” dof is the parameter of the chi-squared distribution dof here is 37, the number of cells - 1. P(χ 2 ≥ 31.2│H0) = chidist(31.2,37) Pvalue = 0.74 NOT statistically significant. Number correct depends on n and P χ 2 depends on number of cells - 1. Can also use =chisq.dist.rt(31.2,37)
WARNING The chi-squared test does not work well when some cells have low expected counts. If some cells have expected counts < 5, combine then with neighboring cells.
Roulette Wheel Demonstration H0: All 38 are equally likely to get bet on. Ha: The p’s are not equal. (Some segments are more popular than others)
Lorex Pharma H0: The Fill Amounts are Normally Distributed with μ=10.2 and σ=0.16 Ha: They are not…
Assignment 08 Due Monday, Feb 13 Youth Soccer (football) teams from several countries compete annually in an important international tournament. The birth months (Jan=1, Feb=2,.. Dec=12) of the 288 boys competing in the 2005 under 16 division showed higher counts for the early months and lower counts for the later months. Formulate and test a relevant hypothesis If you find statistical significance, offer an option about how it came to be that early months are more prevalent. Helsen, W.F., Van Winckel, J., and WIlliams, M., The relative age effect in youth soccer across Europe, Journal of Sports Sciences, June 2005; 23(6): opinion