Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chi-Squared Test The chi-squared ( ) test, or Pearson’s chi-squared test, evaluates the likelihood that variation in your results was due to chance.

Similar presentations


Presentation on theme: "Chi-Squared Test The chi-squared ( ) test, or Pearson’s chi-squared test, evaluates the likelihood that variation in your results was due to chance."— Presentation transcript:

1 Chi-Squared Test The chi-squared ( ) test, or Pearson’s chi-squared test, evaluates the likelihood that variation in your results was due to chance. It can’t tell you whether the variation was because your independent variable caused it, but it can be used as evidence to rule out a null hypothesis.

2 AP Biology Curricula that Incorporates c2
Genetic Problems Hardy-Weinberg Equilibrium Problems Behavior Lab Mitosis Lab In part b, a number line representation might be helpful to represent f = g’, but students must remember to justify the answer in words since AP graders are instructed to “cover up the number line” and look at what students wrote-students may show a number line to assist, but teachers are recommended to emphasize a “nice, concise” way of writing the reason for the answer as shown on the rubric. Students MUST connect g’(x) to f(x) to receive credit.

3 Chi-Squared Test Sigma, “the sum of” “Expected,” the
data expected based on the hypothesis “Observed,” the data you actually collected

4 HOW DO YOU USE THIS TABLE PROPERLY?
This statistical test is compared to a theoretical probability distribution These probability (p) values are on the Chi Square distribution table HOW DO YOU USE THIS TABLE PROPERLY? you need to determine the degrees of freedom Degrees of freedom is the # of groups (categories) in your data minus one (1) If the level of significance read from the table is less than .05 or 5% then your hypothesis is accepted and the data is useful…the data is NOT due to randomness!

5 Statistics, Analysis, and AP Biology
On the AP exam, both multiple choice and free response style questions visit this connection using graphical, analytical, and application-based stems. Let’s take a look at some recent free response questions that illustrate this concept:

6 CHI SQUARE TABLE Accept Null Hypothesis (chance ONLY)
CHI-SQUARE DISTRIBUTION TABLE Accept Null Hypothesis (chance ONLY) Reject Null Hypothesis (NOT chance ONLY) Probability (p) Degrees of Freedom 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.01 0.001 1 0.004 0.02 0.06 0.15 0.46 1.07 1.64 2.71 3.84 6.64 10.83 2 0.21 0.45 0.71 1.39 2.41 3.22 4.60 5.99 9.21 13.82 3 0.35 0.58 1.01 1.42 2.37 3.66 4.64 6.25 7.82 11.34 16.27 4 1.06 1.65 2.20 3.36 4.88 7.78 9.49 13.38 18.47 5 1.14 1.61 2.34 3.00 4.35 6.06 7.29 9.24 11.07 15.09 20.52 6 1.63 3.07 3.83 5.35 7.23 8.56 10.64 12.59 16.81 22.46 7 2.17 2.83 3.82 4.67 6.35 8.38 9.80 12.02 14.07 18.48 24.32 8 2.73 3.49 4.59 5.53 7.34 9.52 11.03 13.36 15.51 20.09 26.12 9 3.32 4.17 5.38 6.39 8.34 10.66 12.24 14.68 16.92 21.67 27.88 10 3.94 4.86 6.18 7.27 9.34 11.78 13.44 15.99 18.31 23.21 29.59 In biological applications, a probability 5% is usually adopted as the standard conventional criteria for probability to have statistical significance is

7 The following formula is used
You need 2 different hypotheses: 1. NULL Hypothesis Data are occurring by chance and it is all RANDOM! There is NO preference between the groups of data. 2. Alternative Hypothesis Data are occurring by some outside force. It is NOT by chance and it is NOT RANDOM! There is preference between the groups of data.

8 Student’s Pitfalls in Using c2
Students lack an understanding of the “null hypothesis”. Students have difficulties in determining what is expected. Students have difficulties in working with fractions of different denominators. In part b, a number line representation might be helpful to represent f = g’, but students must remember to justify the answer in words since AP graders are instructed to “cover up the number line” and look at what students wrote-students may show a number line to assist, but teachers are recommended to emphasize a “nice, concise” way of writing the reason for the answer as shown on the rubric. Students MUST connect g’(x) to f(x) to receive credit.

9 The Null Hypothesis The null hypothesis refers to a general statement or default position that there is no relationship between two measured phenomena, OR a type of hypothesis used in statistics that proposes that no statistical significance exists in a set of given observations. The null hypothesis attempts to show that no variation exists between variables. If the null hypothesis is rejected or the data does not support it, then one can conclude there is statistical significance between variables. In part b, a number line representation might be helpful to represent f = g’, but students must remember to justify the answer in words since AP graders are instructed to “cover up the number line” and look at what students wrote-students may show a number line to assist, but teachers are recommended to emphasize a “nice, concise” way of writing the reason for the answer as shown on the rubric. Students MUST connect g’(x) to f(x) to receive credit.

10 Two Types of Hypotheses: 1. NULL HYPOTHESIS
states that there is no substantial statistical deviation between observed and expected data. a hypothesis of no difference (or no effect) is called a null hypothesis symbolized H0 In other words, the results are totally random and occurred by chance alone. There is NO preference. The null hypothesis states that the two variables are independent, or that there is NO relationship to one another.

11 Hypothesis Example A scientist studying bees and butterflies.
Her hypothesis was that a single bee visiting a flower will pollinate with a higher efficiency than a single butterfly, which will help produce a greater number of seeds in the flower bean pod. We will call this hypothesis H1 or an alternate hypothesis because it is an alternative to the null hypothesis. What is the null hypothesis? H0: There is no significant difference between bees and butterflies in the number of seeds produced by the flowers they pollinate.

12 Two Types of Hypotheses: 2. ALTERNATIVE HYPOTHESIS
states that there IS a substantial/ significant statistical deviation between observed and expected data. a hypothesis of difference (or effect) is called a alternative hypothesis symbolized H1 In other words, the results are affected by an outside force and are NOT random and did NOT occur by chance alone. There is a preference .

13 2 Types of Chi Square Problems
Non-genetic Null Hypothesis: Data is due to chance and is completely random. There is no preference between the groups/categories. Alternative Hypothesis Data is NOT due to chance and there IS a preference between the groups/categories. Data is not random. Genetic Data is due to chance and is random due to independent assortment being random. Punnett square ratios are expected. If there are 2+ genes involved in the experiment…There is no gene linkage affecting independent assortment & segregation. Punnett square ratios are expected. Data is due NOT to chance and is NOT random. Punnett square ratios are NOT expected. If there are 2+ genes …There IS gene linkage affecting independent assortment & segregation

14 most scientists have decided:
What level of probability does one choose to decide whether two groups differ as a result of NON-CHANCE events or simply because of CHANCE? most scientists have decided: If difference between 2+ groups is so great that it would happen by chance fewer than 1 out of 20 times ("P" < 0.05), then the groups differ significantly. That is, the null hypothesis (due to chance/no difference in data) is rejected. If greater confidence in the results is desired, scientists will choose probability levels of less than 1 in 100 (P < .01) or 1 in 1000 (P < 0.001).

15 Chi-Squared Test Example, let’s test the hypothesis that a coin is weighted towards heads. Null hypothesis: Coin flips are purely chance. If I flip the coin 100 times, and the hypothesis is correct, it should come up heads 50 times and tails 50 times. I do the test, and it comes up heads 68 times and tails 32 times. Chi-squared analysis can help me determine whether that variation is due to chance, i.e. whether my null hypothesis holds any water. You must have at least two possible outcomes in your experiment (heads and tails, here) for the test to work. Chi-square doesn’t work if you don’t have enough data points/trials. An oft-cited magic number is 30, but run as many trials as is reasonable and let the mathematical chips fall where they may.

16 Chi-Squared Test “Observed,” 68 heads, 32 tails “Expected,” 50 heads,
(68-50)2 = …… /50 = 6.48 PLUS (32-50)2 = ……… 324/50 = 6.48 Chi square = = 12.96

17 Chi-Squared Test Degrees of freedom: The number of outcomes minus 1
In our coin example, we have two outcomes being tested, heads and tails. That gives us one degree of freedom (2-1 = 1). Critical value (or p-value): Basically, how certain you can be of your result. The industry standard p-value is .05, and if your chi-square works it, that amounts to “I am 95% positive that this result is non-random.” A p-value of .01 amounts to “I am 99% positive that this result is non-random.” p-value of .001 is 99.9% certainty. Use .05 in AP Bio.

18 Chi-Squared Test Now that you have your chi-square, degrees of freedom, and critical value, you’re nearly done. You just need a chart of critical values. Find your degree of freedom and your p-value in the row and column headers. Read down and across to find your cell. If your chi-squared value is GREATER than that number, your null hypothesis is REJECTED. You’ve supported your results as non-random. If your chi-squared value is LESS than or EQUAL to that number, your null hypothesis is SUPPORTED. Variation is random.

19 Chi-Squared Test Our coin test gave us a chi-square of Does that support or reject the null hypothesis? Does this mean that the coin is definitely rigged or definitely fair??

20 Chi-Squared Test Try this problem:
You’re testing to see if fruit flies prefer different fruits: apples, oranges, grapefruits. Null hypothesis: there is no preference. Actual data: Of 147 fly visits that landed on fruit for at least 20 seconds, 48 flies spent at least 20 seconds on an apple, 87 flies spent at least 20 seconds on an orange, and 12 flies spent at least 20 seconds on a grapefruit. Is this variation due to chance?

21 Chi-Squared Test You can use means instead of counts. Try this problem: You’re testing to see if fruit flies prefer different fruits: apples, oranges, grapefruits. Null hypothesis: there is no preference. Actual data: You release 30 flies into a container with three fruits and clock how much time they spend on fruit. Some of your flies spend more time on apples or oranges or grapefruits, others less. Altogether, they spend an average of 45% of their time on apples, 28% of their time on oranges, and 27% of their time on grapefruits. Is this variation due to chance?

22 Chi-Squared Test If you reject the null hypothesis, your results can be reported as “significant” or “statistically significant.” When writing them up, you need to include all of the following: degrees of freedom, critical value (written as “less than” the p value), number of subjects (N), chi squared value. Round to two decimal places. For instance, I would write of our coin test: Coin flips were found to be non-random in a chi-squared test, X2 (2, N=100) = 12.96, p<.05. From this, we can conclude that coin flips were significantly weighted towards heads.

23 Statistics Again: statistics like these don’t answer your question for you. I’m more than 95% confident that the coin flips are non-random, but it doesn’t mean the coin was rigged! Maybe it was the way I flipped it, or air currents, or the table shape, or something else. The stats are like another data point, another piece of evidence. You have to engage your brain and interpret your statistics, no differently than how you must interpret raw data. And a crummy study design can give you great-looking statistics (or terrible ones).


Download ppt "Chi-Squared Test The chi-squared ( ) test, or Pearson’s chi-squared test, evaluates the likelihood that variation in your results was due to chance."

Similar presentations


Ads by Google