Chi Square Analyses: Comparing Frequency Distributions.

Slides:



Advertisements
Similar presentations
CHI-SQUARE(X2) DISTRIBUTION
Advertisements

AP Biology.  Segregation of the alleles into gametes is like a coin toss (heads or tails = equal probability)  Rule of Multiplication  Probability.
Mendelian Genetics. Genes- genetic material on a chromosome that codes for a specific trait Genotype- the genetic makeup of the organism Phenotype- the.
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
The Chi-Square Test for Association
Laws of Probability and Chi Square
Chi-Square Analysis Mendel’s Peas and the Goodness of Fit Test.
Statistical Inference for Frequency Data Chapter 16.
Statistics for AP Biology. Understanding Trends in Data Mean: The average or middle of the data Range: The spread of the data Standard deviation: Variation.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Presentation 12 Chi-Square test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
11.4 Hardy-Wineberg Equilibrium. Equation - used to predict genotype frequencies in a population Predicted genotype frequencies are compared with Actual.
Chi-Squared Test.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chi Square AP Biology.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Chapter 3 – Basic Principles of Heredity. Johann Gregor Mendel (1822 – 1884) Pisum sativum Rapid growth; lots of offspring Self fertilize with a single.
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory. How.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Chi square analysis Just when you thought statistics was over!!
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Chapter Outline Goodness of Fit test Test of Independence.
State the ‘null hypothesis’ State the ‘alternative hypothesis’ State either one-tailed or two-tailed test State the chosen statistical test with reasons.
Chi-Square Analysis AP Biology.
Chi-Square Test (χ 2 ) χ – greek symbol “chi”. Chi-Square Test (χ 2 ) When is the Chi-Square Test used? The chi-square test is used to determine whether.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Analyzing Data  2 Test….”Chi” Square. Forked-Line Method, F2 UuDd x UuDd 1/4 UU 1/2 Uu 1/4 uu 1/4 DD 1/2 Dd 1/4 dd 1/4 DD 1/2 Dd 1/4 dd 1/4 DD 1/2 Dd.
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
Chi Square Pg 302. Why Chi - Squared ▪Biologists and other scientists use relationships they have discovered in the lab to predict events that might happen.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
I. Allelic, Genic, and Environmental Interactions
AP Biology Heredity PowerPoint presentation text copied directly from NJCTL with corrections made as needed. Graphics may have been substituted with a.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
 Test for Qualitative variables Chi Square Test Dr. Asif Rehman.
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Test of independence: Contingency Table
Math in Genetics Making Suby Proud.
Chapter 9: Non-parametric Tests
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Hypothesis Testing Review
Chi-Square Analysis AP Biology.
MENDELIAN GENETICS CHI SQUARE ANALYSIS
Analyzing Data c2 Test….”Chi” Square.
UNIT 6: MENDELIAN GENETICS CHI SQUARE ANALYSIS
Chi-Square Analysis.
Chi-Square Analysis AP Biology.
Contingency Tables: Independence and Homogeneity
Chi-Square Analysis AP Biology.
Chi-Square Analysis AP Biology.
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Completion and analysis of Punnett squares for dihybrid traits
20 May 2019 Chi2 Test For Genetics Help sheet.
Chi-Square Test A fundamental problem in Science is determining whether the experiment data fits the results expected. How can you tell if an observed.
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Chi-Square Analysis AP Biology.
Presentation transcript:

Chi Square Analyses: Comparing Frequency Distributions

Chi-Square Tests test probability distributions from nominal, ordinal, or discrete data Compare data to a theoretical distribution. Compare two sets of data

Chi Square Tests for Goodness of Fit Two types – extrinsic and intrinsic Assumptions of both tests – Measurement on at least a nominal scale – Observations are independent – The expected frequencies for each category must be specified – The sample size must be sufficiently large so that no category has an expected frequency of < 5.

Chi Square Tests for Goodness of Fit Hypotheses – Null – the observed frequency distribution is the same as the hypothesized frequency distribution – Alternative - the observed and hypothesized distributions are different

Chi Square Tests for Goodness of Fit Test Statistic – The test statistic is based on the difference between the observed and expected frequencies. It is calculated by:

Chi Square Test for Goodness of Fit In an extrinsic test, no population parameters need to be estimated from the data. An intrinsic test requires an estimation of a population parameter from the data collected. – Technically, the degrees of freedom should be reduced by 1 for each parameter estimated – However, this is a minor effect and not always considered (we won’t worry about it). – An intrinsic test is commonly used when comparing a sample to a derived distribution such as the poisson or binomial distribution

Chi Square Test for Goodness of Fit (Extrinsic) Example – Cross of two pea plants with purple flowers. – When you do the cross, you get 80 plants with round seeds, and 20 with wrinkled. – Your biological hypotheses are that: the parents were heterozygous (since some white flowered offspring were produced) P is completely dominant to p genes segregate correctly fertilization is random zygotes have the same probability of survival with respect to this gene.

Example – Your biological hypotheses are that: the parents were heterozygous (since some white flowered offspring were produced) P is completely dominant to p genes segregate correctly fertilization is random zygotes have the same probability of survival with respect to this gene. GAMETES of PARENTS in = Frequency Pp PPPPp p pp Expected Ratio under THESE hypotheses: ¾ Purple offspring ¼ White offspring

Chi Square Test for Goodness of Fit (Extrinsic) Offspring Phenotype OBSERVEDEXPECTED by HYPOTHESIS O-E(O-E) 2 Purple8075 (3/4)525 White2025 (1/4) SUM = 0 bummer SUM = 25 Hmmm… So, we want to see how close our observed results are to what we expect under our hypothesis. Maybe the “total difference” would be a good measure… But sample size matters….

Chi Square Test for Goodness of Fit (Extrinsic) Offspring Phenotype OBSERVEDEXPECTED by HYPOTHESIS O-E(O-E) 2 Purple (3/4)525 White (1/4) SUM = 0 bummer SUM = 25 same So, we want to see how close our observed results are to what we expect under our hypothesis. Maybe the “total difference” would be a good measure… But sample size matters….these results are a lot closer to the expected values, but give the same total. So we need to evaluate the “sum of Squares” in relation to sample size… “mean square”

Chi Square Test for Goodness of Fit (Extrinsic) Offspring Phenotype OBSERVEDEXPECTED by HYPOTHESIS O-E(O-E) 2 (O-E) 2 /E Purple8075 (3/4) White2025 (1/4) So, we want to see how close our observed results are to what we expect under our hypothesis. Maybe the “total difference” would be a good measure… This = your calculated Chi-Square value, and you compare it to a Chi-Square table with df = Categories (P or W = 2) – 1 = 2-1 = 1.

The critical value is associated with a probability; in this case p = This is the probability that results as deviant as yours could have occurred by chance if your null hypothesis was true. You only reject the null hypothesis if you observe a more deviant pattern. (This would make your calculated value greater than the threshold critical value).

Chi Square Test for Goodness of Fit (Intrinsic) Example – In the 98 year period from , there were 159 U.S. landfalling hurricanes. Does the number of landfalling hurricanes per year follow a Poisson distribution? – Calculate the expected frequencies – Calculate the expected number by multiplying the frequency by the number of categories (here, years = 98) Formula: p(x) = X x e -x x!

Chi Square Test for Goodness of Fit (Intrinsic) Hurricanes per year Observed # Expected freq Expected #

Chi Square Test for Goodness of Fit (Intrinsic) Hurricanes per year Observed # Expected freq Expected # > Since we had an expected value <5, we combined categories to fix this problem.

Chi Square Test for Goodness of Fit (Intrinsic) Calculate the chi square statistic in the same way as before, and look up on table. Here: – X 2 = – Tabled value for  = 0.05 = 7.81 – Thus, we fail to reject the null hypothesis, supporting the claim that the annual number of landfalling U.S. hurricanes follows a Poisson distribution (rare, independent, random).

Chi Square Test of Independence Also called the Chi Square Test for Contingency Tables This test is performed to see if two variables, both measured on a nominal scale, are related in some way. The question asked here is if there is a relationship between the variables; the null hypothesis is that no relationship exists – they are “independent”.

Chi Square Test of Independence Steps in doing the test – 1. Form a table, or matrix, from the data collected – 2. Calculate row, column, and grand totals for the matrix – 3. Use these totals to calculate expected values (frequencies) for each cell in the matrix Calculated by: [(row total) x (column total)]/grand total Based on the product rule – the probability of two independent events occurring together is the product of their independent probabilities.

Chi Square Test of Independence Classic Example: Testing for Linkage or Independent Assortment between two loci Suppose we cross two pea plants: PpTt x pptt - Purple is completely dominant to white - Tall is completely dominant to short Produce the following results in the offspring: PT = 32 Pt = 22 pT = 23 Pt = ARE THE GENES ASSORTING INDEPENDENTLY, OR ARE THEY LINKED?

Chi Square Test of Independence ARE THE GENES ASSORTING INDEPENDENTLY, OR ARE THEY LINKED? PT = 32 Pt = 22 pT = 23 Pt = Tt P p CONTINGENCY TABLE IF these events (flower color and plant height) are inherited independently, THEN the frequency of any combined outcome should be = to the product of their independent probabilities: IF IA, THEN f(PT) = f(P) x f(T) x N = 54/113 x 55/113 x 113 = Reduces to: f(PT) = f(P) x f(T) x N = 54 x 55/113 = = RT x CT/GT

Chi Square Test of Independence ARE THE GENES ASSORTING INDEPENDENTLY, OR ARE THEY LINKED? PT = 32 Pt = 22 pT = 23 Pt = Texpt P p CONTINGENCY TABLE IF these events (flower color and plant height) are inherited independently, THEN the frequency of any combined outcome should be = to the product of their independent probabilities: IF IA, THEN f(PT) = f(P) x f(T) x N = 54/113 x 55/113 x 113 = Reduces to: f(PT) = f(P) x f(T) x N = 54 x 55/113 = = RT x CT/GT

Texpt P p ObsExpO-E(O-E) 2 /E PT Pt pT pt Df = (R-1)(C-1) in contingency table (1)(1) = 1, p = 0.05, critical = 3.84…. Reject Ho.