Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.

Slides:



Advertisements
Similar presentations
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Chi-Square Test Chi-square is a statistical test commonly used to compare observed data with data we would expect to obtain according to a specific hypothesis.
Hypothesis Testing IV Chi Square.
Chapter 13: The Chi-Square Test
CJ 526 Statistical Analysis in Criminal Justice
Chi-square Test of Independence
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Chapter 11(1e), Ch. 10 (2/3e) Hypothesis Testing Using the Chi Square ( χ 2 ) Distribution.
Hypothesis Testing IV (Chi Square)
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
Chi-Squared Test.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
CJ 526 Statistical Analysis in Criminal Justice
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Copyright © 2012 by Nelson Education Limited. Chapter 10 Hypothesis Testing IV: Chi Square 10-1.
Inquiry 1 written AND oral reports due Th 9/24 or M 9/28.
Chapter 16 The Chi-Square Statistic
Chapter 11 Hypothesis Testing IV (Chi Square). Chapter Outline  Introduction  Bivariate Tables  The Logic of Chi Square  The Computation of Chi Square.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Chi- square test x 2. Chi Square test Symbolized by Greek x 2 pronounced “Ki square” A Test of STATISTICAL SIGNIFICANCE for TABLE data.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chi square analysis Just when you thought statistics was over!!
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Today: Chi squared and non- nuclear inheritance. Homologous pair of chromosomes Linkage can be used to determine distance.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Statistical Analysis: Chi Square AP Biology Ms. Haut.
© Copyright McGraw-Hill 2004
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chi-Square Analysis AP Biology.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
PROBABILITY AND STATISTICS The laws of inheritance can be used to predict the outcomes of genetic crosses For example –Animal and plant breeders are concerned.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
CHAPTER 8: RELATIONSHIPS BETWEEN TWO VARIABLES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Hypothesis Testing Review
Community &family medicine
Qualitative data – tests of association
Hypothesis Testing Using the Chi Square (χ2) Distribution
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
MENDELIAN GENETICS CHI SQUARE ANALYSIS
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chapter 10 Analyzing the Association Between Categorical Variables
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chi-Square Analysis AP Biology.
Analyzing the Association Between Categorical Variables
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
CHI SQUARE (χ2) Dangerous Curves Ahead!.
Presentation transcript:

Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables organized in a bivariate table. Chi-square requires no assumptions about the shape of the population distribution from which a sample is drawn.

The Chi Square Test A statistical method used to determine goodness of fit –Goodness of fit refers to how close the observed data are to those predicted from a hypothesis Note: –The chi square test does not prove that a hypothesis is correct It evaluates to what extent the data and the hypothesis have a good fit

Limitations of the Chi-Square Test The chi-square test does not give us much information about the strength of the relationship or its substantive significance in the population. The chi-square test is sensitive to sample size. The size of the calculated chi-square is directly proportional to the size of the sample, independent of the strength of the relationship between the variables. The chi-square test is also sensitive to small expected frequencies in one or more of the cells in the table.

Statistical Independence Independence (statistical): the absence of association between two cross-tabulated variables. The percentage distributions of the dependent variable within each category of the independent variable are identical.

Hypothesis Testing with Chi- Square Chi-square follows five steps: 1.Making assumptions (random sampling) 2.Stating the research and null hypotheses 3.Selecting the sampling distribution and specifying the test statistic 4.Computing the test statistic 5.Making a decision and interpreting the results

The Assumptions The chi-square test requires no assumptions about the shape of the population distribution from which the sample was drawn. However, like all inferential techniques it assumes random sampling.

Stating Research and Null Hypotheses The research hypothesis (H 1 ) proposes that the two variables are related in the population. The null hypothesis (H 0 ) states that no association exists between the two cross- tabulated variables in the population, and therefore the variables are statistically independent.

H 1 : The two variables are related in the population. Gender and fear of walking alone at night are statistically dependent. No83.3%57.2%71.1% Yes16.7%42.8%28.9% Total100%100%100% AfraidMenWomenTotal

H 0 : There is no association between the two variables. Gender and fear of walking alone at night are statistically independent. AfraidMenWomenTotal No71.1%71.1%71.1% Yes28.9%28.9%28.9% Total100%100%100%

The Concept of Expected Frequencies Expected frequencies f e : the cell frequencies that would be expected in a bivariate table if the two tables were statistically independent. Observed frequencies f o : the cell frequencies actually observed in a bivariate table.

Calculating Expected Frequencies To obtain the expected frequencies for any cell in any cross-tabulation in which the two variables are assumed independent, multiply the row and column totals for that cell and divide the product by the total number of cases in the table. f e = (column marginal)(row marginal) N

Chi-Square (obtained) The test statistic that summarizes the differences between the observed (fo) and the expected (fe) frequencies in a bivariate table.

Calculating the Obtained Chi- Square f e = expected frequencies f o = observed frequencies

The Sampling Distribution of Chi- Square The sampling distribution of chi-square tells the probability of getting values of chi-square, assuming no relationship exists in the population. The chi-square sampling distributions depend on the degrees of freedom. The   sampling distribution is not one distribution, but is a family of distributions.

The Sampling Distribution of Chi- Square The distributions are positively skewed. The research hypothesis for the chi-square is always a one-tailed test. Chi-square values are always positive. The minimum possible value is zero, with no upper limit to its maximum value. As the number of degrees of freedom increases, the   distribution becomes more symmetrical.

Determining the Degrees of Freedom df = (r – 1)(c – 1) where r = the number of rows c = the number of columns

Calculating Degrees of Freedom How many degrees of freedom would a table with 3 rows and 2 columns have? (3 – 1)(2 – 1) = 2 2 degrees of freedom

The Chi Square Test (we will cover this in lab;) The general formula is      (O – E) 2 E where –O = observed data in each category –E = observed data in each category based on the experimenter’s hypothesis –  = Sum of the calculations for each category

Consider the following example in Drosophila melanogaster Gene affecting wing shape –c + = Normal wing –c = Curved wing Gene affecting body color –e + = Normal (gray) –e = ebony Note: –The wild-type allele is designated with a + sign –Recessive mutant alleles are designated with lowercase letters The Cross: –A cross is made between two true-breeding flies (c + c + e + e + and ccee). The flies of the F 1 generation are then allowed to mate with each other to produce an F 2 generation.

The outcome –F 1 generation All offspring have straight wings and gray bodies –F 2 generation 193 straight wings, gray bodies 69 straight wings, ebony bodies 64 curved wings, gray bodies 26 curved wings, ebony bodies 352 total flies Applying the chi square test –Step 1: Propose a null hypothesis (H o ) that allows us to calculate the expected values based on Mendel’s laws The two traits are independently assorting

–Step 2: Calculate the expected values of the four phenotypes, based on the hypothesis According to our hypothesis, there should be a 9:3:3:1 ratio on the F 2 generation PhenotypeExpected probability Expected number Observed number straight wings, gray bodies 9/169/16 X 352 = straight wings, ebony bodies 3/163/16 X 352 = curved wings, gray bodies 3/163/16 X 352 = curved wings, ebony bodies 1/161/16 X 352 = 22 24

–Step 3: Apply the chi square formula    (O 1 – E 1 ) 2 E1E1 (O 2 – E 2 ) 2 E2E2 (O 3 – E 3 ) 2 E3E3 (O 4 – E 4 ) 2 E4E4 +++    (193 – 198) (69 – 66) 2 66 (64 – 66) 2 66 (26 – 22)       1.06 Expected number Observed number

Step 4: Interpret the chi square value –The calculated chi square value can be used to obtain probabilities, or P values, from a chi square table These probabilities allow us to determine the likelihood that the observed deviations are due to random chance alone –Low chi square values indicate a high probability that the observed deviations could be due to random chance alone –High chi square values indicate a low probability that the observed deviations are due to random chance alone –If the chi square value results in a probability that is less than 0.05 (ie: less than 5%) it is considered statistically significant The hypothesis is rejected

Step 4: Interpret the chi square value –Before we can use the chi square table, we have to determine the degrees of freedom (df) The df is a measure of the number of categories that are independent of each other If you know the 3 of the 4 categories you can deduce the 4 th (total number of progeny – categories 1-3) df = n – 1 –where n = total number of categories In our experiment, there are four phenotypes/categories –Therefore, df = 4 – 1 = 3 –Refer to Table 2.1

1.06

Step 4: Interpret the chi square value –With df = 3, the chi square value of 1.06 is slightly greater than (which corresponds to P-value = 0.80) –P-value = 0.80 means that Chi-square values equal to or greater than are expected to occur 80% of the time due to random chance alone; that is, when the null hypothesis is true. –Therefore, it is quite probable that the deviations between the observed and expected values in this experiment can be explained by random sampling error and the null hypothesis is not rejected. What was the null hypothesis?