Stat 512 – Lecture 14 Analysis of Variance (Ch. 12)

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
CHAPTER 25: One-Way Analysis of Variance: Comparing Several Means ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
Chapter 13: Inference for Distributions of Categorical Data
The Two Factor ANOVA © 2010 Pearson Prentice Hall. All rights reserved.
CHAPTER 11 Inference for Distributions of Categorical Data
Stat 217 – Day 24 Analysis of Variance Have yesterday’s handout handy.
Class 23: Thursday, Dec. 2nd Today: One-way analysis of variance, multiple comparisons. Next week: Two-way analysis of variance. I will the final.
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Stat 322 – Day 14 ANOVA and Multiple Comparisons.
Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.
Stat 217 – Day 27 Chi-square tests (Topic 25). The Plan Exam 2 returned at end of class today  Mean.80 (36/45)  Solutions with commentary online  Discuss.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Stat 512 – Day 8 Tests of Significance (Ch. 6). Last Time Use random sampling to eliminate sampling errors Use caution to reduce nonsampling errors Use.
Chapter 9 Hypothesis Testing.
Stat 512 – Lecture 15 Two-way ANOVA (Ch. 12). Last Time – Analysis of Variance (ANOVA) When: Want to compare two or more population/true treatment means.
Lecture 14: Thur., Feb. 26 Multiple Comparisons (Sections ) Next class: Inferences about Linear Combinations of Group Means (Section 6.2).
Lecture Slides Elementary Statistics Twelfth Edition
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Inferential Statistics
Chapter 12: Analysis of Variance
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
STAT 3130 Statistical Methods I Session 2 One Way Analysis of Variance (ANOVA)
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.
Introduction to Statistical Inference Probability & Statistics April 2014.
Comparing Two Proportions
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
ANOVA (Analysis of Variance) by Aziza Munir
Chapter 22: Comparing Two Proportions
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
CHAPTER 11 SECTION 2 Inference for Relationships.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Copyright © 2010 Pearson Education, Inc. Slide Beware: Lots of hidden slides!
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Statistical Significance for a two-way table Inference for a two-way table We often gather data and arrange them in a two-way table to see if two categorical.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Analysis of Variance ANOVA - method used to test the equality of three or more population means Null Hypothesis - H 0 : μ 1 = μ 2 = μ 3 = μ k Alternative.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
Textbook Section * We already know how to compare two proportions for two populations/groups. * What if we want to compare the distributions of.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
CHAPTER 11 Inference for Distributions of Categorical Data
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Presentation transcript:

Stat 512 – Lecture 14 Analysis of Variance (Ch. 12)

Recap Comparing means  Male/female body temperatures  Average improvement with/without enough sleep Comparing proportions  Newspaper believability in 1998/2002  Survival with and without letrozole Comparing more than two proportions  Probability of selecting a woman juror among 7 judges

Recap – Chi-Square Procedures Why use Chi-Square procedures?  Each test (at the 5% level of significance) has a 5% chance of making a Type I Error  With 21 tests, have a 66% chance of at least one false rejection of the null hypothesis  With the overall chi-square procedure, have a 5% chance of making a type I error Downside?  Only get to say “at least one judge differs”

Recap – Chi-Square Procedures How carry out a chi-square test?  Minitab  Enter two-way table of observed counts Not row and column totals  Is test statistic too large?  Interpret p-value as usual if all expected counts are at least 5 and have randomness in study design  Can do some follow-up analysis on chi-square contributions

Recap – Chi-Square Procedures When use chi-square procedures?  Answer 1: Whenever have two qualitative variables on each observational unit

Answer 2 Chi-square tests arise in several situations 1. Comparing 2 or more population proportions H 0 :       H a : at least one  i differs 2. Comparing 2 or more population distributions on categorical response variable H 0 : the population distributions are the same H a : the population distributions are not all the same

Answer 2 (cont.) 3. Association between 2 categorical variables H o : no association between var 1 and var 2 (independent) H a : is an association between the variables Technical conditions: Random Case 1 and 2: Independent random samples from each population or randomized experiment Case 3: Random sample from population of interest Large sample(s) All expected cell counts >5

PP 11 – Problem 1 (a) Technical conditions “valid,” “appropriate” vs. “can we” Population size? Populations normal? Both sample sizes? Both samples random? (b)-(c) Inference procedures Include all the steps! Population vs. sample values The confidence interval is from to (d) Would you pay money to improve your scores this much?

PP 11 – Problem 2 Let  w represent the probability of a winner living this long, with  N for the nominees and  C ­ for the controls. H 0 :  W =  N =  C (the long-term survival rate is the same for the 3 processes) H a : at least one  differs We are considering these three groups to be independent random samples from the award winning process. The expected counts (smallest = ) are large enough for the chi-square approximation to be accurate.

PP 11 – Problem 2 The chi-square value (13.229) is large and the p-value (.001) small so we reject the null hypothesis. We have very strong evidence that the three population probabilities are not the same. The largest contributions to the chi-square sum arise from the deaths among controls, where we observed more than we would have expected had the three probabilities all been equal. died alive

HW Questions? Note HW 7 is posted online

Example 1: Handicap Discrimination In 1984, handicapped individuals in the labor force had an unemployment rate of 7% compared to 4.5% in the non-impaired labor force. Cesare, S.J., Tannenbaum, R.J., and Dalessio, A. (1990), “Interviewers’ Decisions Related to Applicant Handicap Type and Rater Empathy,” Human Performance 3(3):

Example 1: Handicap Discrimination Observational units?  Undergraduate students Explanatory variable  Which type of handicap in video (qualitative) Response variable  Qualification rating (quantitative) Type of study  Experiment since randomly assigned them to different videos

Example 1: Handicap Discrimination Is it possible that there is no treatment effect but the observed treatment group means are 4.429, 5.921, 4.050, 4.900, 5.343? How decide? Let  i = underlying true mean treatment response H 0 :  none =  leg amp =  crutches =  hearing =  wheel H a : at least one  differs

Example 1: Handicap Discrimination

Inference Procedure Want one procedure for comparing the 5 treatment means simultaneously  Take into account the distances between the sample means relative to the variability in the data  Comparisons of “between group” variability to “within group” variability (“by chance”)

“Analysis of Variance” (ANOVA) F statistic = discrepancy in group means variability in data If F statistic is large, have evidence against the null hypothesis p-value = probability of observing an F statistic at least this large when H 0 is true

Minitab Stat > ANOVA > One-Way

Notes The test statistic takes the sample sizes into account, giving more weight to the group with larger sizes. Producing a pooled estimate of the overall variability in the data requires us to assume that each population/treatment group has the same variability  2.

Checking the Technical Conditions Normal populations Equal variances  Ratio of largest SD/smallest SD < 2 Independence  Random samples/randomization

Example 1: Handicap Discrimination The samples look reasonably symmetric with similar standard deviations, so it is appropriate to apply the Analysis of Variance procedure. There is moderate evidence that the mean qualification ratings differ depending on the type of handicap (p- value =.030). Descriptively, the candidates with crutches appear to have higher ratings and the candidates with hearing impairments slightly lower ratings (other procedures could be used to follow-up to test the significance of these individual differences). This was a randomized experiment so we can attribute these differences to the handicap status but we must be cautious in thinking the students in this study are representative of a larger population, particularly, a population of employers who make hiring decisions.

Example 2: Restaurant Spending Hypotheses Technical conditions? How do different factors affect the size of the p- value?  when the population means are further apart, the p-value is usually smaller (more evidence they aren’t equal)  when the within group variability is larger, the p-value is larger (less evidence didn’t happen by chance)  when the sample sizes are larger, and there is a true difference between the population means, then the p-value is smaller

Example 3: Follow-up Analysis Multiple comparison procedures control overall Type I Error rate  Are several different such procedures Bonferroni, Tukey, Scheffe’… Let Minitab do all the work

Example 4: Lifetimes of Notables Which professions appear to differ?

For Thursday Submit PP 12 in Blackboard Continue reading Ch. 12 Preview Example 5, complete (a) and (b)