ANALYZING MORE GENERAL SITUATIONS UNIT 3. Unit Overview  In the first unit we explored tests of significance, confidence intervals, generalization, and.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Paired Data: One Quantitative Variable Chapter 7.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Chapter 10 - Part 1 Factorial Experiments.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
How Can We Test whether Categorical Variables are Independent?
Review Tests of Significance. Single Proportion.
Chapter 13: Inference in Regression
Chapter 14: Nonparametric Statistics
 Involves testing a hypothesis.  There is no single parameter to estimate.  Considers all categories to give an overall idea of whether the observed.
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
CHAPTER 11 SECTION 2 Inference for Relationships.
+ Chi Square Test Homogeneity or Independence( Association)
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Copyright © 2010 Pearson Education, Inc. Slide
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
AP Statistics Section 11.1 B More on Significance Tests.
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 8: Tests of significance and confidence.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
Simulation-based inference beyond the introductory course Beth Chance Department of Statistics Cal Poly – San Luis Obispo
The Practice of Statistics, 5 th Edition1 Check your pulse! Count your pulse for 15 seconds. Multiply by 4 to get your pulse rate for a minute. Write that.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Association between two categorical variables
Chapter 12 Tests with Qualitative Data
Simulation-Based Approach for Comparing Two Means
CHAPTER 10 Comparing Two Populations or Groups
Stat 217 – Day 28 Review Stat 217.
Chi Square Two-way Tables
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
CHAPTER 10 Comparing Two Populations or Groups
Inference for Relationships
CHAPTER 10 Comparing Two Populations or Groups
Significance Tests: The Basics
Lesson 11 - R Chapter 11 Review:
Analyzing the Association Between Categorical Variables
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 26 Comparing Counts.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

ANALYZING MORE GENERAL SITUATIONS UNIT 3

Unit Overview  In the first unit we explored tests of significance, confidence intervals, generalization, and causation mostly in terms of a single proportion.  In the second unit we compared two proportions, two averages (independent), and paired data. All of these had a binary categorical explanatory variable.  In the third unit we will expand on this to compare multiple proportions, multiple averages, and two quantitative variables. The explanatory variable will be categorical (not necessarily binary) or quantitative.

Unit Overview  Throughout this unit we can still express the null hypothesis in terms of no association between the response and the explanatory variable and the alternative hypothesis in terms of an association between the response and the explanatory variable.  We can also be more specific in our hypotheses to reflect the type of data (and hence parameters) we are working with.

Comparing More Than Two Groups Using Proportions Chapter 8

Chapter Overview  In chapter 5, our explanatory variable had two outcomes (like parents smoke or not) and the response variable also had two outcomes (like baby is boy or girl).  In this chapter we will allow the explanatory variable to have more than two outcomes (like both parents smoke, only mother smokes, only father smokes, or neither parent smokes).  We will still focus on the response variable having two outcomes, but it doesn’t have to. It can have many as well.

Section 8.1: Simulation-Based Approach to Compare Multiple Proportions Example 8.1 Coming to a Stop

Stopping  Virginia Tech students investigated which vehicles came to a stop at a intersection where there was a four-way stop.  While the students examined many factors for an association with coming to a complete stop, we will investigate whether intersection arrival patterns are associated with coming to a complete stop: Vehicle arrives alone Vehicle is the lead in a group of vehicles Vehicle is a follower in a group of vehicles

Stopping  Null hypothesis: There is no association between the arrival pattern of the vehicle and if it comes to a complete stop.  Alternative hypothesis: There is an association between the arrival pattern of the vehicle and if it comes to a complete stop.

Stopping

 Percentage of vehicles stopping: 85.8% of single vehicles 90.5% of lead vehicles 77.6% of vehicles following in a group Single Vehicle Lead Vehicle Following Vehicle Total Complete Stop 151 (85.8%) 38 (90.5%) 76 (77.6%) 265 Not Complete Stop 25 (14.2%) 4 (9.5%) 22 (22.4%) 51 Total

Stopping  Remember that no association implies the proportion of vehicles that stop in each category should be the same.  Our question is, if the same proportion of vehicles come to a complete stop in all the three categories, how unlikely would we get proportions at least as far apart as we did?

Stopping Applying the 3S Strategy  We need to find a statistic that will describe how far apart our proportions are from each other.  This is more complicated than when we just had two groups.  We also need to decide what types of values for that statistic (e.g., large or small, positive or negative) we would consider evidence against the null hypothesis.

Stopping  To find a statistic we start by finding the three differences in proportions.

Stopping  How can we combine 3 differences (-0.047, and 0.129) into a single statistic?  Add them up? Average them? (-0.082) = 0. [ (-0.082) ]/3 = 0.  What could we do so we don’t always get a sum of zero?

Stopping 1. Statistic  We are going to use the mean of the absolute value of the differences (MAD)  ( )/3 =  What would have to be true for the average of absolute differences to equal 0?  What types of values of this statistic (e.g., large or small) would provide evidence in favor of the alternative hypothesis?

Stopping 2. Simulate  If there is no association between arrival pattern and whether or not a vehicle stops it basically means it doesn’t matter what the arrival pattern is. Some vehicles will stop no matter what the arrival pattern and some vehicles won’t.  We can model this by shuffling either the explanatory or response variables. (The applet will shuffle the response.)

Stopping  Use the Multiple Proportions Applet

Stopping  The results of one shuffle of the response

Stopping  Simulated values of the statistic for 1000 shuffles  Bell-shaped?  Centered at 0?

Stopping 3. Strength of evidence  Do we use a 1-sided or 2-sided alternative hypothesis to compute a p-value? By finding the absolute values we have lost direction in terms of which proportion is smaller than another. When we are looking for more of a difference, the MAD statistic will be larger. Hence to calculate our p-value we will always count the simulations that are as large or larger than the MAD statistic.

Stopping  We had a p-value of so there is moderate evidence against the null hypothesis  Do the results generalize to intersections beyond the one used? Probably not since intersections have different factors that influence stopping  Can we draw any cause-and-effect conclusions? No, an observational study  If we had stronger evidence of a difference in groups, we could follow with pairwise tests to see which proportions are significantly different from each other. (We will save this until the next section.)

Recruiting Organ Donors Exploration 8.1

Theory-Based Approach to Compare Multi-Category Categorical Variables Section 8.2

Theory based vs. Simulation based Just as always:  Simulation based methods Always work Require the ability to simulate (computer).  Theory-based methods Avoid the need for simulation Additional ‘validity conditions’ must be met

Other Distributions  The null distributions in the past few chapters were bell-shaped and centered at 0.  For these, we used normal and t-distributions to predict what the null distribution would look like.  In this chapter our null distribution was neither bell-shaped nor centered at 0.  We can, however, use a chi-squared distribution to predict the shape of the null distribution.  When doing this, we will not use the MAD statistic, but a chi-square statistic.

Sham Acupuncture  A randomized experiment was conducted exploring the effectiveness of acupuncture in treating chronic lower back pain (Haake et al. 2007).  Acupuncture inserts needles into the skin of the patient at acupuncture points to treat a variety of ailments.

Sham Acupuncture  3 treatment groups 1. Verum acupuncture: traditional Chinese 2. Sham acupuncture: needles inserted into the skin, but not deeply and not at acupuncture points 3. Traditional, non-acupuncture, therapy of drugs, physical therapy and exercise.  1162 patients were randomly assigned to each treatment group  387 patients in groups 1 and 2, and 388 in group 3.

Sham Acupuncture Null hypothesis - no association between type of treatment received and reduction in back pain. Alternative hypothesis - there is an association between type of treatment and reduction in back pain.

Sham Acupuncture  Here are the results. Real Acup. Sham Acup. Non- Acup. Total Pain reduced 184 (0.475) 171 (0.442) 106 (0.273) 461 Pain not reduced 203 (0.524) 216 (0.558) 282 (0.776) 701 Total

Sham Acupuncture  Remember that the MAD statistic is the mean of the absolute value of differences in proportions for all 3 groups: Real vs. Sham: =0.034 Real vs. None: = Sham vs. None: =  The statistic (MAD) is ( )/3=0.135  Larger MAD statistics give more evidence against the null

Sham Acupuncture

 Don’t worry about the formula. Just realize it is another way to measure how far apart the proportions are from each other (or the overall proportion of successes).  Just like the MAD statistic, the larger the chi- square statistic the more evidence there is against the null.  Let’s test this in the Multiple Proportions Applet using both MAD and chi-square simulation.

Sham Acupuncture Strength of evidence:  In both cases, we got p-values of 0.  Nothing as large as a MAD statistic of or larger ever occurred in this simulation.  Likewise nothing as large as a chi-square statistic of or larger ever occurred in this simulation.  Hence we have very strong evidence against the null and in support of the type of acupuncture used is associated with pain reduction.

Sham Acupuncture  The theory-based method will predict the chi-square null distribution.  We can use the applet to overlay a theoretical chi-square distribution and find the theory-based p-value.  The theory based method only works well when each cell in the 2-way table has at least 10 observations.  This is easily met here. The smallest count was 106.

Results

Sham Acupuncture  What if you find evidence of an association?  What happens after a chi-squared test? No standard approach to follow Could describe the association with confidence intervals on each of the differences in proportions.

Sham Acupuncture  95% confidence intervals (found in applet) on the difference improvement percentages comparing: Real to sham ( , ) Real to none (0.1356,0.2689)* Sham to none (0.1022, )*  There is evidence that the probability of pain reduction is different (lower) for no acupuncture treatment than the other two.  There is no significant difference between real and sham acupuncture however.  Let’s see all this in the applet.

Exploration 8.2: Conserving Hotel Towels  Hotels are encouraging guests to practice conservation by not having their towels washed.  (Goldstein et al., 2008) conducted a randomized experiment to investigate how different phrasings on signs placed on bathroom towel racks impacted towel reuse behavior.  Researchers were interested how messages which communicated different types of “social norms” impacted towel reuse.

Exploration 8.2: Conserving Hotel Towels  Rooms at a single hotel were randomly assigned to receive 1 of 5 messages on the sign on the towel bar  Is there an association between the message left and whether or not towels get reused?