Paired Data: One Quantitative Variable Chapter 7.

Slides:



Advertisements
Similar presentations
ANALYZING MORE GENERAL SITUATIONS UNIT 3. Unit Overview  In the first unit we explored tests of significance, confidence intervals, generalization, and.
Advertisements

Two Quantitative Variables
CHAPTER 24: Inference for Regression
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Chapter 11: Inference for Distributions
Chapter 6: Introduction to Formal Statistical Inference November 19, 2008.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
CHAPTER 11: Sampling Distributions
Review Tests of Significance. Single Proportion.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Statistical inference. Distribution of the sample mean Take a random sample of n independent observations from a population. Calculate the mean of these.
CHAPTER 18: Inference about a Population Mean
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
Significance Tests: THE BASICS Could it happen by chance alone?
LECTURE 19 THURSDAY, 14 April STA 291 Spring
1 Happiness comes not from material wealth but less desire.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Stat 1510: Sampling Distributions
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Essential Statistics Chapter 171 Two-Sample Problems.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
AP Stat 2007 Free Response. 1. A. Roughly speaking, the standard deviation (s = 2.141) measures a “typical” distance between the individual discoloration.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
AP Test Practice. A student organization at a university is interested in estimating the proportion of students in favor of showing movies biweekly instead.
The Practice of Statistics, 5 th Edition1 Check your pulse! Count your pulse for 15 seconds. Multiply by 4 to get your pulse rate for a minute. Write that.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Simulation-Based Approach for Comparing Two Means
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Paired Data: One Quantitative Variable Chapter 7

Introduction  The paired datasets in this chapter have one pair of quantitative response values for each observational unit.  This allows for a built-in comparison.  Studies with paired data remove individual variability by looking at the difference score for each individual.  Reducing variability in data improves inferences: Narrower confidence intervals Smaller p-values when the null hypothesis is false

Introduction  Our data that we will analyze will just be a single quantitative variable.  So things like mean and standard deviation are important to look at, but really nothing new for descriptive statistics.  Section 7.1: Simulation-based method  Section 7.2: Theory-based method

Section 7.1: Simulation-Based Approach for Analyzing Paired Data Example 7.1: Rounding First Base

First Base  Imagine you’ve hit a line drive and are trying to reach second base.  Does the path that you take to “round” first base make much a difference? Narrow angle Wide angle

First Base  Hollander and Wolfe (1999) report on a Master’s Thesis by Woodward (1970) that investigates base running strategies.  Woodward timed 22 different runners from a spot 35 feet past home to a spot 15 feet before second.  Each runner used each strategy (paired design), with a rest between.  This paired design controls for the runner-to- runner variability.  He used random assignment to decide which path each runner should do first.

First Base  Times for the first 10 runners  Dotplots of times for all 22 runners Subject narrow angle … wide angle …

First Base  There is a lot of overlap in the distributions and a fair bit of variability  Difficult to detect a difference between the methods when there’s a lot of variation MeanSD Narrow Wide

First Base  What are the observational units in this study? The runners (22 total)  What variables are recorded? What are their types and roles? Explanatory variable: base running method: wide or narrow angle (categorical) Response variable: time for middle of the route from home plate to second base (quantitative)  Is this an observational study or an experiment? Randomized experiment since the explanatory variable was randomly applied to determined which method each runner used first

First Base  These data are clearly paired.  The paired response variable is time difference in running between the two methods (narrow angle – wide angle).  Could we do wide angle – narrow angle?

First Base  Differences for the first 10 runners  A dotplot of the differences for all 22 runners. Subject narrow angle … wide angle … diff …

First Base

 The original dotplots with each observation paired between the base running strategies.  What do you notice?

First Base

 How can simulation-based methods find an approximate p-value? The null basically says the running path doesn’t matter --- the times, on average, will be the same for the two methods. So we can use our same data set and randomly decide which time goes with the narrow and wide methods and compute a mean difference. (Notice we don’t break our pairs.) We can repeat this process many times to develop a null distribution.

First Base Subject narrow angle … wide angle … diff …

First Base  Mean differences from 1000 repetitions  Describe the shape of the distribution.  The distribution appears to be centered at about 0. Does that make sense?

First Base  Using the null distribution is the observed average from the study of out in the tail?

First Base

 Based on the p-value and standardized statistic we have very strong evidence against the null hypothesis.  We can draw a cause-and-effect conclusions since the researcher used random assignment of the two base running methods for each runner.  There was not a lot of information about how these 22 runners were selected to decide if we can generalize to a larger population.

First Base

Alternative Analysis  What do you think would happen if we wrongly analyzed the data using a 2 independent samples procedure?  I.e. the researcher selected 22 runners to use the wide method and an independent sample of 22 other runners to use the narrow method, obtaining the same 44 times as in the actual study.  Would the p-value stay the same, increase, or decrease?

First Base Using the Two Means applet (which does an independent test) we get a p-value of Does it make sense that this p-value is larger than the one we obtained earlier?

Exercise and Heart Rate Exploration 7.1

Section 7.2: Theory-based methods for paired data.

First Base  Our null distribution was centered at zero and fairly bell-shaped.  This can all be predicted (along with the variability) using theory-based methods. To do this, our sample size should be at least 20.

Theory-based test

Theory-based results

First Base  The theory-based model gives slightly different results, but we come to the same conclusion. Which base running path used does make a difference in the average times (we can see that with our small p-value).  We estimate the narrow angle path will take between to seconds longer, on average, to complete than the wide angle path.

Exploration 7.2 Comparing Auction Formats  We will compare: Dutch auction the item for sale starts at a very high price and is lowered gradually until someone finds the price low enough to buy. First-price sealed bid auction each bidder summits a single sealed bid before a particular deadline. After the deadline, the person with the highest bid wins.