Download presentation
Presentation is loading. Please wait.
2
Chapter 3: Describing Relationships
Section 3.1 Scatterplots and Correlation
3
Scatterplots and Correlation
Explanatory and Response Variables Most statistical studies examine data on more than one variable. In many of these settings, the two variables play different roles. Scatterplots and Correlation Definition: A response variable (y) measures an outcome of a study. An explanatory variable (x) may help explain or influence changes in a response variable.
4
Displaying Relationships: Scatterplots
The most useful graph for displaying the relationship between two quantitative variables is a scatterplot. Definition: A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each individual in the data appears as a point on the graph.
5
How to Make a Scatterplot
Decide which variable should go on each axis. Remember, the eXplanatory variable goes on the X-axis! Label and scale your axes. Plot individual data values. NOTE: The axes need not intersect at (0,0). For each of the axes, the scale should be chosen so that the minimum and maximum values on the scale are convenient and the values to be plotted are between the two values.
6
Displaying Relationships: Scatterplots
Make a scatterplot of the relationship between body weight and pack weight. Since Body weight is our eXplanatory variable, be sure to place it on the X-axis! Body weight (lb) 120 187 109 103 131 165 158 116 Backpack weight (lb) 26 30 24 29 35 31 28
7
How to Examine a Scatterplot
Interpreting Scatterplots For the distribution of a single quantitative variable, “shape, center, spread, outliers” (SOCS) has been a useful summary. How to Examine a Scatterplot As in any graph of data, look for the overall pattern and for striking departures from that pattern. You can describe the overall pattern of a scatterplot by the direction, shape, and strength of the relationship. An important kind of departure is an outlier, an individual value that falls outside the overall pattern of the relationship. (Ask yourself, “Is there a striking exception to the overall pattern?”)
8
Describing Strength Describe the strength of the relationship. If the points cluster closely around an imaginary line, the association is strong. If the points are scattered farther from the line, the association is weak.
9
Describing Direction Definition: Two variables have a positive association when above-average values of one tend to accompany above-average values of the other, and when below- average values also tend to occur together. (i.e., Generally speaking, the y values tend to increase as the x values increase.) Two variables have a negative association when above-average values of one tend to accompany below-average values of the other. (i.e., Generally speaking, the y values tend to decrease as the x values increase.)
10
Interpreting Scatterplots
Outlier There is one possible outlier, the hiker with the body weight of 187 pounds seems to be carrying relatively less weight than are the other group members. Strength Direction Form There is a moderately strong, positive, linear relationship between body weight and pack weight. It appears that lighter students are carrying lighter backpacks.
11
Interpreting Scatterplots
Consider the SAT example from page Interpret the scatterplot. Strength Direction Form There is a moderately strong, negative, curved relationship between the percent of students in a state who take the SAT and the mean SAT math score. Further, there are two distinct clusters of states and two possible outliers that fall outside the overall pattern.
12
Example A sample of one-way Greyhound bus fares from Rochester, NY to cities less than 750 miles away was taken by going to Greyhound’s website. The following table gives the destination city, the distance and the one-way fare. Which variable is the explanatory? Response?
13
Verify the plot on your graphing calculator
Example Scatterplot Verify the plot on your graphing calculator Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc.
14
Further Comments It is possible that two points might have the same x value with different y values. Notice that Potsdam (239) and Albany (240) come very close to having the same x value but the y values are $8 apart. Clearly, the value of y is not determined solely by the x value (there are factors other than distance that affect the fare). In this example, the y value tends to increase as x increases. We say that there is a positive relationship between the variables distance and fare. It appears that the y value (fare) could be predicted reasonably well from the x value (distance) by finding a line that is close to the points in the plot.
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.