Presentation on theme: "Thinking About Variation"— Presentation transcript:

Since Statistics is about variation, spread is an important fundamental concept of Statistics. Measures of spread help us talk about what we don’t know. When the data values are tightly clustered around the center of the distribution, the IQR and standard deviation will be small. When the data values are scattered far from the center, the IQR and standard deviation will be large.

When telling about a quantitative variable, always report the shape of its distribution, along with a center and a spread. If the shape is skewed, report the median and IQR. If the shape is symmetric, report the mean and standard deviation and possibly the median and IQR as well.

List 1 : 55, 56, 57, 58, 59, 60, 60, 60, 61, 62, 63, 64, 65 mean = median = mode = 60 X XXXXXXXXXXX List 2 : 35, 40, 45, 50, 55, 60, 60, 60, 65, 70, 75, 80, 85 ___ X X X X X X X X X _ X X Both distributions have the same mean, median, and mode, but differ in their variation (or spread)

What does St. Dev. tell us? Standard deviation is a measure of the variation – Big st. dev. indicates a lot of variation, so the data is scattered far from the center Little st. dev. indicates not much variation, so the data is clustered around the center Zero st. dev. indicates no variation, so the data values are all the same as the mean

Stem-and-Leaf Displays
Stem-and-leaf displays show the distribution of a quantitative variable, like histograms do, while preserving the individual values. Stem-and-leaf displays contain all the information found in a histogram and, when carefully drawn, satisfy the area principle and show the distribution.

Stem-and-Leaf Example
Compare the histogram and stem-and-leaf display for the pulse rates of 24 women at a health clinic. Which graphical display do you prefer?

Comparing a Histogram to a Stemplot
Both graphs show the first 36 months of Enron monthly stock price changes.

Comparing a Histogram to a Stemplot
Both graphs show the first 36 months of Enron monthly stock price changes.

Simpson’s Paradox This occurs when an association you make from categorical variables is “reversed” when you consider a third variable. Helicopter Road Victim Died 64 260 Victim survived 136 840 Total 200 1100

Survived 52 40 Total 100 Less Serious Accidents Helicopter Road Died 16 200 Survived 94 800 Total 100 1000

Example The frequency table shows the heights (in inches) of 130 members of a choir. Enter the data into your calculator – put the heights into L1 and the frequencies into L2

Example To find the five-number summary: 1-var stats L1, L2
To create a boxplot of the distribution:

Example Zoom-9 to graph!! Set up Plot 2 as a histogram:

Example Zoom-9 to graph!!