The Five-Number Summary and Boxplots

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Click the mouse button or press the Space Bar to display the answers.
Descriptive Measures MARE 250 Dr. Jason Turner.
Understanding and Comparing Distributions 30 min.
Numerically Summarizing Data
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Averages and Variation
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Chapter 3 Numerically Summarizing Data Section 3.5 Five Number Summary; Boxplots.
Numerical Representation of Data Part 3 – Measure of Position
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Statistics: Use Graphs to Show Data Box Plots.
5 Number Summary, Boxplots, Outliers, and Resistance
Quartiles and the Interquartile Range.  Comparing shape, center, and spreads of two or more distributions  Distribution has too many values for a stem.
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
The Five-Number Summary And Boxplots. Chapter 3 – Section 5 ●Learning objectives  Compute the five-number summary  Draw and interpret boxplots 1 2.
Describing distributions with numbers
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Objectives 1.2 Describing distributions with numbers
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
Copyright © 2009 Pearson Education, Inc. Chapter 5 Understanding and Comparing Distributions.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Lesson Describing Distributions with Numbers adapted from Mr. Molesky’s Statmonkey website.
1 Further Maths Chapter 2 Summarising Numerical Data.
Describing Quantitative Data with Numbers Section 1.3.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Understanding and Comparing Distributions.
Understanding and Comparing Distributions
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
Chapter 4 Understanding and Comparing Distributions Another Useful Graphical Method: Boxplots.
Using Measures of Position (rather than value) to Describe Spread? 1.
Summary Statistics, Center, Spread, Range, Mean, and Median Ms. Daniels Integrated Math 1.
Introductory Statistics Lesson 2.5 A Objective: SSBAT find the first, second and third quartiles of a data set. SSBAT find the interquartile range of a.
CCGPS Advanced Algebra Day 1 UNIT QUESTION: How do we use data to draw conclusions about populations? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Today’s Question:
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
5 Number Summary, Boxplots, Outliers, and Resistance.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chapter 4 - Exploring Data Section 1 - Describing Distribution with Numbers.
Box and Whisker Plot Chapter 3.5. Box and Whisker Plot A Box-and-Whisker Plot or Box plot is a visual device that uses a 5-number summary to reveal the.
Quantitative Data Continued
Box and Whisker Plots or Boxplots
Bellwork 1. Order the test scores from least to greatest: 89, 93, 79, 87, 91, 88, Find the median of the test scores. 79, 87, 88, 89, 91, 92, 93.
Boxplots.
Unit 2 Section 2.5.
Chapter 2b.
2.6: Boxplots CHS Statistics
Click the mouse button or press the Space Bar to display the answers.
Numerical Measures: Skewness and Location
Warmup What five numbers need to be mentioned in the complete sentence you write when the data distribution is skewed?
Percentiles and Box-and-Whisker Plots
Boxplots.
1.3 Describing Quantitative Data with Numbers
Constructing Box Plots
Unit 12: Intro to Statistics
Define the following words in your own definition
Boxplots.
Organizing, Summarizing, &Describing Data UNIT SELF-TEST QUESTIONS
Exploratory Data Analysis
Section 3:4 Answers page 173 #5,6, 12, 14, 16, 19, 21 23, 28
Boxplots.
Boxplots.
Click the mouse button or press the Space Bar to display the answers.
Honors Statistics Review Chapters 4 - 5
Boxplots.
Boxplots.
Presentation transcript:

The Five-Number Summary and Boxplots Lesson 3 - 5 The Five-Number Summary and Boxplots

Objectives Compute the five-number summary Draw and interpret boxplots

Vocabulary Five-number Summary – the minimum data value, Q1, median, Q3 and the maximum data value

Five-number summary Boxplot Min Q1 M Q3 Max Lower Fence Upper Fence [ smallest value First, Second and Third Quartiles (Second Quartile is the Median, M) largest value Lower Fence Upper Fence Boxplot [ ] * Smallest Data Value > Lower Fence Largest Data Value < Upper Fence (Min unless min is an outlier) (Max unless max is an outlier) Outlier

Distribution Shape Based on Boxplots: If the median is near the center of the box and each horizontal line is of approximately equal length, then the distribution is roughly symmetric If the median is to the left of the center of the box or the right line is substantially longer than the left line, then the distribution is skewed right If the median is to the right of the center of the box or the left line is substantially longer than the right line, then the distribution is skewed left

Why Use a Boxplot? A boxplot provides an alternative to a histogram, a dotplot, and a stem-and-leaf plot. Among the advantages of a boxplot over a histogram are ease of construction and convenient handling of outliers. In addition, the construction of a boxplot does not involve subjective judgements, as does a histogram. That is, two individuals will construct the same boxplot for a given set of data - which is not necessarily true of a histogram, because the number of classes and the class endpoints must be chosen. On the other hand, the boxplot lacks the details the histogram provides. Dotplots and stemplots retain the identity of the individual observations; a boxplot does not. Many sets of data are more suitable for display as boxplots than as a stemplot. A boxplot as well as a stemplot are useful for making side-by-side comparisons.

Example 1 Consumer Reports did a study of ice cream bars (sigh, only vanilla flavored) in their August 1989 issue. Twenty-seven bars having a taste-test rating of at least “fair” were listed, and calories per bar was included. Calories vary quite a bit partly because bars are not of uniform size. Just how many calories should an ice cream bar contain? Construct a boxplot for the data above. 342 377 319 353 295 234 294 286 182 310 439 111 201 197 209 147 190 151 131

Example 1 - Answer Q1 = 182 Q2 = 221.5 Q3 = 319 Min = 111 Max = 439 Range = 328 IQR = 137 UF = 524.5 LF = -23.5 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 Calories

Example 2 The weights of 20 randomly selected juniors at MSHS are recorded below:   a) Construct a boxplot of the data b) Determine if there are any mild or extreme outliers. 121 126 130 132 143 137 141 144 148 205 125 128 131 133 135 139 147 153 213

Example 2 - Answer Q1 = 130.5 Q2 = 138 Q3 = 145.5 Min = 121 Max = 213 Range = 92 IQR = 15 UF = 168 LF = 108 Extreme Outliers ( > 3 IQR from Q3) * * 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 Weight

Example 3 The following are the scores of 12 members of a woman’s golf team in tournament play: a) Construct a boxplot of the data. b) Are there any mild or extreme outliers? c) Find the mean and standard deviation. d) Based on the mean and median describe the distribution? 89 90 87 95 86 81 111 108 83 88 91 79

Example 3 - Answer Q1 = 84.5 Q2 = 88.5 Q3 = 93 Min = 79 Max = 111 Range = 32 IQR = 18.5 UF = 120.75 LF = 56.75 Golf Scores 78 81 84 87 90 93 96 99 102 105 108 111 114 117 120 123 126 No Outliers Mean= 90.67 St Dev = 9.85 Distribution appears to be skewed right (mean > median and long whisker)

Example 4 Comparative Boxplots: The scores of 18 first year college women on the Survey of Study Habits and Attitudes (this psychological test measures motivation, study habits and attitudes toward school) are given below: The college also administered the test to 20 first-year college men. There scores are also given: Compare the two distributions by constructing boxplots. Are there any outliers in either group? Are there any noticeable differences or similarities between the two groups? 154 109 137 115 152 140 178 101 103 126 165 129 200 148 108 140 114 91 180 115 126 92 169 146 109 132 75 88 113 151 70 187 104

Example 4 - Answer Q1 = 126 98 Q2 = 138.5 114.5 Q3 = 154 143 Min = 101 70 Max = 200 187 Range = 99 117 IQR = 28 45 UF = 196 210.5 LF = 59 30.5 Comparing Men and Women Study Habits and Attitudes Women * 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 Men Women’s median is greater and they have less variability (spread) in their scores; the women’s distribution is more symmetric while the men’s is skewed right. Women have an outlier; while the men do not.

Summary and Homework Summary Homework: pg 181-183: 5-7, 15 Boxplots are used for checking for outliers Use comparative boxplots for two datasets Constructing a boxplot is not subjective Identifying a distribution from boxplots or histograms is subjective! Homework: pg 181-183: 5-7, 15