1 Boxplots. 2 Boxplots & the 5 # Summary To construct a boxplot, first obtain the 5 number summary { Min, Q 1, M, Q 3, Max } Q 1 : 1 st quartile = 25.

Slides:



Advertisements
Similar presentations
HS 67 - Intro Health Statistics Describing Distributions with Numbers
Advertisements

Descriptive Measures MARE 250 Dr. Jason Turner.
Understanding and Comparing Distributions 30 min.
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers Part 2
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Measures of Relative Standing and Boxplots
Percentiles Def: The kth percentile is the value such that at least k% of the measurements are less than or equal to the value. I.E. k% of the measurements.
Understanding and Comparing Distributions
5 Number Summary, Boxplots, Outliers, and Resistance
Boxplots (Box and Whisker Plots). Comparing Data Using Boxplots Each section of the boxplot represents 25% of the data. The median (50%tile) is the line.
Unit 2 : Data Analysis Box Plots CCSS: S.ID.1, S.ID.2, S.ID.3.
M08-Numerical Summaries 2 1  Department of ISM, University of Alabama, Lesson Objectives  Learn what percentiles are and how to calculate quartiles.
STAT 250 Dr. Kari Lock Morgan
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Review Measures of central tendency
Copyright © 2009 Pearson Education, Inc. Chapter 5 Understanding and Comparing Distributions.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
Measures of Relative Standing Percentiles Percentiles z-scores z-scores T-scores T-scores.
Boxplots (Box and Whisker Plots). Boxplot and Modified Boxplot 25% of data in each section.
Continued… Obj: draw Box-and-whisker plots representing a set of data Do now: use your calculator to find the mean for 85, 18, 87, 100, 27, 34, 93, 52,
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Understanding and Comparing Distributions.
Understanding and Comparing Distributions
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Chapter 5 Describing Distributions Numerically.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Foundations of Math I: Unit 3 - Statistics Arithmetic average Median: Middle of the data listed in ascending order (use if there is an outlier) Mode: Most.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
5 Number Summary, Boxplots, Outliers, and Resistance.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
Box and Whisker Plots or Boxplots
5-Number Summaries, Outliers, and Boxplots
CHAPTER 1 Exploring Data
Box and Whisker Plots and the 5 number summary
Box and Whisker Plots and the 5 number summary
Chapter 5 : Describing Distributions Numerically I
Sections 2.3 and 2.4.
CHAPTER 1 Exploring Data
Chapter 2b.
CHAPTER 1 Exploring Data
2.6: Boxplots CHS Statistics
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Box Plots and Outliers.
One Quantitative Variable: Measures of Spread
CHAPTER 1 Exploring Data
Data Analysis and Statistical Software I Quarter: Spring 2003
Measures of Central Tendency
Exploratory Data Analysis
Section 2.4 Measures of Variation.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Statistics Fractiles
CHAPTER 1 Exploring Data
1-4 Quartiles, Percentiles and Box Plots
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Box and Whisker Plots and the 5 number summary
CHAPTER 1 Exploring Data
Univariate Data Univariate Data: involving a single variable
CHAPTER 1 Exploring Data
Number Summaries and Box Plots.
Presentation transcript:

1 Boxplots

2 Boxplots & the 5 # Summary To construct a boxplot, first obtain the 5 number summary { Min, Q 1, M, Q 3, Max } Q 1 : 1 st quartile = 25 th percentile M = Q 2 : median = 2 nd quartile = 50 th percentile Q 3 : 3 rd quartile = 75 th percentile

3 Example: Simple Boxplot Failure times of industrial machines (in hours) # summary: { 189.1, , 232.3, , }

4 Example: Simple Boxplot Min M Max Q1 Q3

5 Example: Simple Boxplot

6 Example: Modified Boxplot A mathematical rule designates “outliers.” These are plotted using special symbols. (Note: The maximum is still )

7 Modified Boxplot: Outliers A mathematical rule designates “outliers.” Outliers are part of the data – do not assume they are “bad data” or can be deleted. In fact, they may constitute the most meaningful information in the data. If we discover why the one machine took so long to fail, perhaps we can adjust others to increase the likelihood of a large failure time for future machines. Then future failure times should be plotted in a new boxplot.

8 Modified Boxplot: Outliers A mathematical rule designates “outliers” in a boxplot. Sometimes* an outlier is ripe for… …change learn why it’s wrong and correct it …deletion on the basis of something that distinguishes the unit (but not the value alone) determine why it’s different and doesn’t belong * not always – in fact, in this class, you may assume that outliers aren’t “wrong” and “do belong.

9 Modified Boxplot: Outliers A mathematical rule designates “outliers.” For a large set of good data drawn from a Normal population, about 2% of the data will be marked “outlier” in a modified boxplot. For small but good data sets, generally more than 2% will be so marked. For good data from skewed distributions, generally much more than 2% will be so marked.

10 Modified Boxplot: Outliers Learn to match shape with boxplot orientation.