Percentiles Def: The kth percentile is the value such that at least k% of the measurements are less than or equal to the value. I.E. k% of the measurements.

Slides:



Advertisements
Similar presentations
Lecture 17 Sec Wed, Feb 13, 2008 Boxplots.
Advertisements

Understanding and Comparing Distributions 30 min.
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Understanding and Comparing Distributions
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
The Five-Number Summary And Boxplots. Chapter 3 – Section 5 ●Learning objectives  Compute the five-number summary  Draw and interpret boxplots 1 2.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Box and Whisker Plots A Modern View of the Data. History Lesson In 1977, John Tukey published an efficient method for displaying a five-number data summary.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
Boxplots The boxplot is an informative way of displaying the distribution of a numerical variable.. It uses the five-figure summary: minimum, lower quartile,
Review Measures of central tendency
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
Boxplots (Box and Whisker Plots). Boxplot and Modified Boxplot 25% of data in each section.
1 Further Maths Chapter 2 Summarising Numerical Data.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 5 – Slide 1 of 21 Chapter 3 Section 5 The Five-Number Summary And Boxplots.
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Example - Fax Here are the number of pages faxed by each fax sent from our Math and Stats department since April 24 th, in the order that they occurred.
Box-and-Whisker Plots
CHAPTER 1 Exploring Data
a graphical presentation of the five-number summary of data
Get out your notes we previously took on Box and Whisker Plots.
Box and Whisker Plots and the 5 number summary
Box and Whisker Plots and the 5 number summary
Chapter 5 : Describing Distributions Numerically I
Boxplots.
Box-and-Whisker Plots
Chapter 2b.
2.6: Boxplots CHS Statistics
Box-and-Whisker Plots
Box-and-Whisker Plots
A Modern View of the Data
Numerical Measures: Skewness and Location
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Measure of Center And Boxplot’s.
Box-and-Whisker Plots
Boxplots.
Measure of Center And Boxplot’s.
Approximate the answers by referring to the box plot.
Boxplots.
Measuring Variation 2 Lecture 17 Sec Mon, Oct 3, 2005.
Measures of Central Tendency
Constructing Box Plots
Define the following words in your own definition
Box & Whiskers Plots AQR.
AP Statistics Day 4 Objective: The students will be able to describe distributions with numbers and create and interpret boxplots.
Boxplots.
Boxplots.
Boxplots.
Day 52 – Box-and-Whisker.
Box-and-Whisker Plots
Box-and-Whisker Plots
Freebird
1-4 Quartiles, Percentiles and Box Plots
Boxplots.
. . Box and Whisker Measures of Variation Measures of Variation 8 12
Box-and-Whisker Plots
Box Plots CCSS 6.7.
Quiz.
Box-and-Whisker Plots
Box and Whisker Plots and the 5 number summary
Box-and-Whisker Plots
Box-and-Whisker Plots
Boxplots.
Boxplots.
Presentation transcript:

Percentiles Def: The kth percentile is the value such that at least k% of the measurements are less than or equal to the value. I.E. k% of the measurements are less than or equal to the kth percentile.

Calculating Percentiles A random sample yields the dataset: We wish to calculate percentiles and construct a boxplot from this data. Step 1: Order the data

Order the data from smallest to largest: The data value 90 is what percentile? Order the Data

Calculate Percentiles 2 values are =< 50 3 values are =< 74 4 values are =< 78 5 values are =< 81 6 values are =< 89 7 values are =< is therefore the 100*(7 / 20) = 35th percentile

Quartiles To find quartiles, we first calculate their position in the ordered dataset. Position of Q1 =.25*(n + 1).25* (20 + 1) = 5.25 Position of Q3 =.75* (n + 1).75* (20 + 1) = 15.75

First Quartile Since 5.25 is not an integer, calculate a value 1/4 of the way between the 5th and 6th numbers * (89 – 81) = 83 Q1 =

First Quartile Since is not an integer, calculate a value 3/4 of the way between the 15th and 16th numbers (104 – 103) = Q3 =

Boxplot Make an axis for the graph covering the range of the data Draw bar for the: min, Q1, Median, Q3, and max Connect edges of bars at quartiles, and middles of min/Q1 and Q3/max

Boxplot The boxplot is now complete The plot tells us several things: –Location: the median is at 99 –Spread: the IQR is –Shape: This distribution is left-skewed. Note that the left tail is much longer and the median is closer to Q3

Outliers We will define two more measures: –Lower Fence = Q1 – 1.5*(IQR) = 83 – 1.5*(20.75) = –Upper Fence = Q *(IQR) = – 1.5*(20.75) = Any values outside of the fences are suspected to be outliers.

Outliers We can mark the fences to the boxplot. –Lower Fence = , Upper Fence = Any data point outside the fences are marked with *, the ‘whiskers’ extend only to the last data points still inside the fences.

Comments It may be easier to calculate the fences before constructing the plot. Boxplots can be vertical. Boxplots using * for outliers are sometimes called modified boxplots. Sometimes “Outer Fences” are used to denote “extreme outliers”, calculated by: Q1 – 3*(IQR) Q3 + 3*(IQR)