Numerical Measures: Skewness and Location

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

C. D. Toliver AP Statistics
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Measures of Position - Quartiles
Quartiles  Divide data sets into fourths or four equal parts. Smallest data value Q1Q2Q3 Largest data value 25% of data 25% of data 25% of data 25% of.
Understanding and Comparing Distributions 30 min.
Measures of Dispersion
Descriptive Statistics
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Percentiles Def: The kth percentile is the value such that at least k% of the measurements are less than or equal to the value. I.E. k% of the measurements.
Statistics: Use Graphs to Show Data Box Plots.
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
The Five-Number Summary And Boxplots. Chapter 3 – Section 5 ●Learning objectives  Compute the five-number summary  Draw and interpret boxplots 1 2.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Box and Whisker Plots A Modern View of the Data. History Lesson In 1977, John Tukey published an efficient method for displaying a five-number data summary.
Box and Whisker Plots and Quartiles Sixth Grade. Five Statistical Summary When describing a set of data we have seen that we can use measures such as.
Chapter 2 Describing Data with Numerical Measurements
Numerical Descriptive Measures
6-9 Data Distributions Objective Create and interpret box-and-whisker plots.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Percentiles For any whole number P (between 1 and 99), the Pth percentile of a distribution is a value such that P% of the data fall at or below it. The.
Chapter 2 Section 5 Notes Coach Bridges
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 5 – Slide 1 of 21 Chapter 3 Section 5 The Five-Number Summary And Boxplots.
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Chapter 5 Describing Distributions Numerically.
Summary Statistics: Measures of Location and Dispersion.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Using Measures of Position (rather than value) to Describe Spread? 1.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
What is a box-and-whisker plot? 5-number summary Quartile 1 st, 2 nd, and 3 rd quartiles Interquartile Range Outliers.
Unit 3: Averages and Variations Part 3 Statistics Mr. Evans.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Descriptive Statistics ( )
Probability & Statistics
a graphical presentation of the five-number summary of data
Describing Distributions Numerically
Get out your notes we previously took on Box and Whisker Plots.
Chapter 3 Describing Data Using Numerical Measures
Boxplots.
Chapter 16: Exploratory data analysis: Numerical summaries
Unit 2 Section 2.5.
3-3: Measures of Position
NUMERICAL DESCRIPTIVE MEASURES
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Chapter 2b.
Box and Whisker Plots Algebra 2.
Percentiles and Box-and- Whisker Plots
2.6: Boxplots CHS Statistics
Topic 5: Exploring Quantitative data
A Modern View of the Data
Quartile Measures DCOVA
Box & Whiskers Plots AQR.
AP Statistics Day 4 Objective: The students will be able to describe distributions with numbers and create and interpret boxplots.
Boxplots.
Boxplots.
Boxplots.
Day 52 – Box-and-Whisker.
Describing Distributions Numerically
Honors Statistics Review Chapters 4 - 5
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Boxplots.
Box Plot Lesson 11-4.
Chapter 12 Statistics.
Presentation transcript:

Numerical Measures: Skewness and Location PSYSTA1 – Week 6

Measure of Skewness statistical measure used to describe the distribution of the data relative to symmetry Goal: quantify the degree of asymmetry (e.g., location of tails, difference between “centers”, etc.) in a data set Sample Skewness: 𝐒𝐊 𝐱 = 𝟑[ 𝒙 −𝐦𝐞𝐝 𝒙 ] 𝒔

Some PROPERTIES In relation with histograms (i.e., locating the centers):

Example 1 Compute the coefficient of skewness for the data given below. Then, describe the skewness of the data based on computed coefficient. 2.5 3.2 3.8 1.3 1.4 0.0 0.0 2.6 5.2 4.8 0.0 4.6 2.8 3.3

Measures of Location statistical measures used to describe the (relative) standing or location of an observation relative to the rest of the data Goal: locate the observation relative to the rest of the observations Most Commonly used Measures: Percentiles (including deciles and quartiles) z-Scores

Percentiles defined as the value on the measurement scale below which a specified percentage of the scores in the distribution fall denoted by 𝑃 𝑘 , they divide the ranked data set into 100 equal parts A percentile 𝑷 𝒌 would indicate that at least k% of the data is less than or equal to the value of 𝑷 𝑘 (thus, 100%−𝑘% of the data is greater than 𝑃 𝑘 ).

Percentiles Calculating Percentiles: The (approximate) value of the 𝑘 𝑡ℎ percentile, denoted by 𝑷 𝒌 , is 𝑷 𝒌 ≈𝐯𝐚𝐥𝐮𝐞 𝐨𝐟 𝐭𝐡𝐞 𝒌𝒏 𝟏𝟎𝟎 𝒕𝒉 𝐭𝐞𝐫𝐦 𝐢𝐧 𝐚 𝐫𝐚𝐧𝐤𝐞𝐝 𝐬𝐞𝐭 where 𝑘 denotes the number of the percentile and n represents the sample size.

Percentiles Note: 𝒑= 𝒌𝒏 𝟏𝟎𝟎

Percentiles Some Special Percentiles: Deciles - divide the data set into ten equal parts Quartiles - divide the data set into four equal parts

Percentile Rank (𝑘) defined as the percentage of scores with values lower than the score in question Finding Percentile Rank of a Value: 𝒌= 𝐧𝐮𝐦𝐛𝐞𝐫 𝐨𝐟 𝐯𝐚𝐥𝐮𝐞𝐬 𝐥𝐞𝐬𝐬 𝐭𝐡𝐚𝐧 𝒙 𝐭𝐨𝐭𝐚𝐥 𝐧𝐮𝐦𝐛𝐞𝐫 𝐨𝐟 𝐯𝐚𝐥𝐮𝐞𝐬 𝐢𝐧 𝐭𝐡𝐞 𝐝𝐚𝐭𝐚 𝐬𝐞𝐭 ×𝟏𝟎𝟎

Some PROPERTIES 𝐦𝐞𝐝 𝐱 = 𝐏 𝟓𝟎 = 𝐃 𝟓 = 𝐐 𝟐 *[median] 𝑷 𝒌 in relation with probability (particularly cumulative %): 𝑷 𝑿≤ 𝑷 𝒌 ≈ 𝒌 𝟏𝟎𝟎 i.e., the cumulative percentage of observations with values less than or equal to 𝑷 𝒌 is approximately 𝒌% A more (statistically) robust measure of variability can be defined using quartiles, i.e., the inter- quartile range (IQR) defined as 𝐈𝐐𝐑= 𝑸 𝟑 − 𝑸 𝟏

Example 2 Consider the following data set which relates again to the student’s number of hours studied each day over a 2-week period. 2.5 3.2 3.8 1.3 1.4 0.0 0.0 2.6 5.2 4.8 0.0 4.6 2.8 3.3 Compute, and interpret whenever appropriate, for the following: a.) 𝑷 𝟑𝟑 e.) 𝑸 𝟐 b.) 𝑷 𝟖𝟓 f.) 𝑸 𝟑 c.) 𝑫 𝟏 g.) 𝐈𝐐𝐑 d.) 𝑸 𝟏 h.) 𝐩𝐞𝐫𝐜𝐞𝐧𝐭𝐢𝐥𝐞 𝐫𝐚𝐧𝐤 𝐨𝐟 𝟑.𝟖

BoxPlot (Box-and-Whiskers Plot) a graphical representation of a summary of five important values: the minimum value, the first quartile, the median (or the second quartile), the third quartile, and the maximum value [i.e., the five-number summary]

BoxPlot (Box-and-Whiskers Plot) Steps in Constructing a Boxplot: Rank the data in increasing order and calculate the values of the median ( 𝑄 2 ), first quartile ( 𝑄 1 ), and third quartile ( 𝑄 3 ). Also find the interquartile range (IQR). Find the lower and upper inner fences. 𝐋𝐨𝐰𝐞𝐫 𝐈𝐧𝐧𝐞𝐫 𝐅𝐞𝐧𝐜𝐞= 𝐐 𝟏 −𝟏.𝟓×𝐈𝐐𝐑 𝐔𝐩𝐩𝐞𝐫 𝐈𝐧𝐧𝐞𝐫 𝐅𝐞𝐧𝐜𝐞= 𝐐 𝟑 +𝟏.𝟓×𝐈𝐐𝐑 Determine the smallest and the largest values in the given data set within the two inner fences.

BoxPlot (Box-and-Whiskers Plot) Draw a horizontal line and mark the levels on it such that all the values in the given data set are covered. Above or below the horizontal line, draw a box with its left side at the position of the first quartile and the right side at the position of the third quartile. Inside the box, draw a vertical line at the position of the median. By drawing two lines, join the points of the smallest and the largest values within the two inner fences of the box. These two lines are called whiskers.

BoxPlot (Box-and-Whiskers Plot) The observations that fall outside the two inner fences are called outliers. They are either mild or extreme outliers. To determine such, there is a need to find the lower and upper outer fences. 𝐋𝐨𝐰𝐞𝐫 𝐎𝐮𝐭𝐞𝐫 𝐅𝐞𝐧𝐜𝐞= 𝐐 𝟏 −𝟑.𝟎×𝐈𝐐𝐑 𝐔𝐩𝐩𝐞𝐫 𝐎𝐮𝐭𝐞𝐫 𝐅𝐞𝐧𝐜𝐞= 𝐐 𝟑 +𝟑.𝟎×𝐈𝐐𝐑 Values outside of the inner fences but inside of the outer fences (yellow card zone) are referred to as mild outliers. Values outside of both fences (red card zone) are referred to as extreme outliers.

BoxPlot (Box-and-Whiskers Plot)

Some PROPERTIES In relation with skewness (i.e., characterizing asymmetry):

Example 3 Construct a boxplot for the data given below. 2.5 3.2 3.8 1.3 1.4 0.0 0.0 2.6 5.2 4.8 0.0 4.6 2.8 3.3