Lesson 1 - 2 Describing Distributions with Numbers adapted from Mr. Molesky’s Statmonkey website.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

HS 67 - Intro Health Statistics Describing Distributions with Numbers
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Click the mouse button or press the Space Bar to display the answers.
CHAPTER 1 Exploring Data
The Five-Number Summary and Boxplots
Lecture 4 Chapter 2. Numerical descriptors
CHAPTER 2: Describing Distributions with Numbers
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Objectives 1.2 Describing distributions with numbers
1.1 Displaying Distributions with Graphs
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Warm-up The number of deaths among persons aged 15 to 24 years in the United States in 1997 due to the seven leading causes of death for this age group.
Exploring Data 1.2 Describing Distributions with Numbers YMS3e AP Stats at LSHS Mr. Molesky 1.2 Describing Distributions with Numbers YMS3e AP Stats at.
Chapter 3 Looking at Data: Distributions Chapter Three
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
Chapter 1 Section 1 Displaying Distributions with Graphs
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Describing Distributions with Numbers
CHAPTER 1 Exploring Data
Chapter 5 : Describing Distributions Numerically I
CHAPTER 2: Describing Distributions with Numbers
1st Semester Final Review Day 1: Exploratory Data Analysis
Summary Statistics 9/23/2018 Summary Statistics
CHAPTER 1 Exploring Data
DAY 3 Sections 1.2 and 1.3.
Please take out Sec HW It is worth 20 points (2 pts
1.2 Describing Distributions with Numbers
Click the mouse button or press the Space Bar to display the answers.
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1 Warm Up .
Exploratory Data Analysis
Chapter 1: Exploring Data
Histograms and Measures of Center vs. Spread
Measures of Center and Spread
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Warm up Honors Algebra 2 3/14/19
Chapter 1: Exploring Data
Presentation transcript:

Lesson Describing Distributions with Numbers adapted from Mr. Molesky’s Statmonkey website

Measures of Spread Variability is the key to Statistics. Without variability, there would be no need for the subject. When describing data, never rely on center alone. Measures of Spread: Range - {rarely used... why?} Quartiles - InterQuartile Range {IQR=Q3-Q1} Variance and Standard Deviation {var and s x } Like Measures of Center, you must choose the most appropriate measure of spread.

Standard Deviation Another common measure of spread is the Standard Deviation: a measure of the “average” deviation of all observations from the mean. To calculate Standard Deviation: Calculate the mean. Determine each observation’s deviation (x - xbar). “Average” the squared-deviations by dividing the total squared deviation by (n-1). This quantity is the Variance. Square root the result to determine the Standard Deviation.

Standard Deviation Properties s measures spread about the mean and should be used only when the mean is used as the measure of center s = 0 only when there is no spread/variability. This happens only when all observations have the same value. Otherwise, s > 0. As the observations become more spread out about their mean, s gets larger s, like the mean x-bar, is not resistant. A few outliers can make s very large

Standard Deviation Variance: Standard Deviation: Example 1.16 (p.85): Metabolic Rates

Standard Deviation x(x - x)(x - x) Totals: Metabolic Rates: mean=1600 Total Squared Deviation Variance var=214870/6 var= Standard Deviation s=√ s= cal What does this value, s, mean?

Example 1 Which of the following measures of spread are resistant? 1.Range 2.Variance 3.Standard Deviation Not Resistant

Example 2 Given the following set of data: 70, 56, 48, 48, 53, 52, 66, 48, 36, 49, 28, 35, 58, 62, 45, 60, 38, 73, 45, 51, 56, 51, 46, 39, 56, 32, 44, 60, 51, 44, 63, 50, 46, 69, 53, 70, 33, 54, 55, 52 What is the range? What is the variance? What is the standard deviation? =

Quartiles Quartiles Q1 and Q3 represent the 25th and 75th percentiles. To find them, order data from min to max. Determine the median - average if necessary. The first quartile is the middle of the ‘bottom half’. The third quartile is the middle of the ‘top half’ medQ3=29.5 Q1=23 med=79Q1Q3

Using the TI-83 Enter the test data into List, L1 –STAT, EDIT enter data into L1 Calculate 5 Number Summary –Hit STAT go over to CALC and select 1-Var Stats and hitt 2 nd 1 (L1) Use 2 nd Y= (STAT PLOT) to graph the box plot –Turn plot1 ON –Select BOX PLOT (4 th option, first in second row) –Xlist: L1 –Freq: 1 –Hit ZOOM 9:ZoomStat to graph the box plot Copy graph with appropriate labels and titles

5-Number Summary, Boxplots The 5 Number Summary provides a reasonably complete description of the center and spread of distribution We can visualize the 5 Number Summary with a boxplot. MINQ1MEDQ3MAX min=45Q1=74med=79Q3=91max= Quiz Scores Outlier?

Determining Outliers InterQuartile Range “IQR”: Distance between Q1 and Q3. Resistant measure of spread...only measures middle 50% of data. IQR = Q3 - Q1 {width of the “box” in a boxplot} 1.5 IQR Rule: If an observation falls more than 1.5 IQRs above Q3 or below Q1, it is an outlier. “1.5 IQR Rule” Why 1.5? According to John Tukey, 1 IQR seemed like too little and 2 IQRs seemed like too much...

Outliers: 1.5 IQR Rule To determine outliers: 1.Find 5 Number Summary 2.Determine IQR 3.Multiply 1.5xIQR 4.Set up “fences” A.Lower Fence: Q1-(1.5∙IQR) B.Upper Fence: Q3+(1.5∙IQR) 5.Observations “outside” the fences are outliers.

Outlier Example Spending ($) IQR= IQR=26.66 IQR= IQR= IQR=1.5(26.66) 1.5IQR= IQR=1.5(26.66) 1.5IQR=39.99 All data on pg 48, #1.6 outliers } fence: = fence: = {

Example 4 Consumer Reports did a study of ice cream bars (sigh, only vanilla flavored) in their August 1989 issue. Twenty-seven bars having a taste-test rating of at least “fair” were listed, and calories per bar was included. Calories vary quite a bit partly because bars are not of uniform size. Just how many calories should an ice cream bar contain? Construct a boxplot for the data above

Example 4 - Answer Q1 = 182Q2 = 221.5Q3 = 319 Min = 111Max = 439Range = 328 IQR = 137UF = 524.5LF = Calories

Example 5 The weights of 20 randomly selected juniors at MSHS are recorded below: a) Construct a boxplot of the data b) Determine if there are any mild or extreme outliers c) Comment on the distribution

Example 5 - Answer Q1 = 130.5Q2 = 138Q3 = Min = 121Max = 213Range = 92 IQR = 15UF = 168 LF = 108 Mean = StDev = Weight (lbs) * * Extreme Outliers ( > 3 IQR from Q3) Shape: somewhat symmetric Outliers: 2 extreme outliers Center: Median = 138Spread: IQR = 15

Linear Transformations Variables can be measured in different units (feet vs meters, pounds vs kilograms, etc) When converting units, the measures of center and spread will change Linear Transformations (x new = a+bx) do not change the overall shape of a distribution Multiplying each observation by b multiplies both the measure of center and spread by b Adding a to each observation adds a to the measure of center, but does not affect spread If the distribution was symmetric, its transformation is symmetric. If the distribution was skewed, its transformation maintains the same skewness

Transformation Example 6 Using the data from example #5 –a) Change the weight from pounds to kilograms and add 2 kg (for a special band uniform) –b) Get summary statistics and compare with example 5 –c) Draw a box plot

Example 6 - Answer Convert Pounds to Kg (  ) and add 2 Q1 = 61.19Q2 = 64.60Q3 = Min = 56.89Max = 98.62Range = IQR = 6.81UF = LF = Mean = (143.6  ) StDev = (23.91  )

Example 6 – Answer cont Transformation follows what we expect: Multiplying each observation by b multiplies both the measure of center and spread by b Adding a to each observation adds a to the measure of center, but does not affect spread If the distribution was symmetric, its transformation is symmetric. If the distribution was skewed, its transformation maintains the same skewness Weight (in Kg) * * Extreme Outliers ( > 3 IQR from Q3)

Day 2 Summary and Homework Summary –Sample variance is found by dividing by (n – 1) to keep it an unbiased (since we estimate the population mean, μ, by using the sample mean, x‾) estimator of population variance –The larger the standard deviation, the more dispersion the distribution has –Boxplots can be used to check outliers and distributions –Use comparative boxplots for two datasets –Identifying a distribution from boxplots or histograms is subjective! Homework –pg 82: prob 33; pg 89 probs 40, 41; pg 97 probs 45, 46