Chapter 2 Descriptive Statistics Section 2.3 Measures of Variation Figure 2.31 Repair Times for Personal Computers at Two Service Centers  Figure 2.31.

Slides:



Advertisements
Similar presentations
Descriptive Statistics
Advertisements

Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 2 Describing Data with Numerical Measurements
Describing Data Using Numerical Measures
Department of Quantitative Methods & Information Systems
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Enter these data into your calculator!!!
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
Methods for Describing Sets of Data
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 2 Descriptive Statistics.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Business Statistics: Communicating with Numbers
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
1 MATB344 Applied Statistics Chapter 2 Describing Data with Numerical Measures.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Chapter 2 Describing Data.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Describing distributions with numbers
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Averages and Variation Chapter 3 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Chapter 3 Descriptive Statistics II: Additional Descriptive Measures and Data Displays.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 CHEBYSHEV'S THEOREM For any set of data and for any number k, greater than one, the.
Chapter 3 Data Description Section 3-3 Measures of Variation.
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Unit 3: Averages and Variations Part 3 Statistics Mr. Evans.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 2 Describing Data: Numerical
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics: Numerical Methods
NUMERICAL DESCRIPTIVE MEASURES
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics: Numerical Methods
Percentiles and Box-and- Whisker Plots
Numerical Measures: Skewness and Location
Describing Data with Numerical Measures
St. Edward’s University
NUMERICAL DESCRIPTIVE MEASURES
Presentation transcript:

Chapter 2 Descriptive Statistics Section 2.3 Measures of Variation Figure 2.31 Repair Times for Personal Computers at Two Service Centers  Figure 2.31 indicates that we need measures of variation to express how the two distributions differ.

Chapter 2 Descriptive Statistics Range Largest minus the smallest measurement The Population Variance (pronounced sigma squared) The average of the squared deviations of all the population measurements from the population mean Standard Deviation (pronounced sigma) The square root of the variance Measures of Variation

Chapter 2 Descriptive Statistics The Range Range = largest measurement - smallest measurement The range measures the interval spanned by all the data Example 2.3: Internist’s Salaries (in thousands of dollars) Range = = 114 ($114,000)

Chapter 2 Descriptive Statistics The Variance Population X 1, X 2, …, X N Sample x 1, x 2, …, x n Sample Variance ss Population Variance 

Chapter 2 Descriptive Statistics The Variance For a population of size N, the population variance is defined as For a sample of size n, the sample variance s 2 is defined as and is a point estimate for  2

Chapter 2 Descriptive Statistics The Standard Deviation Population Standard Deviation,  : Sample Standard Deviation, s:

Chapter 2 Descriptive Statistics Example 2.5 Example 2.5 Consider the population of profit margins for five of the best big companies in America as rated by Forbes magazine on its website on March 16, These profit margins are 8%, 10%, 15%, 12% and 5%. Population Mean Population Variance Population Standard Deviation

Chapter 2 Descriptive Statistics Sample variance and standard deviation for first five car mileages from Table , 31.7, 30.1, 31.6, 32.1 Example 2.6 Example 2.6 The Car Mileage Case = /4 = 0.643

Chapter 2 Descriptive Statistics Sample variance and standard deviation for all car mileages from Table 2.1,. The point estimate of the variance of all cars is mpg 2 and the point estimate of the standard deviation of all cars is mpg.

Chapter 2 Descriptive Statistics The computational formula for the sample variance Example 2.7 Example 2.7 The Payment Time Case Consider the sample of 65 payment times in Table 2.2. Therefore andDays.

Chapter 2 Descriptive Statistics The Empirical Rule for Normal Populations If a population has mean  and standard deviation  and is described by a normal curve ( symmetrical, bell-shaped distribution ), then % of the population measurements lie within one standard deviation of the mean: [  % of the population measurements lie within two standard deviations of the mean: [  2  2  % of the population measurements lie within three standard deviations of the mean: [  3  3 

Chapter 2 Descriptive Statistics      68% 95% 99.7% Empirical Rule

Chapter 2 Descriptive Statistics Tolerance Intervals An Interval that contains a specified percentage of the individual measurements in a population is called a tolerance interval.  The one, two, and three standard deviation intervals around given in (1), (2) and (3) are tolerance intervals containing, respectively, percent, percent and percent of the measurements in a normally distributed population.  The three-sigma interval to be a tolerance interval that contains almost all of the measurements in a normally distributed population.

Chapter 2 Descriptive Statistics 68.26% of all individual cars will have mileages in the range 68.26% of all individual cars will have mileages in the range 95.44% of all individual cars will have mileages in the range 95.44% of all individual cars will have mileages in the range 99.73% of all individual cars will have mileages in the range 99.73% of all individual cars will have mileages in the range Example 2.8 Example 2.8 mpg The Car Mileage Case and standard deviation S=

Chapter 2 Descriptive Statistics Example 2.8 Example 2.8 The Car Mileage Case

Chapter 2 Descriptive Statistics Skewness and the Empirical Rule  The Empirical Rule holds for normally distributed populations.  This rule also approximately holds for populations having mound-shaped (single-peaked) distributions that are not very skewed to the right or left.  For example, Recall that the distribution of 65 payment times, it indicates that the empirical rule holds.

Chapter 2 Descriptive Statistics Chebyshev’s Theorem Let  and  be a population’s mean and standard deviation, then for any value k > 1, At least 100(1 - 1/k 2 )% of the population measurements lie in the interval: [  k  k  Use only for non-mound distributions Skewed  But not extremely skewed. (If a population is extremely skewed, it is best to measure variation by using percentiles.) Bimodal

Chapter 2 Descriptive Statistics Empirical Rule and Chebyshev’s Theorem Empirical RuleChebyshev PopulationNormal Mound-shaped, not very skewed Any population but not extremely skewed [ μ-σ, μ+σ] About 68% [ μ-2σ, μ+2σ] About 95%At least 75% [ μ-3σ, μ+3σ] About 99.7%At least 88.89%

Chapter 2 Descriptive Statistics Exercise Shipping Times (Chebychev’s Theorem and the Empirical Rule) A group of 13 students is studying in Istanbul, Tukey, for five weeks. As part of their study of the local economy, they each purchased an Oriental rug and arranged for its shipment back to the United States. The shipping time, in days, For each rug was Estimate the percentage of days that are within two standard deviations of the Mean. Is it likely to take 2 months for a delivery?

Chapter 2 Descriptive Statistics z Scores For any x in a population or sample, the associated z score is The z score is the number of standard deviations that x is from the mean A positive z score is for x above (greater than) the mean A negative z score is for x below (less than) the mean

Chapter 2 Descriptive Statistics Example 2.9 Example 2.9 Population of profit margins for five big American companies: 8%, 10%, 15%, 12%, 5%  = 10%,  = 3.406%

Chapter 2 Descriptive Statistics Coefficient of Variation Measures the size of the standard deviation relative to the size of the mean Coefficient of Variation = ×100% Used to: compare the relative variabilities of values about the mean Compare the relative variability of populations or samples with different means and different standard deviations Measure risk standard deviation mean

Chapter 2 Descriptive Statistics Table: Rates of Return: Asset A and B Which of these two single asset is better? Rates of Return Year Asset A (house) Asset B (stock) 5 years ago 11.3% 9.4% 4 years ago years ago years ago year ago Total Average rate of Return 12.2% 12.3% Standard deviation Expected Return

Chapter 2 Descriptive Statistics Section 2.4 Percentiles, Quartiles and Box- and-Whiskers Display For a set of measurements arranged in increasing order, the p th percentile is a value such that p percent of the measurements fall at or below the value and (100-p) percent of the measurements fall at or above the value The first quartile Q 1 is the 25th percentile The second quartile (or median) M d is the 50th percentile The third quartile Q 3 is the 75th percentile The interquartile range IQR is Q 3 - Q 1

Chapter 2 Descriptive Statistics One of Procedures for calculating Percentiles 1.Arrange the measurements in increasing order. 2. Calculate the index A. If i is not integer, the next integer greater than i denotes the Position of the pth percentile in the ordered arrangement. B. If i is integer, then the pth percentile is the average of the The measurement in positions i and i+1 in the ordered arrangement

Chapter 2 Descriptive Statistics Example 2.10 Example 2.10 DVD Recorder Satisfaction 20 customer satisfaction ratings: Q 1 = (7+8)/2 = 7.5 M d = (8+8)/2 = 8 Q 3 = (9+9)/2 = 9 IQR = Q 3  Q 1 = 9  7.5 = 1.5

Chapter 2 Descriptive Statistics Five-Number Summary Smallest Value First Quartile Median Third Quartile Largest Value

Chapter 2 Descriptive Statistics Using stem-and-leaf displays to find percentiles (a)The 75th percentile of the 65 payment times, and a five- number summary

Chapter 2 Descriptive Statistics (b) The 5 th percentile of the 60 bottle design ratings and a five-number summary

Chapter 2 Descriptive Statistics Box-and-Whiskers Plots The box plots the: The box plots the: first quartile, Q 1 first quartile, Q 1 median, M d median, M d third quartile, Q 3 third quartile, Q 3 inner fences, located 1.5  IQR away from the quartiles: inner fences, located 1.5  IQR away from the quartiles: = Q 1 – (1.5  IQR) = Q 1 – (1.5  IQR) = Q 3 + (1.5  IQR) = Q 3 + (1.5  IQR) outer fences, located 3  IQR away from the quartiles: outer fences, located 3  IQR away from the quartiles: = Q 1 – (3  IQR) = Q 1 – (3  IQR) = Q 3 + (3  IQR) = Q 3 + (3  IQR)

Chapter 2 Descriptive Statistics The “whiskers” are dashed lines that plot the range of the data The “whiskers” are dashed lines that plot the range of the data A dashed line drawn from the box below Q 1 down to the smallest measurement A dashed line drawn from the box below Q 1 down to the smallest measurement Another dashed line drawn from the box above Q 3 up to the largest measurement Another dashed line drawn from the box above Q 3 up to the largest measurement Example: 20 customer satisfaction ratings:

Chapter 2 Descriptive Statistics Outliers Outliers are measurements that are very different from most of the other measurements Outliers are measurements that are very different from most of the other measurements Because they are either very much larger or very much smaller than most of the other measurements Because they are either very much larger or very much smaller than most of the other measurements Outliers lie beyond the fences of the box-and-whiskers plot Outliers lie beyond the fences of the box-and-whiskers plot Measurements between the inner and outer fences are mild outliers Measurements between the inner and outer fences are mild outliers Measurements beyond the outer fences are severe outliers Measurements beyond the outer fences are severe outliers

Chapter 2 Descriptive Statistics

Section 2.5 Describing Qualitative Data Pie charts of the proportion (as percent) of all cars sold in the United States by different manufacturers, 1970 versus 1997

Chapter 2 Descriptive Statistics Population and Sample Proportions Population X 1, X 2, …, X N p Population Proportion Sample x 1, x 2, …, x n Sample Proportion p is the point estimate of p ^

Chapter 2 Descriptive Statistics Example 2.11 Example 2.11 The Marketing Ethics Case 117 out of 205 marketing researchers disapproved of action taken in a hypothetical scenario X = 117, number of researches who disapprove n = 205, number of researchers surveyed Sample Proportion:

Chapter 2 Descriptive Statistics Bar Chart Percentage of Automobiles Sold by Manufacturer, 1970 versus 1997

Chapter 2 Descriptive Statistics Pie Chart Percentage of Automobiles Sold by Manufacturer,1997

Chapter 2 Descriptive Statistics An Bar Chart of U.S Automobile Sales in 1997

Chapter 2 Descriptive Statistics Misleading Graphs and Charts: Scale Break Break the vertical scale to exaggerate effect Mean Salaries at a Major University,

Chapter 2 Descriptive Statistics Misleading Graphs and Charts: Scale Effects Compress vs. stretch the vertical axis to exaggerate or minimize the effect Mean Salary Increases at a Major University,

Chapter 2 Descriptive Statistics Weighted Means Sometimes, some measurements are more important than others Assign numerical “weights” to the data Weights measure relative importance of the value Calculate weighted mean as where w i is the weight assigned to the ith measurement x i

Chapter 2 Descriptive Statistics Example 2.12 Example 2.12 June 2001 unemployment rates in the U.S. by region Want the mean unemployment rate for the U.S.

Chapter 2 Descriptive Statistics Calculate it as a weighted mean So that the bigger the region, the more heavily it counts in the mean The data values are the regional unemployment rates The weights are the sizes of the regional labor forces Note that the unweigthed mean is 4.55%, which underestimates the true rate by 0.03% That is,  million = 43,410 workers