McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.

Slides:



Advertisements
Similar presentations
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Advertisements

Descriptive Statistics
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
Basic Business Statistics 10th Edition
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 2 Describing Data with Numerical Measurements
Department of Quantitative Methods & Information Systems
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Numerical Descriptive Techniques
Chapter 3 Descriptive Statistics: Numerical Methods Statistics for Business (Env) 1.
Chapter 3 – Descriptive Statistics
Methods for Describing Sets of Data
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 2 Descriptive Statistics.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Business Statistics: Communicating with Numbers
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Chapter 2 Describing Data.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Variation This presentation should be read by students at home to be able to solve problems.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Chapter 3 Descriptive Statistics II: Additional Descriptive Measures and Data Displays.
Basic Business Statistics Chapter 3: Numerical Descriptive Measures Assoc. Prof. Dr. Mustafa Yüzükırmızı.
Chapter 2 Descriptive Statistics Section 2.3 Measures of Variation Figure 2.31 Repair Times for Personal Computers at Two Service Centers  Figure 2.31.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Chapter 3 Averages and Variation Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Chapter 2 Describing Data: Numerical
Business and Economics 6th Edition
Descriptive Statistics
Descriptive Statistics: Numerical Methods
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Descriptive Statistics: Numerical Methods
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
St. Edward’s University
Business and Economics 7th Edition
Presentation transcript:

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods

3-2 Descriptive Statistics 3.1Describing Central Tendency 3.2Measures of Variation 3.3Percentiles, Quartiles and Box-and- Whiskers Displays 3.4Covariance, Correlation, and the Least Square Line 3.5Weighted Means and Grouped Data (Optional) 3.6The Geometric Mean (Optional)

3-3 Describing Central Tendency In addition to describing the shape of a distribution, want to describe the data set’s central tendency –A measure of central tendency represents the center or middle of the data

3-4 Parameters and Statistics A population parameter is a number calculated from all the population measurements that describes some aspect of the population A sample statistic is a number calculated using the sample measurements that describes some aspect of the sample

3-5 Measures of Central Tendency Mean,  The average or expected value Median, M d The value of the middle point of the ordered measurements Mode, M o The most frequent value

3-6 The Mean Population X 1, X 2, …, X N  Population Mean Sample x 1, x 2, …, x n Sample Mean

3-7 The Sample Mean and is a point estimate of the population mean  It is the value to expect, on average and in the long run For a sample of size n, the sample mean is defined as

3-8 Example: Car Mileage Case Example 3.1:Sample mean for first five car mileages from Table , 31.7, 30.1, 31.6, 32.1

3-9 The Median The median M d is a value such that 50% of all measurements, after having been arranged in numerical order, lie above (or below) it 1.If the number of measurements is odd, the median is the middlemost measurement in the ordering 2.If the number of measurements is even, the median is the average of the two middlemost measurements in the ordering

3-10 Example: Car Mileage Case Example 3.1: First five observations from Table 3.1: 30.8, 31.7, 30.1, 31.6, 32.1 In order: 30.1, 30.8, 31.6, 31.7, 32.1 There is an odd so median is one in middle, or 31.6

3-11 The Mode The mode M o of a population or sample of measurements is the measurement that occurs most frequently –Modes are the values that are observed “most typically” –Sometimes higher frequencies at two or more values If there are two modes, the data is bimodal If more than two modes, the data is multimodal –When data are in classes, the class with the highest frequency is the modal class The tallest box in the histogram

3-12 Relationships Among Mean, Median and Mode

3-13 Measures of Variation Knowing the measures of central tendency is not enough Both of the distributions below have identical measures of central tendency

3-14 Measures of Variation RangeLargest minus the smallest measurement VarianceThe average of the squared deviations of all the population measurements from the population mean StandardThe square root of the Deviation variance

3-15 The Range Largest minus smallest Measures the interval spanned by all the data For Figure 3.13, largest is 5 and smallest is 3 Range is 5 – 3 = 2 days

3-16 Population Variance and Standard Deviation The population variance (σ 2 ) is the average of the squared deviations of the individual population measurements from the population mean (µ) The population standard deviation (σ) is the positive square root of the population variance

3-17 Variance For a population of size N, the population variance σ 2 is: For a sample of size n, the sample variance s 2 is:

3-18 Standard Deviation Population standard deviation (σ): Sample standard deviation (s):

3-19 Example: Chris’s Class Sizes This Semester Data points are: 60, 41, 15, 30, 34 Mean is 36 Variance is: Standard deviation is:

3-20 Example: Sample Variance and Standard Deviation Example 3.7: data for first five car mileages from Table 3.1 are 30.8, 31.7, 30.1, 31.6, 32.1 The sample mean is 31.26

3-21 The Empirical Rule for Normal Populations If a population has mean µ and standard deviation σ and is described by a normal curve, then –68.26% of the population measurements lie within one standard deviation of the mean: [µ-σ, µ+σ] –68.26% of the population measurements lie within two standard deviations of the mean: [µ-2σ, µ+2σ] –68.26% of the population measurements lie within three standard deviations of the mean: [µ-3σ, µ+3σ]

3-22 Chebyshev’s Theorem Let µ and σ be a population’s mean and standard deviation, then for any value k> 1 At least 100(1 - 1/k 2 )% of the population measurements lie in the interval [µ-kσ, µ+kσ] Only practical for non-mound-shaped distribution population that is not very skewed

3-23 z Scores For any x in a population or sample, the associated z score is The z score is the number of standard deviations that x is from the mean –A positive z score is for x above (greater than) the mean –A negative z score is for x below (less than) the mean

3-24 Coefficient of Variation Measures the size of the standard deviation relative to the size of the mean Coefficient of variation =standard deviation/mean × 100% Used to: –Compare the relative variabilities of values about the mean –Compare the relative variability of populations or samples with different means and different standard deviations –Measure risk

3-25 Percentiles, Quartiles, and Box-and- Whiskers Displays For a set of measurements arranged in increasing order, the p th percentile is a value such that p percent of the measurements fall at or below the value and (100-p) percent of the measurements fall at or above the value The first quartile Q 1 is the 25th percentile The second quartile (or median) is the 50 th percentile The third quartile Q 3 is the 75th percentile The interquartile range IQR is Q 3 - Q 1

3-26 Calculating Percentiles 1.Arrange the measurements in increasing order 2.Calculate the index i=(p/100)n where p is the percentile to find 3.(a) If i is not an integer, round up and the next integer greater than i denotes the p th percentile (b) If i is an integer, the p th percentile is the average of the measurements in the i and i+1 positions

3-27 Percentile Example (p=10 th Percentile) i=(10/100)12=1.2 Not an integer so round up to 2 10 th percentile is in the second position so 11,070 7,52411,07018,21126,81736,55141,286 49,31257,28372,81490,416135,540190,250

3-28 Percentile Example (p=25 th Percentile) i=(25/100)12=3 Integer so average values in positions 3 and 4 25 th percentile (18,211+26,817)/2 or 22,514 7,52411,07018,21126,81736,55141,286 49,31257,28372,81490,416135,540190,250

3-29 Five Number Summary 1.The smallest measurement 2.The first quartile, Q 1 3.The median, M d 4.The third quartile, Q 3 5.The largest measurement Displayed visually using a box-and- whiskers plot

3-30 Box-and-Whiskers Plots The box plots the: –first quartile, Q 1 –median, M d –third quartile, Q 3 –inner fences –outer fences

3-31 Box-and-Whiskers Plots Continued Inner fences –Located 1.5  IQR away from the quartiles: Q 1 – (1.5  IQR) Q 3 + (1.5  IQR) Outer fences –Located 3  IQR away from the quartiles: Q 1 – (3  IQR) Q 3 + (3  IQR)

3-32 Box-and-Whiskers Plots Continued The “whiskers” are dashed lines that plot the range of the data –A dashed line drawn from the box below Q 1 down to the smallest measurement –Another dashed line drawn from the box above Q 3 up to the largest measurement

3-33 Box-and-Whiskers Plots Continued

3-34 Outliers Outliers are measurements that are very different from other measurements –They are either much larger or much smaller than most of the other measurements Outliers lie beyond the fences of the box-and- whiskers plot –Measurements between the inner and outer fences are mild outliers –Measurements beyond the outer fences are severe outliers

3-35 Covariance, Correlation, and the Least Squares Line When points on a scatter plot seem to fluctuate around a straight line, there is a linear relationship between x and y A measure of the strength of a linear relationship is the covariance s xy

3-36 Covariance A positive covariance indicates a positive linear relationship between x and y –As x increases, y increases A negative covariance indicates a negative linear relationship between x and y –As x increases, y decreases

3-37 Correlation Coefficient Magnitude of covariance does not indicate the strength of the relationship –Magnitude depends on the unit of measurement used for the data Correlation coefficient (r) is a measure of the strength of the relationship that does not depend on the magnitude of the data

3-38 Correlation Coefficient Continued Sample correlation coefficient r is always between -1 and +1 –Values near -1 show strong negative correlation –Values near 0 show no correlation –Values near +1 show strong positive correlation Sample correlation coefficient is the point estimate for the population correlation coefficient ρ

3-39 Least Squares Line If there is a linear relationship between x and y, might wish to predict y on the basis of x This requires the equation of a line describing the linear relationship Line is calculated based on least squares line –Discussed in detail in Chapter 13

3-40 Least Squares Line Continued Need to calculate slope (b 1 ) and y- intercept (b 0 )

3-41 Weighted Means Sometimes, some measurements are more important than others –Assign numerical “weights” to the data Weights measure relative importance of the value Calculate weighted mean as where w i is the weight assigned to the i th measurement x i

3-42 Descriptive Statistics for Grouped Data Data already categorized into a frequency distribution or a histogram is called grouped data Can calculate the mean and variance even when the raw data is not available Calculations are slightly different for data from a sample and data from a population

3-43 Descriptive Statistics for Grouped Data (Sample) Sample mean for grouped data: Sample variance for grouped data: f i is the frequency for class i M i is the midpoint of class i n = Σf i = sample size

3-44 Descriptive Statistics for Grouped Data (Population) Population mean for grouped data: Population variance for grouped data: f i is the frequency for class i Mi is the midpoint of class i N = Σf i = population size

3-45 The Geometric Mean (Optional) For rates of return of an investment, use the geometric mean to give the correct wealth at the end of the investment Suppose the rates of return (expressed as decimal fractions) are R 1, R 2, …, R n for periods 1, 2, …, n The mean of all these returns is the calculated as the geometric mean: