Describing Data: Measures of Central Tendency

Slides:



Advertisements
Similar presentations
Measures of Location and Dispersion
Advertisements

Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Part 3 Probabilistic Decision Models
Chapter 1 The Study of Body Function Image PowerPoint
Describing Data: Measures of Dispersion
Describing Data: Frequency Distributions and Graphic Presentation
BUS 220: Elementary Statistics
DESCRIPTIVE STATISTICS
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edwards University.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Describing Data: Measures of Central Tendency
Data Distributions Warm Up Lesson Presentation Lesson Quiz
Learning Objectives In this chapter you will learn about measures of central tendency measures of central tendency levels of measurement levels of measurement.
Measures of Central Tendency
Basic Statistics Measures of Central Tendency.
Lecture 2 Part a: Numerical Measures
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Math Qualification from Cambridge University
Calculating & Reporting Healthcare Statistics
Chapters 3 & 4 Alan D. Smith Descriptive Statistics -
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Slides by JOHN LOUCKS St. Edward’s University.
Describing Data: Numerical Measures
Introductory Mathematics & Statistics
Describing Data: Numerical Measures
3 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Describing Data Measures of Location Goals
Describing Data: Numerical Measures
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Describing Data: Numerical
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Describing Data: Numerical Measures Chapter 03 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Describing Data: Numerical Measures. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics,
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Business Statistics Chapter 1
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.
1-1 What is Statistics? GOALS When you have completed this Part, you will be able to: ONE Understand why we study statistics. TWO Explain what is meant.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 3 Descriptive Statistics: Numerical Methods.
Describing Data: Numerical Measures Chapter 03 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Day 2. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics, uses, advantages, and disadvantages.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Describing Data: Numerical Measures
Statistics for Business
Chapter 3 Describing Data Basic Statistics for Business and Economics
Measures of Central Tendency and Location
Topic 3: Measures of central tendency, dispersion and shape
Chapter 3 Created by Bethany Stubbe and Stephan Kogitz.
Descriptive Statistics
Characteristics of the Mean
Describing Data: Numerical Measures
What is Statistics? GOALS ONE Understand why we study statistics. TWO
LESSON 3: CENTRAL TENDENCY
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Presentation transcript:

Describing Data: Measures of Central Tendency 1-1 Chapter Three Describing Data: Measures of Central Tendency GOALS When you have completed this chapter, you will be able to: ONE Calculate the arithmetic mean, median, mode, weighted mean, and the geometric mean. TWO Explain the characteristics, uses, advantages, and disadvantages of each measure of location. THREE Identify the position of the arithmetic mean, median, and mode for both a symmetrical and a skewed distribution. Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 1999

where µ stands for the population mean. 3-2 Population Mean For ungrouped data, the population mean is the sum of all the population values divided by the total number of population values: where µ stands for the population mean. N is the total number of observations in the population. X stands for a particular value.  indicates the operation of adding.

Parameter: a measurable characteristic of a population. 3-3 EXAMPLE 1 Parameter: a measurable characteristic of a population. The Kiers family owns four cars. The following is the mileage attained by each car: 56,000, 23,000, 42,000, and 73,000. Find the average miles covered by each car. The mean is (56,000 + 23,000 + 42,000 + 73,000)/4 = 48,500

where X stands for the sample mean 3-4 Sample Mean For ungrouped data, the sample mean is the sum of all the sample values divided by the number of sample values: where X stands for the sample mean n is the total number of values in the sample

3-5 EXAMPLE 2 Statistic: a measurable characteristic of a sample.(sometimes called descriptives) A sample of five executives received the following amounts of bonus last year: $14,000, $15,000, $17,000, $16,000, and $15,000. Find the average bonus for these five executives. Since these values represent a sample size of 5, the sample mean is (14,000 + 15,000 +17,000 + 16,000 +15,000)/5 = $15,400.

Properties of the Arithmetic Mean 3-6 Properties of the Arithmetic Mean Every set of interval-level and ratio-level data has a mean. All the values are included in computing the mean. A set of data has a unique mean. The mean is easily affected by unusually large or small data values. The arithmetic mean is the only measure of central tendency where the sum of the deviations of each value from the mean is zero.

3-7 EXAMPLE 3 Consider the set of values: 3, 8, and 4. The mean is 5. Illustrating the fifth property, (3-5) + (8-5) + (4-5) = -2 +3 -1 = 0. In other words,

3-8 Weighted Mean The weighted mean of a set of numbers X1, X2, ..., Xn, with corresponding weights w1, w2, ...,wn, is computed from the following formula:

This describes the money got from a one drink by average 3-9 EXAMPLE 6 During a one hour period on a hot Saturday afternoon cabana boy Chris served fifty drinks. Compute the weighted mean of the price of the drinks. (Price ($), Number sold): (.50,5), (.75,15), (.90,15), (1.10,15). The weighted mean is: $(.50x5 + .75x15 + .90x15 + 1.10x15)/(5 + 15 + 15 + 15) = $43.75/50 = $0.875 This describes the money got from a one drink by average

The Median, (Symbol: Md or Q2) 3-10 The Median, (Symbol: Md or Q2) Median: The midpoint of the values after they have been ordered from the smallest to the largest, or the largest to the smallest. There are as many values above the median as below it in the data array. Note: For an even set of numbers, the median will be the arithmetic average (if it can be calculated) of the two middle numbers.

Compute the median for the following data. 3-11 EXAMPLE 4 Compute the median for the following data. The age of a sample of five college students is: 21, 25, 19, 20, and 22. Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21. the height of four basketball players, in inches, is 76, 73, 80, and 75. Arranging the data in ascending order gives: 73, 75, 76, 80. Thus the median is 75.5

Properties of the Median 3-12 Properties of the Median There is a unique median for each data set. It is not affected by extremely large or small values and is therefore a valuable measure of central tendency when such values occur. It can be computed for ratio-level, interval-level, and ordinal-level data. It can be computed for an open-ended frequency distribution if the median does not lie in an open-ended class.

Quartiles do the same, but they divide data into four sets. Median divided data into two sets. Half of the values are lower than median and other half are higher than median. Quartiles do the same, but they divide data into four sets. Median is actually the middle quartile

Quartiles Lower quartile Q1: At most 25% of observed values are lower that Q1 and at most 75% are higher Upper quartile Q3: At most 25% of observed values are higher that Q1 and at most 75% are lower

Observations (in ascending order!!, n = 11) were Quartiles Observations (in ascending order!!, n = 11) were 1, 1, 3, 5, 7, 23, 32, 32, 41, 42, 42 Median is the middle value = 23 Lower quartile: 0,25 x 11 = 2,75, round to closest higher integer!  3 This means that lower quartile is the value of third observation = 3 Upper quartile: 0,75 x 11 = 8,25, round to closest higher integer  9 This meas that upper quartile is the value of ninth observation = 41

The mode is the value of the observation that appears most frequently. 3-13 The Mode The mode is the value of the observation that appears most frequently. EXAMPLE 5: The exam scores for ten students are: 81, 93, 84, 75, 68, 87, 81, 75, 81, 87. Since the score of 81 occurs the most, the modal score is 81.

3-14 Geometric Mean The geometric mean (GM) of a set of n numbers is defined as the nth root of the product of the n numbers. The formula is: Here the geometric mean is used to average percents, indexes, and relatives.

The interest rate on three bonds were 5, 7, and 4 percent. 3-15 EXAMPLE 7 The interest rate on three bonds were 5, 7, and 4 percent. The geometric mean is =5.192. The arithmetic mean is (6 + 3 + 2)/3 =5.333. The GM gives a more conservative profit figure because it is not heavily weighted by the rate of 7 percent.

Geometric Mean continued 3-16 Geometric Mean continued Another use of the geometric mean is to determine the average percent increase in sales, production or other business or economic series from one time period to another (indexnumbers). The formula for this type of problem is:

That is, the geometric mean rate of increase is 1.27%. 3-17 EXAMPLE 8 The total number of females enrolled in American colleges increased from 755,000 in 1986 to 835,000 in 1995. Here n = 10, so (n-1) = 9. That is, the geometric mean rate of increase is 1.27%.

The Mean of Grouped Data 3-18 The Mean of Grouped Data The mean of a sample of data organized in a frequency distribution is computed by the following formula: , where X = class midpoint and f = class frequency

3-19 EXAMPLE 9 A sample of ten movie theaters in a large metropolitan area tallied the total number of movies showing last week. Compute the mean number of movies showing.

3-20 EXAMPLE 9 continued 61/10 = 6.1 movies

The Median of Grouped Data 3-21 The Median of Grouped Data The median of a sample of data organized in a frequency distribution is computed by the following formula: Median = L + [(n/2 - CF)/f] (i) where L is the lower limit of the median class, CF is the cumulative frequency preceding the median class, f is the frequency of the median class, and i is the median class interval. Sometimes median is defined just as the midpoint of the median class!

Finding the Median Class 3-22 Finding the Median Class To determine the median class for grouped data: Construct a cumulative frequency distribution. Divide the total number of data values by 2. Determine which class will contain this value. For example, if n=50, 50/2 = 25, then determine which class will contain the 25th value -> the median class.

The median class is 5-6, since it contains the 5th value (n/2 =5) 3-23 EXAMPLE 10 The median class is 5-6, since it contains the 5th value (n/2 =5)

From the table, L=5, n=10, f=3, i=2, CF=3. 3-24 EXAMPLE 10 continued From the table, L=5, n=10, f=3, i=2, CF=3. Thus, Median= 5 + [((10/2) - 3)/3](2)= 6.33

The Mode of Grouped Data 3-25 The Mode of Grouped Data The mode for grouped data is approximated by the midpoint of the class with the largest class frequency. The modes in EXAMPLE 10 are 5.5 and 9.5. When two values occur a large number of times, the distribution is called bimodal, as in example 10.

Symmetric Distribution 3-26 Symmetric Distribution zero skewness mode = median = mean

Right Skewed Distribution 3-27 Right Skewed Distribution positively skewed: Mean and Median are to the right of the Mode. Mode<Median<Mean

Left Skewed Distribution 3-28 Left Skewed Distribution Negatively Skewed: Mean and Median are to the left of the Mode. Mean<Median<Mode

Mode = mean - 3(mean - median) Mean = [3(median) - mode]/2 3-29 NOTE If two averages of a moderately skewed frequency distribution are known, the third can be approximated. Mode = mean - 3(mean - median) Mean = [3(median) - mode]/2 Median = [2(mean) + mode]/3