Presentation on theme: "Measures of Central Tendency"— Presentation transcript:
1 Measures of Central Tendency Descriptive Statistics
2 Measures of Central Tendency A measure of central tendency is a value that represents a typical, or central, entry of a data set. The three most commonly used measures of central tendency are:the meanthe medianthe mode
3 The MeanThe mean (arithmetic average)of a data set is the sum of the data entries divided by the number of entries. To find the mean of a data set, use one of the following formulas.Population (Parameter) Mean: μ = Ʃ x / NSample (Statistic) Mean: x = Ʃ x / nThe lowercase Greek letter μ (pronounced mu) represents the population mean and x (read as “x bar”) represents the sample mean. Note that N represents the number of entries in a population and n represents the number of entries in a sample.
4 Finding a Sample MeanThe prices (in dollars) for a sample of room air conditioners are listed. What is the mean price of the air conditioners?$500 $840 $470 $480 $420 $440 $440=3590 /7= or $General Rounding Rule: In statistics the basic rounding rule is that when computations are done in the calculations, rounding should not be done until the final answer is calculated. When rounding is done in the intermediate steps, it tends to increase the difference between that answer and the exact one.
5 The MedianThe median of a data set is the value that lies in the middle of the data when the data set is ordered. If the data set has an odd number of entries, the median is the middle data entry. If the data set has an even number of entries, the median is the mean of the two middle data entries
6 Finding the MedianFind the median of the air conditioner prices given in the previous example.$420 $440 $440 $470 $480 $500 $840Because there are seven entries (an odd number), the median is the middle, or fourth data entry. So therefore the median air conditioning price is $470.00
7 Finding the MedianWhat if we added 600 to our data? Find the median of the air conditioner prices given in this example.$420 $440 $470 $480 $500 $600 $840Because there are now eight entries (an even number), the median is the middle, of the fourth and fifth data entry. Therefore we must add the middle numbers and divide by 2 to find the median air conditioning price.$420 $440 $440 $470 $480 $500 $600 $840= /2=$475
8 The ModeThe mode of a data set is the data entry that occurs with the greatest frequency ( 1 mode =unimodal). If no entry is repeated, the data set has no mode. If the two entries occur with the same greatest frequency, each entry is a mode and the data is called bimodal. More than two modes is multimodal.The mode is the only measure of central tendency that can be used to describe data at the nominal level of measurement.
9 Finding the ModeFind the mode of the air conditioning prices in our previous example.From the ordered data, you can see that the entry of 440 occurs twice, whereas the other data entries occur once. So the mode of the air conditioning prices is $440.00
10 Measures of Central Tendency Although the mean, the median, and the mode each describe a typical entry of a data set, there are advantages and disadvantages of using each, especially when the data set contains outliners.Outliners is a data entry that is far removed from the other entries in the data set.
11 Comparing the Mean, the Median and Mode Find the mean, the median and the mode of the sample ages of a class shown at the left. Which measure of central tendency best describes a typical entry of this data set? Are there any outliners?Mean = 475/20 = 23.8Median = 21+22/2= 21.5Mode = The entry occurring the greatest is 20.Ages in a Class202122232465
12 Comparing the Mean, the Median and Mode Mode = The entry occurring the greatest is 20.Interpretation: The mean takes every entry into account but is influenced by the outliner of 65. The median also takes into account every entry and it is not affected by the outliner. In this case the mode exists, but it doesn’t appear to represent a typical entry.Ages in a Class202122232465
13 Graphical ComparisonSometimes a graphical comparison can help you decide which measure of central tendency best represents a data set. In this case the median best describes the data set.
14 MidrangeThe midrange is a rough estimate of the middle. It is found by adding the lowest and the highest values in the data set and dividing by 2. It is a very rough estimate of the average and can be affected by one extremely high or low value.
15 Weighted Mean and Mean of Grouped Data Sometimes data sets contain entries that have greater effect on the mean than do other entries. To find the mean of such data sets, you must find the weighted mean.A weighted mean is the mean of a data set whose entries have varying weights. A weighted mean is given byWhere w is the weight of each entry x.
16 Finding a Weighted Mean SourceScore, xWeight, wxwTest mean860.5043.0Nine weeks exam960.1514.4Semester Exam820.2016,4ComputerLab980.109.8Homework1000.055.0Ʃ w = 1Ʃ (x*w)=88.6You are taking a class in which your grade is determined from 5 sources; 50% from your test mean, 15% 9 weeks test mean, 20% for your semester exam, 10% computer lab work, and 5% homework. Your scores are 86 test mean, 96 nine weeks test, 82 semester exam, 98 computer lab work and 100 homework. What is the weighted mean of your scores.
17 What if data is presented in a frequency distribution? The mean of a frequency distribution for a sample is approximated by where x and f are the midpoints and frequencies of a class, respectively.SamplePopulation
18 Guidelines for finding the mean of a frequency distribution
19 Finding the mean of a frequency distribution Use the frequency distribution at the right to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session /50 = 41.8xFrequency, f(x*f)12.5675.024.510245.036.513474.548.58388.060.55302.572.5435.084.52169.0n=50Ʃ =
20 The Shapes of Distributions A graph reveals several characteristics of a frequency distribution. One such characteristic is the shape of the distribution.A frequency distribution is symmetric when a vertical line can be drawn through the middle of a graph of the distribution and the resulting halves are approximately mirror images.A frequency distribution is uniform (or rectangle) when all entries, or classes, in the distribution have equal frequencies. A uniform distribution is also symmetric.A frequency distribution is skewed if the “tail” of the graph elongates more to one side than to the other. A distribution is skewed left (negatively skewed) if its tail extends to the left. A distribution is skewed right (positively skewed) if its tail extends to the right.
21 Shapes cont…When a distribution is symmetric and unimodal, the mean, median, and the mode are equal. If a distribution is skewed left, the mean is less than the median and the median is usually less than the mode. If a distribution is skewed right, the mean is greater than the median and the median is usually greater than the mode.The mean will always fall in the direction the distribution is skewed. For instance, when the distribution is skewed left, the mean is to the left of the median.