Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 3 Descriptive Measures

Similar presentations


Presentation on theme: "Chapter 3 Descriptive Measures"— Presentation transcript:

1

2 Chapter 3 Descriptive Measures
Section 1 Measures of Center

3 Descriptive Measures Numbers that are used to describe data sets are called descriptive measures. Statistical methods can be used to summarize data Measures of average are also called measures of central tendency or measures of center and include the mean, median, mode, and midrange. Indicate where the center or most typical value of the data set lies. Measures that determine the spread of data values are called measures of variation and include the range, variance, and standard deviation. Change to page 92

4 General Rounding Rule In statistics the basic rounding rule is that when computations are done in the calculation, rounding should not be done until the final answer is calculated.

5 Mean of a Data Set The mean of a data set is the sum of the observations divided by the number of observations. Mean is known as the arithmetic average. Found by adding the values (Sum of values) of the data and dividing by the total number of values. The symbols used are the Greek letter μ (mu) for the population and (x-bar) for the sample. Change to page 92

6 The Mean: Compute the mean by using all the values of the data. Varies less than the median or mode when samples are taken from the same population and all three measures are computed for these samples. Is used in computing other statistics, such as variance. The data set is unique, and not necessarily one of the data values. Cannot be computed for an open-ended frequency distribution. Is affected by extremely high or low values and may not be the appropriate average to use in these situations. Sample Population

7 Example 3.1 Professor Hassett spent one summer working for a small mathematical consulting firm. The firm employed a few senior consultants, who made between $800 and $1050 per week; a few junior consultants, who made between $400 and $450 per week; and several clerical workers, who made $300 per week. The firm required more employees during the first half of the summer than the second half. The tables list typical weekly earnings for the two halves of the summer. New Slide Insert Table 3.1,Table 3.2 and Table 3.4 Table 3.1 Table 3.2 Data Set I Data Set II

8 Solution Example 3.1 Table 3.4
New Slide Insert Table 3.1,Table 3.2 and Table 3.4 Interpretation: The employees who worked in the first half of the summer earned more, on average (a mean salary of $483.85), than those who worked in the second half (a mean salary of $474.00).

9 Median of a Data Set Median – is the middle point in the data set that has been ordered. The symbol for the median is MD. Arrange the data in increasing order. If the number of observations is odd, then the median is the observation exactly in the middle of the ordered list. If the number of observations is even, then the median is the mean of the two middle observations in the ordered list. In both cases, if we let n denote the number of observations, then the median is at position (n + 1) / 2 in the ordered list. Change to page 93

10 The median: is the halfway point (middle value) in a data set when the data is in ascending/descending order. is also a measure of the center. is found by arranging the data in order and selecting the middle point. The symbol for the median is MD. is used to find the center or middle value of a data set. is used when we must determine whether the data values fall into the upper half or lower half of the distribution. is used to find the average of an open-ended distribution. is affected less than the mean by extremely high or extremely low values. either a specific value or may fall between two values.

11 Steps in finding the median
Arrange the data in order Select the middle point OR If even number of data values, add the two middle values and divide by 2.

12 Mode of a Data Set Mode – is the most frequent data value. There is no symbol used. Find the frequency of each value in the data set. If no value occurs more than once, then the data set has no mode. Otherwise, any value that occurs with the greatest frequency is a mode of the data set. Change to page 94

13 The mode: The value that occurs most often in a data set. is used when the most typical case is desired. is the easiest average to compute. can be used when the data are nominal, such as religious preference, gender, or political affiliation. is not always unique. A data set can have more than one mode, or the mode may not exist for a data set. The mode for grouped data is the modal class and is the class with the largest frequency.

14 Notations & Formulas x variable for which we have sample data
n number of values in the sample size x1 first observation ... xn nth observation ∑ Sum of values. Greek letter sigma sample mean. x-bar μ population mean. Greek letter mu N number of values in the population Sample Mean Population Mean

15 The mean of the population is the average of all values of x in the population and is denoted by .
x-bar may vary from sample to sample has only one value. Always assume that data are obtained from a sample unless otherwise specified. The type of mean that considers an additional factor is called the weighted mean. It is used when the values are not equally represented. Rounding rule for the Mean: the mean should be rounded to one more decimal place than occurs in the raw data.

16 This figure shows the relative positions of the mean and median for right-skewed, symmetric, and left-skewed distributions. Note that the mean is pulled in the direction of skewness, that is, in the direction of the extreme observations. For a right-skewed distribution, the mean is greater than the median; for a symmetric distribution, the mean and the median are equal; and, for a left-skewed distribution, the mean is less than the median. New Slide: Insert Figure 3.1 Figure 3.1

17 The median is a resistant measure of center, but the mean is not.
A resistant measure is not sensitive to the influence of a few extreme observations The median is a resistant measure of center, but the mean is not. A Trimmed mean can improve the resistance of the mean: to do this remove the percentage of the smallest and largest observations before computing the mean. New Slide: Insert Figure 3.1

18 Sample Mean Sample Mean is the mean of the sample data.
There are many sample means – one for each possible sample of the population. Change to page 103

19 Population Mean Population Mean or Mean of the variable is the mean of the population data. There is only one population mean – namely, the mean of all the possible observations of the variable for the entire population. Change to page 103

20 Example: Mean = Median = 1, 2, 3, 3, 7, 11, 18, 30, 61
Burglaries at Pennsylvania University. The following data are the number of burglaries reported for a specific year for nine western Pennsylvania universities. Find the mean, median, and mode. Which measure of average might be the best in this case? Explain. 61, 11, 1, 3, 2, 30, 18, 3, 7 Mean = Median = 1, 2, 3, 3, 7, 11, 18, 30, 61 Mode = most repeated = 3 I think the mean is the best measure. The median and mode are to small a number for the sample and who give the impression of less burglaries.


Download ppt "Chapter 3 Descriptive Measures"

Similar presentations


Ads by Google