Presentation is loading. Please wait.

Presentation is loading. Please wait.

Copyright © Cengage Learning. All rights reserved. Probability and Statistics.

Similar presentations


Presentation on theme: "Copyright © Cengage Learning. All rights reserved. Probability and Statistics."— Presentation transcript:

1 Copyright © Cengage Learning. All rights reserved. Probability and Statistics

2 Copyright © Cengage Learning. All rights reserved. 14.5 Descriptive Statistics (Numerical)

3 3 Objectives ► Introduction to Statistical Measures ► Measures of Central Tendency: Mean, Median, Mode ► Measures of Spread: Variance and Standard Deviation

4 4 Descriptive Statistics (Numerical) Data usually consist of thousands or even millions of numbers. The first goal of statistics is to describe such huge sets of data in simpler terms. One way to make sense of data is to find a “typical” number or the “center” of the data. Any such number is called a measure of central tendency.

5 5 There are 3 Measures of Central Tendency: Mean, Median, Mode What do each of these tell us about the Data?

6 6 Example 3 – Effect of Outliers on the Mean and Median The following table gives the selling prices of houses sold in 2007 in a small coastal California town. (a) Find the mean house price. (b) Find the median house price. (c)Which value is a more “typical” value? Why?

7 7 Measures of Central Tendency: Mean, Median, Mode If a data set includes a number that is “far out” or far away from the rest of the data, that data point is called an outlier. In general, when a data set has outliers, the median is a better indicator of central tendency than the mean.

8 8 Measures of Central Tendency: Mean, Median, Mode The mode of a data set is a summary statistic that is usually less informative than the mean or median, but has the advantage of not being limited to numerical data.

9 9 Measures of Central Tendency: Mean, Median, Mode The mode of the data set 1, 1, 2, 2, 2, 3, 5, 8 is the number 2. The data set 1, 2, 2, 3, 5, 5, 8 has two modes: 2 and 5. Data sets with two modes are called bimodal. The data set 1, 2, 4, 5, 7, 8 has no mode.

10 10 Organizing Data: Frequency Tables Sometimes listing the data in a special way can help us get useful information about the data. One such method the frequency table. A frequency table for a set of data is a table that includes each different data point and the number of times that point occurs in the data. The mode is most easily determined from a frequency table.

11 11 Example 4 – Using a Frequency Table The scores obtained by the students in an algebra class on a five-question quiz are given in the following frequency table. Find the mean, median, and mode of the scores. Frequency Table

12 12 Example 4 – Solution The mode is 5, because more students got this score than any other score. The total number of quizzes is 16 + 8 + 5 + 5 + 3 + 3 = 40. To find the mean, we add all the scores and divide by 40. Note that the score 5 occurs 16 times, the score 4 occurs 8 times, and so on. So the mean score is

13 13 Example 4 – Solution There are 40 students in this class. If we rank the scores from highest to lowest, the median score is the average of the 20th and 21st scores. From frequency column in the table, we see that these scores are each 4. So the median score is 4. cont’d

14 14 Measures of Spread: Standard Deviation Measures of central tendency identify the “center” or “typical value” of the data. Measures of spread (also called measures of dispersion) describe the spread or variability of the data around a central value. For example, find the mean of each of the following sets of numbers. 50, 58, 78, 81, 93 72, 71, 72, 72, 73 Although, the means are the same, we say that the first data set shows more variability than the second.

15 15 Measures of Spread: Standard Deviation The most important measure of variability in statistics is the standard deviation. Standard deviation measures the average deviation (or difference) from the mean.

16 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 16 Example: Finding the Sample Standard Deviation The starting salaries are for the Chicago branches of a corporation. The corporation has several other branches, and you plan to use the starting salaries of the Chicago branches to estimate the starting salaries for the larger population. Find the sample standard deviation of the starting salaries. Starting salaries (1000s of dollars) 41 38 39 45 47 41 44 41 37 42 Larson/Farber 4th ed. 16

17 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 17 Solution: Finding the Sample Standard Deviation Larson/Farber 4th ed. 17 First find the mean Next calculate each deviation from the mean Sum these deviations What do you notice??? Salary, xDeviation: x – μ Squares: (x – μ) 2 4141 – 41.5 = –0.5(–0.5) 2 = 0.25 3838 – 41.5 = –3.5(–3.5) 2 = 12.25 3939 – 41.5 = –2.5(–2.5) 2 = 6.25 4545 – 41.5 = 3.5(3.5) 2 = 12.25 4747 – 41.5 = 5.5(5.5) 2 = 30.25 4141 – 41.5 = –0.5(–0.5) 2 = 0.25 4444 – 41.5 = 2.5(2.5) 2 = 6.25 4141 – 41.5 = –0.5(–0.5) 2 = 0.25 3737 – 41.5 = –4.5(–4.5) 2 = 20.25 4242 – 41.5 = 0.5(0.5) 2 = 0.25 Σ(x – μ) = 0 SS x = 88.5

18 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 18 Since adding all deviations together would total zero, we square each deviation and find an average of sorts for the deviations.

19 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 19 Solution: Finding the Sample Standard Deviation Larson/Farber 4th ed. 19 Sample Standard Deviation The sample standard deviation is about 3.1, or $3100.

20 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 20 The standard deviation, s, is measured in the same units as the original data.

21 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 21 Solution: Using Technology to Find the Standard Deviation Larson/Farber 4th ed. 21 Sample Mean Sample Standard Deviation

22 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 22 Example: Using Technology to Find the Standard Deviation Sample office rental rates (in dollars per square foot per year) for Miami’s central business district are shown in the table. Use a calculator to find the mean rental rate and the sample standard deviation. (Adapted from: Cushman & Wakefield Inc.) Larson/Farber 4th ed. 22 Office Rental Rates 35.0033.5037.00 23.7526.5031.25 36.5040.0032.00 39.2537.5034.75 37.7537.2536.75 27.0035.7526.00 37.0029.0040.50 24.5033.0038.00

23 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 23 Interpreting Standard Deviation Standard deviation is a measure of the typical amount an entry deviates from the mean. The more the entries are spread out, the greater the standard deviation. Larson/Farber 4th ed. 23

24 Copyright © 2010, 2007, 2004 Pearson Education, Inc. Slide 4 - 24 Thinking About Variation Since Statistics is about variation, spread is an important fundamental concept of Statistics. Measures of spread help us talk about what we don’t know. When the data values are tightly clustered around the center of the distribution, standard deviation will be small. When the data values are scattered far from the center, standard deviation will be large.

25 25 Example 7 – Calculating Standard Deviation Two machines are used in filling 16-ounce soda bottles. To test how consistently each machine fills the bottles, a sample of 20 bottles from the output of each machine is selected. Find the standard deviation for each machine. Which machine is more consistent in filling the bottles?

26 26 Example 7 – Solution The standard deviations  x and  y for Soda Machines I and II, respectively, are Soda Machine I is more consistent in filling the bottles because the standard deviation of the data from Machine I is much smaller than that of the data from Machine II. cont’d


Download ppt "Copyright © Cengage Learning. All rights reserved. Probability and Statistics."

Similar presentations


Ads by Google