Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

Similar presentations


Presentation on theme: "1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central."— Presentation transcript:

1 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central Measures (mean, median, mode and modal class) Shape of the Data

2 2 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central Measures (mean, median, mode and modal class) Shape of the Data Measures of Variability

3 3 Outlier A data measurement which is unusually large or small compared to the rest of the data. Usually from: –Measurement or recording error –Measurement from a different population –A rare, chance event.

4 4 Advantages/Disadvantages Mean Disadvantages – is sensitive to outliers Advantages –always exists – very common – nice mathematical properties

5 5 Advantages/Disadvantages Median Disadvantages – does not take all data into account Advantages –always exists –easily calculated –not affected by outliers –nice mathematical properties

6 6 Advantages/Disadvantages Mode Disadvantages – does not always exist, there could be just one of each data point – sometimes more than one Advantages –appropriate for qualitative data

7 7 Review A data set is skewed if one tail of the distribution has more extreme observations than the other. http://www.shodor.org/interactivate/activities/ SkewDistribution/

8 8 Review Skewed to the right: The mean is bigger than the median.

9 9 Review Skewed to the left: The mean is less than the median.

10 10 Review When the mean and median are equal, the data is symmetric

11 11 Numerical Measures of Variability These measure the variability or spread of the data.

12 12 Numerical Measures of Variability These measure the variability or spread of the data. Relative Frequency 0 13452 0.3 0.4 0.5 0.2 0.1

13 13 Numerical Measures of Variability These measure the variability or spread of the data. Relative Frequency 0 13452 0.3 0.4 0.5 0.2 0.1

14 14 Numerical Measures of Variability These measure the variability or spread of the data. Relative Frequency 0 13452 0.3 0.4 0.5 0.2 0.1 6 7

15 15 Numerical Measures of Variability These measure the variability, spread or relative standing of the data. –Range –Standard Deviation –Percentile Ranking –Z-score

16 16 Range The range of quantitative data is denoted R and is given by: R = Maximum – Minimum

17 17 Range The range of quantitative data is denoted R and is given by: R = Maximum – Minimum In the previous examples the first two graphs have a range of 5 and the third has a range of 7.

18 18 Range R = Maximum – Minimum Disadvantages: –Since the range uses only two values in the sample it is very sensitive to outliers. –Give you no idea about how much data is in the center of the data.

19 19 What else? We want a measure which shows how far away most of the data points are from the mean.

20 20 What else? We want a measure which shows how far away most of the data points are from the mean. One option is to keep track of the average distance each point is from the mean.

21 21 Mean Deviation The Mean Deviation is a measure of dispersion which calculates the distance between each data point and the mean, and then finds the average of these distances.

22 22 Mean Deviation Advantages: The mean deviation takes into account all values in the sample. Disadvantages: The absolute value signs are very cumbersome in mathematical equations.

23 23 Standard Deviation The sample variance, denoted by s², is:

24 24 Standard Deviation The sample variance, denoted by s², is: The sample standard deviation is The sample standard deviation is much more commonly used as a measure of variance.

25 25 Example Let the following be data from a sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. Find: a) The range b) The standard deviation of this sample.

26 26 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range b) The standard deviation of this sample. 2432521452

27 27 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range b) The standard deviation of this sample. 2432521452

28 28 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range b) The standard deviation of this sample. 2432521452 10

29 29 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range b) The standard deviation of this sample. 2432521452 10 2 -212 1101414141

30 30 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2432521452 10 2 -212 1101414141

31 31 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2432521452 10 2 -212 1101414141

32 32 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2432521452 10 2 -212 1101414141

33 33 Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. Standard Deviation:

34 34 More Standard Deviation There is a “short cut” formula for finding the variance and the standard deviation

35 35 More Standard Deviation There is a “short cut” formula for finding the variance and the standard deviation

36 36 More Standard Deviation Use this to find the standard deviation of the previous example:

37 37 More Standard Deviation Use this to find the standard deviation of the previous example: 2432521452

38 38 More Standard Deviation Use this to find the standard deviation of the previous example: 2432521452 41694254116254

39 39 More Standard Deviation Use this to find the standard deviation of the previous example: 2432521452 41694254116254

40 40 More Standard Deviation Use this to find the standard deviation of the previous example: 2432521452 41694254116254 30 108

41 41 More Standard Deviation 2432521452 41694254116254 30 108

42 42 More Standard Deviation 2432521452 41694254116254 30 108

43 43 More Standard Deviation 2432521452 41694254116254 30 108

44 44 More Standard Deviation Like the mean, we are also interested in the population variance (i.e. your sample is the whole population) and the population standard deviation. The population variance and standard deviation are denoted σ and σ 2 respectively.

45 45 More Standard Deviation The population variance and standard deviation are denoted σ and σ 2 respectively. ****The formula for population variance is slightly different than sample variance

46 46 Example - Calculator Find the mean, median, mode, range and standard deviation for the following sample of data: 2.3, 2.5, 2.6, 2.7, 3.0, 3.4, 3.4, 3.5, 3.5, 3.5, 3.7, 3.8 Use your calculator

47 47 Using your Calculator Change calculator to statistics mode. (SD if you have it) Enter in the data and then press the  key, or data key. Keep entering data by pressing the  key, or data key until complete. To obtain the summary data, find the key for the sample mean and the s key or  n-1 key to display the sample standard deviation.

48 48 2.3, 2.5, 2.6, 2.7, 3.0, 3.4, 3.4, 3.5, 3.5, 3.5, 3.7, 3.8 Change calculator to statistics mode. (SD if you have it) Enter in the data and then press the  key, or data key. Keep entering data by pressing the  key, or data key until complete. To obtain the summary data, find the key for the sample mean and the s key or  n-1 key to display the sample standard deviation.

49 49 Example - Calculator Find the mean, median, mode, range and standard deviation for the following sample of data: 2.3, 2.5, 2.6, 2.7, 3.0, 3.4, 3.4, 3.5, 3.5, 3.5, 3.7, 3.8 Answer: Mode = 3.5 M = 3.4 Range = 1.5

50 50 Example – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively.

51 51 Example – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. We wish to know if any of are data points are outliers. That is whether they don’t fit with the general trend of the rest of the data.

52 52 Example – Using Standard Deviation 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. We wish to know if any of are data points are outliers. That is whether they don’t fit with the general trend of the rest of the data. To find this we calculate the number of standard deviations each point is from the mean.

53 53 Example – Using Standard Deviation To find this we calculate the number of standard deviations each point is from the mean. To simplify things for now, work out which data points are within a)one standard deviation from the mean i.e. b)two standard deviations from the mean i.e. c)three standard deviations from the mean i.e.

54 54 Example – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. Work out which data points are within a) one standard deviation from the mean i.e. b) two standard deviations from the mean i.e. c) three standard deviations from the mean i.e.

55 55 Example – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. Work out which data points are within a) one standard deviation from the mean i.e. 59, 70, 73, 75, 81, 84, 86 b) two standard deviations from the mean i.e. 59, 70, 73, 75, 81, 84, 86 c) three standard deviations from the mean i.e. 35, 59, 70, 73, 75, 81, 84, 86


Download ppt "1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central."

Similar presentations


Ads by Google