Statistics 1. How long is a name? To answer this question, we might collect some data on the length of a name.


1 Statistics 1

2 How long is a name? To answer this question, we might collect some data on the length of a name.

3 How long is a name? First we need to establish our target population.

4 How long is a name? First we need to establish our target population. Let’s say in this mathematics class.

5 How long is a name? What names should we use?

6 How long is a name? What names should we use? Names as listed on the roll.

7 Data

8 Averaging An average is a measure of central tendency. There are 3 measures which we can use: MEAN, MEDIAN and MODE.

9 Mean Usually when we say average, we are referring to the mean. To find the mean, we add up all the numbers and divide by how many there are.

10 Example Find the mean of 4, 0, 2, 1, 6
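As a minimal sketch of the rule on the previous slide (add up all the numbers, then divide by how many there are), the example can be worked in Python. The variable names are illustrative and not part of the original slides.

```python
# Mean of the example data: add up the values, divide by the count.
data = [4, 0, 2, 1, 6]

total = sum(data)         # 4 + 0 + 2 + 1 + 6 = 13
count = len(data)         # 5 values
mean_value = total / count

print(mean_value)         # 2.6
```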

11 In Excel we can use the formula =AVERAGE(range), where range is the highlighted cells.

12 Data on names

13 Median The median is the middle value when the data are put in order. If there is an odd number of values, there is a single middle value. If there is an even number of values, we average the two middle values.

14 Example Find the median of 4, 8, 2, 9, 1 First put them in order 1, 2, 4, 8, 9

15 Example Find the median of 4, 8, 2, 9, 1 First put them in order 1, 2, 4, 8, 9 The middle number is ‘4’

16 Example Find the median of 4, 8, 2, 9, 1, 6 First put them in order 1, 2, 4, 6, 8, 9 The middle numbers are ‘4’ and ‘6’. Averaging them gives a median of 5.
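The two median examples can be checked with a short sketch using Python's standard `statistics` module, which sorts the data and averages the two middle values when the count is even. The variable names are illustrative.

```python
from statistics import median

odd_data = [4, 8, 2, 9, 1]       # 5 values: a single middle value
even_data = [4, 8, 2, 9, 1, 6]   # 6 values: average the two middle values

print(sorted(odd_data))          # [1, 2, 4, 8, 9]
odd_median = median(odd_data)    # middle value is 4

print(sorted(even_data))         # [1, 2, 4, 6, 8, 9]
even_median = median(even_data)  # (4 + 6) / 2 = 5.0

print(odd_median, even_median)
```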

17 Sort the data in Excel or use the formula =MEDIAN(data)

18 Mode The mode is the most common number. A data set can have more than one mode.

19 Example Find the mode of 6, 4, 3, 7, 8, 6, 7, 2

20 Example Find the mode of 6, 4, 3, 7, 8, 6, 7, 2 There are two modes 6 and 7
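A small sketch of the same example in Python, assuming Python 3.8 or later, where `statistics.multimode` returns every value that shares the highest count. The explicit count with `Counter` is only there to show the reasoning.

```python
from collections import Counter
from statistics import multimode

data = [6, 4, 3, 7, 8, 6, 7, 2]

# Count how often each value appears: 6 and 7 both appear twice.
counts = Counter(data)

# multimode lists all values tied for most common, in order of first appearance.
modes = multimode(data)

print(counts[6], counts[7])  # 2 2
print(modes)                 # [6, 7]
```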

21 Using Excel Formula =MODE(data) Be careful: Excel will only give one mode.

22 Which average is the best? Generally we use the mean, as it includes all the data. If we have extreme values, the median is a better measure because it is not affected by them.

23 Example These are the incomes of a group of university students. $2400, $1500, $2000, $1800, $22 000 Find the best ‘average’.

24 Example $2400, $1500, $2000, $1800, $22 000 The mean is not representative whereas the median is.
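To see why the mean is not representative here, a short sketch can compute both averages for the five incomes. The variable names are illustrative.

```python
from statistics import mean, median

# Incomes of the five university students, in dollars.
incomes = [2400, 1500, 2000, 1800, 22000]

mean_income = mean(incomes)      # (2400 + 1500 + 2000 + 1800 + 22000) / 5
median_income = median(incomes)  # middle of 1500, 1800, 2000, 2400, 22000

print(mean_income)    # 5940
print(median_income)  # 2000
```

The single income of $22 000 pulls the mean above four of the five incomes, while the median stays among the typical values.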

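25 Frequency tables
Length   Tally             Frequency
3        ll                2
4        llll              5
5        llll llll llll    14
6        llll ll           7
7        llll              5
8        ll                2
(A tally group counts as five when its fifth stroke is drawn across the other four; the crossing strokes are not shown here.)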

26 Mode is 5

27 Median is also 5

28 Mean is 5.4

29 Calculating the mean by hand
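The working for this slide is not in the transcript. A minimal sketch of the usual approach, using the lengths and frequencies from the frequency table, is to multiply each length by its frequency, add the products, and divide by the total frequency. The variable names are illustrative.

```python
# Name lengths and their frequencies from the frequency table.
lengths = [3, 4, 5, 6, 7, 8]
frequencies = [2, 5, 14, 7, 5, 2]

# Total number of names.
total = sum(frequencies)  # 35

# Sum of (length x frequency): 6 + 20 + 70 + 42 + 35 + 16.
weighted_sum = sum(x * f for x, f in zip(lengths, frequencies))  # 189

mean_length = weighted_sum / total
print(mean_length)  # 5.4
```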

30 Using the calculator Select STAT mode. Place the data in List 1. Place the frequencies in List 2. Choose CALC, then SET, and set 1Var XList to List1 and 1Var Freq to List2. Press EXE, then choose 1Var.

31 Measures of spread It is not enough to just give the ‘average’. The mean, median and mode are the same for all 3 sets of data:
48 49 50 50 51 52
40 45 50 50 50 55 60
0 0 50 50 50 100 100
But the data sets are quite different.

32 Measures of spread Range is (highest number) - (lowest number). For our data set, the first names have a range of 8 - 3 = 5
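As a sketch, the three data sets from the earlier measures-of-spread slide can be run through the averages and the range together. It shows identical averages but very different ranges. The variable names are illustrative.

```python
from statistics import mean, median, mode

data_sets = [
    [48, 49, 50, 50, 51, 52],
    [40, 45, 50, 50, 50, 55, 60],
    [0, 0, 50, 50, 50, 100, 100],
]

# For each set: (mean, median, mode, range), where range = highest - lowest.
summaries = [
    (mean(d), median(d), mode(d), max(d) - min(d))
    for d in data_sets
]

for s in summaries:
    print(s)
```

Each set has a mean, median and mode of 50, but the ranges are 4, 20 and 100.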

33 Measures of spread Again, if there are extreme values, the range can distort the true spread of the data.

34 5-number summary We often summarise sorted data with a 5-number summary: the lowest value, lower quartile, median, upper quartile and highest value. These split the data into 4 groups.

35 Example 1: 1 14 29 35 43 48 49 78 82 82 92 95 95 (13 numbers)

36 Example 1: 1 14 29 35 43 48 49 78 82 82 92 95 95 Lowest is 1. Median is 49. Highest is 95.

37 Example 1: 1 14 29 35 43 48 49 78 82 82 92 95 95 Lowest is 1. Lower quartile is 35. Median is 49. Upper quartile is 82. Highest is 95.

38 Example 2: 9 11 17 22 23 28 30 36

39 Example 2: 9 11 17 22 23 28 30 36 Median is 22.5. Lower quartile is 14. Upper quartile is 29.

40 Example 2: 9 11 17 22 23 28 30 36 The 5-number summary is 9 14 22.5 29 36
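Both examples can be reproduced with a short sketch. The slides do not state a quartile rule, so the convention below is an assumption inferred from the two examples: each quartile is the median of one half of the sorted data, and when the count is odd the overall median is included in both halves. Other conventions (including those used by some calculators and spreadsheets) can give different quartiles. The function name `five_number_summary` is hypothetical.

```python
from statistics import median

def five_number_summary(values):
    """Return (lowest, lower quartile, median, upper quartile, highest).

    Assumed convention: quartiles are medians of the lower and upper
    halves; with an odd count the middle value belongs to both halves.
    """
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half + n % 2]  # includes the middle value when n is odd
    upper = data[half:]          # includes the middle value when n is odd
    return (data[0], median(lower), median(data), median(upper), data[-1])

example_1 = [1, 14, 29, 35, 43, 48, 49, 78, 82, 82, 92, 95, 95]
example_2 = [9, 11, 17, 22, 23, 28, 30, 36]

summary_1 = five_number_summary(example_1)
summary_2 = five_number_summary(example_2)
print(summary_1)
print(summary_2)
```

Under this assumed convention, the results agree with the values given for Example 1 and Example 2.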

41 For first names in our class The 5-number summary is 3 4 5 6 8. Lower quartile is 4. Upper quartile is 6. Interquartile range is the difference between the quartiles: 6 - 4 = 2

42 Statistics so far Central tendencies: Mean = 5.4 Median = 5 Mode = 5 Because the mean and median are about the same, we wouldn’t expect extreme values.

43 Statistics so far Measures of spread: Range = 5 Interquartile range = 2

44 Statistics so far 5 - number summary 3 4 5 6 8

