Presentation is loading. Please wait.

Presentation is loading. Please wait.

Unit 7 Today we will look at: Normal distributions

Similar presentations


Presentation on theme: "Unit 7 Today we will look at: Normal distributions"— Presentation transcript:

1 Unit 7 Today we will look at: Normal distributions
Sampling distributions of the mean Hypothesis testing again: the z-test One sample z-test Two sample z-test Unit 7 Today we will look at:

2 Normal Distribution The normal distribution is bell shaped
The mean is the central value about which the data values are clustered. The standard deviation is a measure of the spread of the data. The larger the standard deviation the more spread out the data is.

3 Normal Distribution So the normal distribution has
location specified by the population mean, μ and spread specified by the population standard deviation, σ

4 Normal Distribution Normal distribution with different locations

5 Normal Distribution Normal distribution with different spreads

6 Which of the following variables
do you think are distributed normally? The earnings of male employees in a company The heights of M140 students The measurement error of repeated measurements of the same quantity The number of heads obtained when tossing a coin a certain number of times The age of individuals in a human population The mean age of (randomly selected) groups of individuals in a human population

7 Which of the following variables
do you think are distributed normally? The earnings of male employees in a company The heights of M140 students The measurement error of repeated measurements of the same quantity The number of heads obtained when tossing a coin a certain number of times The age of individuals in a human population The mean age of (randomly selected) groups of individuals in a human population

8 Sampling distribution of the sample mean
What does it refer to? Choose one of the following options: the distribution of the various sample sizes the distribution of the different possible values of the sample mean the distribution of the values of the individuals in the population the distribution of the data values in a given sample none of the above

9 Is this distribution a normal distribution?
Sampling distribution of the sample mean What does it refer to? Choose one of the following options: the distribution of the various sample sizes the distribution of the different possible values of the sample mean the distribution of the values of the individuals in the population the distribution of the data values in a given sample none of the above Is this distribution a normal distribution? Let us find out!!

10 Sampling Distribution of the mean of a sample size 2

11 Sampling Distribution of the mean of a sample size 3

12 Sampling distributions of the mean with different sample sizes.
What do you observe happens to the distribution as the sample size increases?

13 The sampling distributions rise more and more sharply
Sampling distributions of the mean with different sample sizes. The sampling distributions rise more and more sharply to a mode, but the mean is generally of a similar value The distributions become more and more compressed i.e. the standard deviation decreases.

14 Example: Is it normal? How would you describe this distribution?
Sampling distribution of means based on earnings data in UK in 2011 No Is it normal? How would you describe this distribution? Right - skewed

15 Sampling distribution of the sample mean

16 Terminology Used Sample Population Mean: 𝑥 = 𝑥 𝑛 𝜇= 𝑥 𝑁
𝑥 = 𝑥 𝑛 𝜇= 𝑥 𝑁 Standard deviation : 𝑠= 𝑥− 𝑥 𝑛− 𝜎= 𝑥−𝜇 2 𝑁−1

17 Normal Distribution (recap)

18 Standard Normal Distribution

19 Transforming to Standard Normal Distribution
Consider the normal distribution of a variable x with mean μ = 10 and standard deviation σ = 2

20 Transforming to Standard Normal Distribution
Shift the whole distribution to the left so that the mode occurs at zero v = x - 10

21 Transforming to Standard Normal Distribution
Then we have distribution of v with standard deviation 2 Divide v by 2 to get distribution with standard deviation 1

22 Transforming to Standard Normal Distribution

23 Question 1 For the normal distributions with the following values of μ and σ, write down the formula to transform x to z a) μ = 25, σ = 4 𝑧= 𝑥−𝜇 𝜎 𝑧= 𝑥−25 4

24 Question 1 For the normal distributions with the following values of μ and σ, write down the formula to transform x to z b) μ = 75, σ = 20 𝑧= 𝑥−𝜇 𝜎 𝑧= 𝑥−75 20

25 Question 1 For the normal distributions with the following values of μ and σ, write down the formula to transform x to z c) μ = 2, σ = 0.5 𝑧= 𝑥−𝜇 𝜎 𝑧= 𝑥−2 0.5

26 Here x= 85, μ = 80 and σ = 4, so using the equation: z = 𝑥 − 𝜇 𝜎
Question 2 The mean number of seeds in a packet of beetroot seeds of a certain variety is 80, and the standard deviation is 4. a) What is the z value corresponding to a packet of seeds containing 85 seeds? 𝑧= 𝑥−𝜇 𝜎 Here x= 85, μ = 80 and σ = 4, so using the equation: z = 𝑥 − 𝜇 𝜎 z = 85 −80 4 = = 1.25 Where is this on the normal distribution? To the right of 𝜇

27 Here x= 70, μ = 80 and σ = 4, so using the equation: z = 𝑥 − 𝜇 𝜎
Question 2 The mean number of seeds in a packet of beetroot seeds of a certain variety is 80, and the standard deviation is 4. b) What is the z value corresponding to a packet of seeds containing 70 seeds? 𝑧= 𝑥−𝜇 𝜎 Here x= 70, μ = 80 and σ = 4, so using the equation: z = 𝑥 − 𝜇 𝜎 z = 70 −80 4 = −10 4 = - 2.5 Where is this on the normal distribution? To the left of 𝜇

28 Question 3 The distribution of the wing span of house sparrows, wp, is normal with μ = 23cm and standard deviation σ = 0.67 cm. a) Transform this to a standard normal distribution 𝑧= 𝑥−𝜇 𝜎 𝑧= 𝑤𝑝−𝜇 𝜎 𝑧 = 𝑤𝑝−

29 Question 3 b) Complete the sentence A wingspan of 24cm is standard deviations the mean wingspan of . 𝑧= 𝑥−𝜇 𝜎 𝑧= 24− =1.5 So a wingspan of 24cm is one and a half standard deviations above the mean wingspan of 23cm

30 Mean & Standard deviation of Sampling distribution of the sample mean
This result holds generally for all sampling distributions, no matter what the population distribution and no matter what sample size is involved.

31 Central Limit Theorem

32 Question 4 Find the standard deviation of the mean for a sample with the following values: μ = 80, σ = 4, for the following sample sizes: 10 SE = 𝜎 𝑛 Here μ = 80, σ = 4 and n = 10 So SE = 𝜎 𝑛 = = (𝑢𝑝𝑡𝑜 2𝑑𝑝)

33 Question 4 Find the standard deviation of the mean for a sample with the following values: μ = 80, σ = 4, for the following sample sizes: b) 20 SE = 𝜎 𝑛 Here μ = 80, σ = 4 and n = 20 So SE = 𝜎 𝑛 = = (𝑢𝑝𝑡𝑜 2𝑑𝑝)

34 Question 4 Find the standard deviation of the mean for a sample with the following values: μ = 80, σ = 4, for the following sample sizes: c) SE = 𝜎 𝑛 Here μ = 80, σ = 4 and n = 100 So SE = 𝜎 𝑛 = = 0. 4

35 What can we deduce from these results?
Question 4 n SE 10 1.26 20 0.89 100 0.4 What can we deduce from these results? As the sample size increases, the standard error of the mean decreases, i.e. the distribution becomes more compressed. We hope the results are more accurate

36 Question 5 Suppose that a given set of M140 final results have a mean of 72 marks and a standard deviation of 15 marks. a) Find the standard deviation of the sample mean results of samples of 30 students. SE = 𝜎 𝑛 Here n= 20 and σ = 15, so the standard deviation of the sample mean score is 𝜎 𝑛 = = 2.738… = 2.74(upto 2dp)

37 Recall Central Limit Theorem
Question 5 Suppose that a given set of M140 final results have a mean of 72 marks and a standard deviation of 15 marks. b) Given the standard error of the means is 𝜎 𝑛 =3.35, what is approximate distribution of the mean for sample of size 30? Recall Central Limit Theorem Sampling distribution of the mean score is approximately Normal with mean μ = 72 and standard deviation σ/√n = 2.74

38 One Sample z - test Four stages of hypothesis testing Set up the hypothesis that we wish to test Determine the sampling distribution of a test statistic Ascertain how unlikely the observed value of the test statistic is Interpret the results Under the assumption that the null hypothesis is true If the test statistic turns out to have a very unlikely value, then either: a very unusual event has happened, or the sample has provided evidence against the correctness of null hypothesis

39 Alternative hypothesis
One Sample z – test: hypothesis The hypothesis that the population mean is equal to A is known as the Null hypothesis; denoted by H0 Null hypothesis H0: μ = A Alternative hypothesis H1: μ ≠ A

40 One Sample z – test: hypothesis
Example: Let’s go back to M140 students in Question 5: A given set of M140 final results have a mean of 72 marks and a standard deviation of 15 marks. Write down the null and alternative hypothesis. H0: The mean mark for students on M140 is equal to 72 H0: μ = 72 H1: The mean mark for students on M140 is not equal to 72 H1:μ ≠ 72

41 One Sample z – test: Test Statistic
If σ is unknown but the sample size is 25 or more z = 𝑥 −𝐴 𝐸𝑆𝐸 where ESE = 𝑠 𝑛 where 𝑥 is the sample mean and s is the sample standard deviation

42 𝑧= 𝑥 −𝐴 𝑆𝐸 = 70 −72 15/ 600 =−3.265… ̴ - 3.27 SE = 𝞼 𝑛 = 15 600
Question 6 Suppose there are 600 students on the Feb presentation and their mean score is 𝑥 =70 and assume their standard deviation is σ = 15. Calculate the test statistic. σ is known z = 𝑥 −𝐴 𝑆𝐸 where SE = 𝞼 𝑛 SE = 𝞼 𝑛 = 𝑧= 𝑥 −𝐴 𝑆𝐸 = 70 −72 15/ =−3.265… ̴ Interpret the result

43 Critical Values & Critical Region
One Sample z – test: Critical Values & Critical Region We know if the null hypothesis, H0, is true, then z should follow the standard normal distribution. So this distribution has a mean of zero. So if the value of z given by our data is too extreme i.e. very large in size ( + or - ) it would suggest that H0 is false. We perform the test at 5% and 1% significance level

44 Critical Values & Critical Region
One Sample z – test: Critical Values & Critical Region At 5% significance level At 1% significance level The values defining the inner ends of the critical region are the critical values

45 One Sample z – test: Analysis & Conclusion Strong evidence against H0
Moderate evidence against H0 Either little or weak evidence against H0

46 Let’s go back to M140 students in Q5 & Q6
A given set of M140 final results have a mean of 72 marks and a standard deviation of 15 marks. Suppose there are 600 students on the Feb presentation and their mean score is 𝑥 =70 Null & Alternate hypotheses: H0: μ = H1: μ ≠ 72 𝑇𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 𝑖𝑠 𝑧 ̴ Interpret the result So there is strong evidence to reject the null hypothesis We conclude that the mean scores for this group of M140 students does appear to be different from 72 marks. And indeed seems to be lower.

47 Let Minitab do the work!

48 Let minitab do the work!

49 Minitab Results! Remember the p-value?

50 One Sample z – test: Recap

51 One Sample z – test: Recap Procedure
Pg 140 Unit 7

52 The null and alternate hypothesis are: H0: μ = 22 H1: μ ≠ 22
Question 7 A sample of 55 cuckoo eggs found in the nests of wrens had a sample mean of 21.13mm and a sample deviation of 2.4 a) Investigate whether the mean of this sample differs from the population mean, believed to be 22mm σ is unknown z = 𝑥 −𝐴 𝐸𝑆𝐸 where ESE = 𝑠 𝑛 The null and alternate hypothesis are: H0: μ = 22 H1: μ ≠ 22 Sample mean and standard deviation are: 𝑥 = 21.13, s = 2.4 So ESE = 𝑠 𝑛 = Therefore z = 𝑥 −𝐴 𝐸𝑆𝐸 = −22 2.4/ = … =

53 What evidence does this provide?
Question 7 z = -2.69 What evidence does this provide? Since ≤ -2.58, H0 is rejected at the 1% significance level. Therefore there is strong evidence against H0 . Thus there is strong evidence that the mean length of cuckoo eggs is not 22mm. Since z is negative, the length of cuckoo eggs is more likely to be less than 22mm.

54 How would this result differ if the sample
Question 7 A sample of 55 cuckoo eggs found in the nests of wrens had a sample mean of 21.13mm and a sample standard deviation of 2.4 Investigate whether the mean of this sample differs from the population mean, believed to be 22mm How would this result differ if the sample standard deviation had been 8.7? σ is unknown z = 𝑥 −𝐴 𝐸𝑆𝐸 where ESE = 𝑠 𝑛 The null and alternate hypothesis are: H0: μ = 22 H1: μ ≠ 22 Sample mean and standard deviation are: 𝑥 = 21.13, s = 8.7 So ESE = 𝑠 𝑛 = Therefore z = 𝑥 −𝐴 𝐸𝑆𝐸 = −22 8.7/ = … =

55 What evidence does this provide?
Question 7 z = -0.74 What evidence does this provide? Since ≤ ≤ 1.96, we do not reject H0 at 5% significance level. Therefore there is little evidence against H0 . Thus there is strong evidence that the mean length of cuckoo eggs is 22mm, confirming the value given for the population mean

56 Two Sample z – test is used to analyse the difference in locations between two populations, say G and B The null hypothesis is: H0: μG = μB Or equivalently H0: μG - μB = 0 The alternative hypothesis is: H1: μG ≠ μB Or equivalently H0: μG - μB ≠ 0

57 Sampling distribution of the difference between two means:

58 Mean and Standard Deviation
Sampling distribution of the difference between two means: Mean and Standard Deviation

59 Two Sample z – test: Test Statistic
Note: If σA and σB are not available then the sample standard deviation values sG and sB are used

60 Two Sample z – test: Procedure
Pg 149 Unit 7

61 One Sample z – test: Recap
Again if σA and σB are not available then the sample standard deviation values sA and sB are used Here the value of SE is calculated as follows: SE = σA 2 nA + σB 2 nB where nAand nBare the sample sizes and σA and σB are the population standard deviations. ESE = sA 2 nA + sB 2 nB The test statistic is z = xA − xB 𝑆𝐸 where xA and xB are the sample means z = xA − xB 𝐸𝑆𝐸

62 Null Hypothesis H0: μf = μm
Question 8 Suppose we wanted to find out if males or females performed better on iCMA41 in M140. Suggest possible hypotheses. Null Hypothesis H0: μf = μm (mean mark for females is equal to mean mark for males) Alternate Hypothesis H1: μf ≠ μm (mean mark for females is not equal to mean mark for males)

63 Question 9 Suppose there are 700 females, 500 males and the sample mean score for the females is 74 and the sample mean score for the males is 72. The sample standard deviation for the females is 16 and the sample standard deviation for the males is 18. Calculate the test statistic and interpret the result. 𝐸𝑆𝐸= 𝑠 𝑓 2 𝑛 𝑓 + 𝑠 𝑚 2 𝑛 𝑚 = ≈1.0068 𝑆𝑜 𝑧= 𝑥 𝑓 − 𝑥 𝑚 𝐸𝑆𝐸 = 74− ≈ 1.99

64 What evidence does this provide?
z = 1.99 What evidence does this provide? Question 9 We reject H0 at the 5% significance level but not at the 1% significance level. So there is moderate evidence to reject the null hypothesis. It seems that females are indeed better at iCMA41!!

65 Question 10 Mean, x St dev, s Sample size, n Species A 128.4 6.99 32 Species B 122.8 10.72 29 Carry out a hypothesis test to investigate whether the joint sizes are the same for both species Null Hypothesis H0: μA = μB Alternate Hypothesis H1: μA ≠ μB = 𝑠 𝐴 2 𝑛 𝐴 + 𝑠 𝐵 2 𝑛 𝐵 𝐸𝑆𝐸 = ≈2.3429 𝑆𝑜, 𝑧 = − ≈2.39 = 𝑥 𝐴 − 𝑥 𝐵 𝐸𝑆𝐸

66 What does this result mean?
z = 2.39, What does this result mean? Question 10 Since 1.96 ≤ 2.39 < 2.58, then the H0 is rejected at the 5% significance level, but not at the 1% level So we can reject the null hypothesis at the 5% level but not at the 1% level, so there is evidence against the null hypothesis, but not very strong evidence. There is moderate evidence that the joint sizes are different. There is weak evidence that the joint sizes are the same, in fact the joints of species A may be larger than those of species B

67 True or false? The sampling distribution of the sample mean is always a normal distribution 99.7% of the population values for a normally distributed variable fall within three standard deviations of the mean The z value represents the number of standard deviations from the mean The mean and median of a normally distributed variable may be different Any normal distribution is symmetric about the mean The curve touches the x-axis at approximately 4 standard deviations below and above the mean

68 True or false?

69 Creating a sampling distribution of the mean
Let the variable x be the number shown when a six sided die is thrown What is the population mean and standard deviation

70 Creating a sampling distribution of the mean
A six sided die is rolled What is the population mean and standard deviation? There are six possible outcomes: 1, 2, 3, 4, 5, 6. So mean 𝞵= = 3.5

71 Creating a sampling distribution of the mean
We know the mean 𝞵= 3.5. We use it to calculate the deviations and squared deviations from mean 1 -2.5 6.25 2 -1.5 2.25 3 -0.5 0.25 4 0.5 5 1.5 6 2.5 1 2 3 4 5 6

72 Creating a sampling distribution of the mean 1 2 3 4 5 6
-2.5 6.25 2 -1.5 2.25 3 -0.5 0.25 4 0.5 5 1.5 6 2.5

73 Creating a sampling distribution of the mean
Let the variable x be the number shown when a six sided die is thrown. How many possible samples of size 2 are there? The possible samples and the resulting sample means are shown in the table below:

74 Resulting sampling distribution of the mean

75 Resulting sampling distribution of the mean

76 Creating a sampling distribution of the mean

77 Creating a sampling distribution of the mean

78 Creating a sampling distribution of the mean

79 Creating a sampling distribution of the mean
We observe the distribution becomes more and more symmetric and bell shaped. This symmetric bell-shaped distribution is called Normal Distributions. peaked and compressed about this peak.

80 Where would z = 2.39 be on the normal distribution diagram?


Download ppt "Unit 7 Today we will look at: Normal distributions"

Similar presentations


Ads by Google