Presentation is loading. Please wait.

Presentation is loading. Please wait.

Problems: Q&A chapter 6, problems Chapter 6:

Similar presentations


Presentation on theme: "Problems: Q&A chapter 6, problems Chapter 6:"— Presentation transcript:

1

2 Problems: Q&A chapter 6, problems 1-10. Chapter 6: 6.1 6.5 6.11 6.27 6.35 6.41 6.45

3 Note: Examples in chapter 6 are based on data that come from a normal distribution: N(, ) and most usually we are estimating  based on the unrealistic assumption that  is known. Formal methods for estimating  when  is unknown follow in later chapters.

4

5

6

7

8

9 Example: Suppose we wish to compute the true mean score on the US college entrance exam, the population mean SAT score. These scores are known to be distributed normally with mean  and standard deviation . That is, if x1, x2, .., xn, represents an SRS from the population of students taking the SAT, we say that the x’s are normally distributed with mean  and standard deviation , or, more formally: x1, x2, .., xn ~ N(, ). We don’t know the mean, . Also, in a real situation, we will not know the standard deviation, . But we will proceed by assuming that we know  and that  = Also, assume that the sample size is n=500.

10 Our objective is to estimate  from the sample data x1, x2, .., xn.
We know (from chapter 5) that an unbiased estimator of  is the sample mean We also know (from chapter 5) that the sample mean also has a normal distribution with mean, the parameter we are estimating (because the sample mean is an unbiased estimator) and variance equal to  divided by the square root of the number of observations in the SRS. In other words ~ N(, /n).

11 The task in chapter 6 is to use this information to make formal probability statements about the unknown value . All of these statements pertain to the fact that the sample mean has a normal distribution with known variance 2.

12 Three important points that follow from the normal distribution:
The rule states that 95% of the time, the sample mean will lie within  2 standard deviations of the population mean . This implies that 95% of the time the population mean  lies within  2 standard deviations of the sample mean. Hence, 95% of all sample means calculated from SRS’s will contain the true value in the interval

13

14 Suppose we take an SRS and calculate the sample mean at 495.
We construct the interval 4959 = (495-9, 495+9) = (486, 504). Either (486, 504) or (486, 504). The first situation arises 95% of the time and the second situation arises 5% of the time if we repeated the experiment (take an SRS, calculate the mean, calculate the interval) a very large number of times.

15

16 Confidence interval A level C confidence interval for a parameter is an interval computed from the sample data by a method that has probability C of producing an interval containing the true value of the parameter. Convention is to use C = 0.95 or C = 0.90.

17 Formal construction of confidence intervals
This relies on two facts. First, all normal distributions have the same standardized forms. That is, Second, the central limit theorem (chapter 5) tells us that the normal distribution is approximately correct, even if the population is not normal.

18 Now, we find the central C area under the standard normal curve and use Table A to construct the “critical values”, -z* and +z*. For C = 0.95, z* = For C = 0.95, z* = For C = 0.99, z* =

19

20 Formally:

21

22

23 We call the component the “margin of error.” Opportunities for reducing the margin of error are three, namely, reducing z*, reducing , or increasing n.

24

25

26

27 Controlling the margin of error
Usually, we will choose conventional levels of C, for example, C = 0.95 and C = 0.90. The population standard deviation is often out of our control. Only the sample size, n, is within our control. We can select n, to set the margin of error,

28 Formally:

29

30

31 6.2 Tests of significance

32

33

34

35

36 Example: (example 6.6 page 453)
Someone states their weight as “187 pounds.” We take four measures over a one month period, x1 = 190.5, x2 = 189.0, x3 = 195.5, x1 = Sample mean = 190.5, and we know that the weights are normally distributed with (unknown) mean  and (known) standard deviation  = 3. Are these measures compatible with a true population mean of

37 Example: (continued) First, restate: What is the probability of observing a sample mean of or larger from a population with a true mean 187.0? Second, formalize: Null hypothesis, Ho:  = Alternative hypothesis, Ha:   (This is a one-sided hypothesis.)

38 Example: (continued) Alternatively: First, restate: What is the probability of observing a sample mean of or larger from a population with a true mean 187.0? Second, formalize: Null hypothesis, Ho:  = Alternative hypothesis, Ha:   (This is a two-sided hypothesis.)

39 In this first example, we will compute the probability of observing a sample mean of or larger when the true population mean is 1.87 to be prob = 0.01, which is 1% and, thus small. Hence, we conclude that the data are not compatible with the hypothesis, or, more precisely, we conclude that it is only 1% of the time likely that the computed sample mean could be consistent with the null hypothesis.

40 Suppose we take another sample and find that the computed sample mean is 188.5.
This number is much closer to the hypothesized value and, hence, it seems more likely that this result could have been generated by the hypothesized population mean We will compute the probability of the event “obtaining a sample mean greater than or equal to when the true population mean is to be So, about 16% of the time we expect such a result. This proportion is not particularly small, so we say that the data are not inconsistent with the hull hypothesis.

41

42 Calculations: All based on the z distribution, the standard-normal N(0, 1). There are two ways to proceed. First is based on a test statistic. Second is based on P-values. Both approaches exploit the z distribution.

43 Formally:

44

45 Once a critical value is chosen, z
Once a critical value is chosen, z*, we reject the null hypothesis in favour of the alternative hypothesis if the computed value, z, lies further in the tail than the critical value z*. Important: Note the rules for one sided tests in the greater-than direction. Note the rules for one sided tests in the less-than direction. Note the rules for two-sided tests.

46 For one sided tests of the greater-than variety:
Ho:  = o Ha:   o We reject Ho in favor of Ha if z  z*.

47 For one sided tests of the less-than variety:
Ho:  = o Ha:   o We reject Ho in favor of Ha if z  z*.

48 For one sided tests of the not-equal-to variety:
Ho:  = o Ha:   o We reject Ho in favor of Ha if z  zL* or z  zU*.

49 Example: Two-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 You select C = 0.95 and compute 1-C/2 = From Table D (last line) you find zL* = -1.96, and zU* = You note z > zU* and you reject the null in favour of the alternative.

50 Example: One-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 You select C = 0.95 and compute 1-C = 0.05. From Table D (last line) you find z* = You note z > z* and you reject the null in favour of the alternative.

51 Example: One-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 You select C = 0.95 and compute 1-C = 0.05. From Table D (last line) you find z* = You note z > z* and you do not reject the null in favour of the alternative.

52 Example: Two-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 You select C = 0.95 and compute 1-C/2 = From Table D (last line) you find zL* = -1.96, and zU* = You note zL* < z < zU* and you do not reject the null hypothesis.

53 Example of P-value calculations.
Remember, the probability, computed assuming Ho is true, that the test statistic would take a value as extreme or more extreme than the observed value is the P-value. The smaller the P-value, the stronger the evidence against Ho.

54 Example: One-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 Now, if Ho is true, then z = 1.00 is a single observation from the z, N(0,1), distribution. The probability of observing a z value greater than 1.00 is (from table A) p = =

55 Example: Two-sided significance test. Ho:  = o = 187.0 Ha:   o = 187.0 Now, if Ho is true, then z = 2.33 is a single observation from the z, N(0,1), distribution. The probability of observing a z value greater than 2.33 is (from table A) p = = , but, because this is a two-sided test, we say that the P-value is p = 2  =

56

57

58

59

60

61

62 We may cover this section next week, or not.
Please look at chapter 7 in advance, especially sections 7.1 and 7.2.


Download ppt "Problems: Q&A chapter 6, problems Chapter 6:"

Similar presentations


Ads by Google