Download presentation

Presentation is loading. Please wait.

Published byMichaela Bunney Modified over 2 years ago

1
3-1 HYPOTHESIS TESTING

2
3-2 INTRODUCTION One of the ways of estimating the true value of population parameters is to test the validity of the claim(assertion or statement) made about this value using simple statistics e.g. a pharmaceutical company claims the efficacy of a medicine against a disease that 95 % of all persons suffering from the disease get cured

3
3-3 INTRODUCTION To test such claims sample data is collected and analyzed. On the basis of sample findings the hypothesized value of population parameter is either accepted or rejected The process that enables a decision maker to test the validity (or significance ) of the claim by analyzing the difference between the value of sample statistic and the corresponding hypothesized population parameter value is called hypothesis testing

4
3-4 Hypothesis Hypotheses are tentative, intelligent guesses as to the solution of the problem. A hypothesis is an assumption about the population parameter. – A parameter is a characteristic of the population, like its mean or variance. Goal:Make statement(s) regarding unknown population parameter values based on sample data

5
3-5 Types of Hypotheses Null – that no statistically significant difference exists between the parameter and the statistic being compared Alternative – logical opposite of the null hypothesis – that a statistically significant difference does exist between the parameter and the statistic being compared. H 0 always contains = H A matches the question asked; it is what were trying to prove.

6
3-6 Various types of H 0 and H A Case Null Hypothesis Alternate Hypothesis H 0 H A 1. µ = µ 0 µ µ 0 2. µ µ 0 µ > µ 0 3. µ µ 0 µ < µ 0 µ 0 is Hypothesised mean

7
3-7 Always the null hypothesis is tested, i.e., we want to either accept or reject the null hypothesis because we have information only for the null hypothesis. Ho is taken as true and tested for its possible rejection Either H 0 or H A must be true, making the other false. H 0 and H 1 are: Mutually exclusive –Only one can be true. Exhaustive –Together they cover all possibilities, so one or the other must be true.

8
3-8 FORMATS OF HYPOTHESIS Directional Hypothesis. – A hypothesis is a statement to be tested about the true value of population parameter – A hypothesis whether there exists any significant difference between two or more populations with respect to any of their common parameter can also be tested. In this case the hypothesis can be stated in the form if- then statement.Consider If inflation rate has increased then wholesale index will also increase If employees are healthy then they will take less sick leave

9
3-9 FORMATS OF HYPOTHESIS Directional Hypothesis. – If terms such as positive, negative, more than etc are used to make a statement, such a hypothesis is called directional hypothesis as it indicates the direction of the relationship Non Directional Hypothesis. – It indicates a relationship but offers no direction of relationship ( or difference)e.g. There is a relationship between age and job satisfaction There is difference between average pulse rate of men and women

10
3-10 Hypotheses about other parameters such as population proportions and and population variances are also possible. For example H 0 : p 40% H 1 : p < 40% H 0 : H 1 : Hypothesis about other Parameters

11
3-11 RATIONALE FOR HYPOTHESIS TESTING Since statistics are random values, therefore their sampling distributions show the tendency of variation Consequently we do not expect the sample statistic value to be equal to hypothesized value If value of sample statistic differ significantly from the hypothesized parameter value then the question arises whether the hypothesized parameter value is correct or not

12
3-12 RATIONALE FOR HYPOTHESIS TESTING The probability level at which the decision maker concludes that observed difference between the value of the test statistic and hypothesized value cannot be due to chance is called the level of significance

13
3-13 PROCEDURE FOR HYPOTHESIS TESTING STEP 1 –STATE THE NULL HYPOTHESIS (Ho) AND ALTERNATIVE HYPOTHESIS(H1) Null hypothesis Ho represents the claim or statement made about the value or range of value of the population parameter. The letter H stands for hypothesis and zero implies no difference between sample statistic and parameter value

14
3-14 PROCEDURE FOR HYPOTHESIS TESTING Null hypothesis is always expressed in the form of mathematical statement which includes (,,=)making a claim regarding the specific value of the population parameter Ho: μ (,,=) μo where μ is population mean and μo represents a hypothesized value of μ Alternate hypothesis H1 is the counter claim made against the value of particular population parameter. That is alternate hypothesis must be true when null hypothesis is found to be false.

15
3-15 PROCEDURE FOR HYPOTHESIS TESTING In other words alternative hypothesis states that specific population parameter is not equal to the value stated in the null hypothesis H1:μ μo Consequently H1 :μ μo

16
3-16 ERRORS IN HYPOTHESIS TESTING Ideally the hypothesis testing procedure should lead to acceptance of the null hypothesis Ho when it is true and the rejection of Ho when it is not true However the correct decision is not always possible Since the decision to reject or accept a hypothesis is based on sample date there is a possibility of an incorrect decision or error

17
3-17 ERRORS IN HYPOTHESIS TESTING There are two types of error STATE OF NATURE Sample decision Ho is trueHo is false Accept HoCorrect Decision=1- Type II error. P(Type II)= Reject HoType I error. p(Type I)= Correct Decision Power=1-

18
3-18 ERRORS IN HYPOTHESIS TESTING TYPE 1 ERROR This is the probability of rejecting the null hypothesis when it is true and some alternative hypothesis is wrong The probability of making a Type I error is denoted by the symbol α It is represented by the area under the sampling distribution curve over the region of rejection The probability of making a Type I error is referred to as level of significance The complement 1-α of the probability of Type I error measure the probability level of not rejecting a true null hypothesis. It is also referred to as confidence level

19
3-19 ERRORS IN HYPOTHESIS TESTING TYPE II ERROR This is the probability of accepting the null hypothesis when it is false and some alternative hypothesis is true The probability of Type II error varies with the actual values of the population parameter being tested when null hypothesis Ho is false

20
3-20 The power of a statistical hypothesis test is the probability of rejecting the null hypothesis when the null hypothesis is false. Power = (1 - ) The power of a statistical hypothesis test is the probability of rejecting the null hypothesis when the null hypothesis is false. Power = (1 - ) The Power of a Test

21
3-21 The probability of a type II error, and the power of a test, depends on the actual value of the unknown population parameter. The relationship between the population mean and the power of the test is called the power function. 7069686766656463626160 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 Power of a One-Tailed Test: =60, =0.05 Power Value of Power = (1 - ) 61 0.87390.1261 62 0.74050.2695 63 0.55770.4423 64 0.36130.6387 65 0.19630.8037 66 0.08770.9123 67 0.03180.9682 68 0.00920.9908 69 0.00210.9972 Value of Power = (1 - ) 61 0.87390.1261 62 0.74050.2695 63 0.55770.4423 64 0.36130.6387 65 0.19630.8037 66 0.08770.9123 67 0.03180.9682 68 0.00920.9908 69 0.00210.9972 The Power Function

22
3-22 l The power depends on the distance between the value of the parameter under the null hypothesis and the true value of the parameter in question: the greater this distance, the greater the power. l The power depends on the population standard deviation: the smaller the population standard deviation, the greater the power. l The power depends on the sample size used: the larger the sample, the greater the power. The power depends on the level of significance of the test: the smaller the level of significance,, the smaller the power. Factors Affecting the Power Function

23
3-23 To Note: 1.Would like and to be as small as possible. and are inversely related. 3.Usually set (and dont worry too much about.) 4.Most common values for and are 0.01 and 0.05. 23

24
3-24 PROCEDURE FOR HYPOTHESIS TESTING STEP 2-STATE THE LEVEL OF SIGNIFICANCE Select an appropriate significance level (Type I error), it is denoted by α. It is the risk a decision maker takes of rejecting the null hypothesis when it is true Traditionally α = 0.05 for consumer research projects, α = 0.01 for quality assurance and α=0.10 for political polling

25
3-25 Significance level gives the confidence with which a researcher rejects or retain the null hypothesis. If we take 5% level of significance then there are about 5 chances out of 100 that we would reject the Null hypothesis when it should be accepted or that we are 95% confident about the right decision

26
3-26 PROCEDURE FOR HYPOTHESIS TESTING STEP 3- ESTABLISH CRITICAL OR REJECTION REGION The area under the sampling distribution curve of test statistic is divided into two regions. These regions are called acceptance region and the rejection region ( critical region)

27
3-27 Acceptance region and critical regions for two tailed test in hypothesis testing

28
3-28 One-sided hypothesis where the critical region lies in the right tail

29
3-29 One-sided hypothesis where the critical region lies in the left tail

30
3-30 Given the same level of significance the two tailed test is more conservative, i.e., it is more rigorous than the one- tailed test because the rejection point is farther out in the tail. It is more difficult to reject H0 with a two-tailed test than with a one-tailed test.

31
3-31 The tails of a statistical test are determined by the need for an action. If action is to be taken if a parameter is greater than some value a, then the alternative hypothesis is that the parameter is greater than a, and the test is a right-tailed test. H 0 : 50 H 1 : 50 The tails of a statistical test are determined by the need for an action. If action is to be taken if a parameter is greater than some value a, then the alternative hypothesis is that the parameter is greater than a, and the test is a right-tailed test. H 0 : 50 H 1 : 50 If action is to be taken if a parameter is less than some value a, then the alternative hypothesis is that the parameter is less than a, and the test is a left-tailed test. H 0 : 50 H 1 : 50 If action is to be taken if a parameter is less than some value a, then the alternative hypothesis is that the parameter is less than a, and the test is a left-tailed test. H 0 : 50 H 1 : 50 If action is to be taken if a parameter is either greater than or less than some value a, then the alternative hypothesis is that the parameter is not equal to a, and the test is a two-tailed test. H 0 : 50 H 1 : 50 If action is to be taken if a parameter is either greater than or less than some value a, then the alternative hypothesis is that the parameter is not equal to a, and the test is a two-tailed test. H 0 : 50 H 1 : 50 1-Tailed and 2-Tailed Tests

32
3-32 PROCEDURE FOR HYPOTHESIS TESTING STEP 4 –SELECT THE TEST OF SIGNIFICANCE OR TEST STATISTIC The tests of significance or test statistic are divided into two categories: parametric and non parametric Parametric tests are more powerful because their data is derived from interval and ratio measurements Non parametric tests are used to test hypothesis with nominal and ordinal data

33
3-33 Univariate Techniques Parametric Test One Sample t test Z test Two or More Samples Independent t test Z test One way ANOVA Related Paired t test Nonparametric Test One Sample Frequency Chi-square K-S Runs Binomial Two or More Samples Independent Chi-Square Median Related Chi-Square Sign

34
3-34 How to Select a Test Which does the test involve? – one sample, – two samples – k samples If two or k samples,are the individual cases independent or related? Is the measurement scale nominal, ordinal, interval, or ratio?

35
3-35 Cases in which the test statistic is Z is known and the population is normal. is known and the sample size is at least 30. (The population need not be normal) Cases in which the test statistic is Z is known and the population is normal. is known and the sample size is at least 30. (The population need not be normal) Testing Population Means

36
3-36 Cases in which the test statistic is t s is unknown but the sample standard deviation is known and the population is normal. Cases in which the test statistic is t s is unknown but the sample standard deviation is known and the population is normal. Testing Population Means

37
3-37 PROCEDURE FOR HYPOTHESIS TESTING STEP 5 – FORMULATE A DECISION RULE TO ACCEPT NULL HYPOTHESIS Compare the calculated value of test statistic with the critical value ( also called standard value of test statistic). The decision rule for null hypothesis are as follows – Accept Ho if the test statistic value falls within the area of acceptance – Reject otherwise

38
3-38 The p-value is the probability of obtaining a value of the test statistic as extreme as, or more extreme than, the actual value obtained, when the null hypothesis is true. The p-value is the smallest level of significance,, at which the null hypothesis may be rejected using the obtained value of the test statistic. Policy: When the p-value is less than, reject H 0. The p-value is the probability of obtaining a value of the test statistic as extreme as, or more extreme than, the actual value obtained, when the null hypothesis is true. The p-value is the smallest level of significance,, at which the null hypothesis may be rejected using the obtained value of the test statistic. Policy: When the p-value is less than, reject H 0. PROCEDURE FOR HYPOTHESIS TESTING The p-Value approach

39
3-39

40
3-40 EXAMPLE 1 An automatic bottling machine fills cola into two liter (2000 cc) bottles. A consumer advocate wants to test the null hypothesis that the average amount filled by the machine into a bottle is at least 2000 cc. A random sample of 40 bottles coming out of the machine was selected and the exact content of the selected bottles are recorded. The sample mean was 1999.6 cc. The population standard deviation is known from past experience to be 1.30 cc. Test the null hypothesis at the 5% significance level.

41
3-41 H 0 : 2000 H 1 : 2000 n = 40 For = 0.05, the critical value of z is -1.645 The test statistic is: Do not reject H 0 if: [z -1.645] Reject H 0 if: z ]

42
3-42

43
3-43 H 0 : 2000 H 1 : 2000 n = 40 For = 0.05, the critical value of z is -1.645 The test statistic is: Do not reject H 0 if: [p-value ] Reject H 0 if: p-value ] H 0 : 2000 H 1 : 2000 n = 40 For = 0.05, the critical value of z is -1.645 The test statistic is: Do not reject H 0 if: [p-value ] Reject H 0 if: p-value ] Example : p-value approach

44
3-44 As part of a survey to determine the extent of required in-cabin storage capacity, a researcher needs to test the null hypothesis that the average weight of carry-on baggage per person is = 12 pounds, versus the alternative hypothesis that the average weight is not 12 pounds. The analyst wants to test the null hypothesis at = 0.05. H 0 : = 12 H 1 : 12 For = 0.05, critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z <-1.96] or z 1.96] H 0 : = 12 H 1 : 12 For = 0.05, critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z <-1.96] or z 1.96] Examples :2

45
3-45 Since the test statistic falls in the upper rejection region, H 0 is rejected, and we may conclude that the average amount of carry-on baggage is more than 12 pounds.

46
3-46 An insurance company believes that, over the last few years, the average liability insurance per board seat in companies defined as small companies has been $2000. Using = 0.01, test this hypothesis using Growth Resources, Inc. survey data. H 0 : = 2000 H 1 : 2000 For = 0.01, critical values of z are ±2.576 The test statistic is: Do not reject H 0 if: [-2.576 z 2.576] Reject H 0 if: [z <-2.576] or z 2.576] H 0 : = 2000 H 1 : 2000 For = 0.01, critical values of z are ±2.576 The test statistic is: Do not reject H 0 if: [-2.576 z 2.576] Reject H 0 if: [z <-2.576] or z 2.576] Examples :3

47
3-47 Since the test statistic falls in the upper rejection region, H 0 is rejected, and we may conclude that the average insurance liability per board seat in small companies is more than $2000. Examples 3 : Continued

48
3-48 The average time it takes a computer to perform a certain task is believed to be 3.24 seconds. It was decided to test the statistical hypothesis that the average performance time of the task using the new algorithm is the same, against the alternative that the average performance time is no longer the same, at the 0.05 level of significance. H 0 : = 3.24 H 1 : 3.24 For = 0.05, critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z < -1.96] or z 1.96] H 0 : = 3.24 H 1 : 3.24 For = 0.05, critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z < -1.96] or z 1.96] Examples :4

49
3-49 Since the test statistic falls in the nonrejection region, H 0 is not rejected, and we may conclude that the average performance time has not changed from 3.24 seconds. Examples 4 : Continued

50
3-50 According to the Japanese National Land Agency, average land prices in central Tokyo soared 49% in the first six months of 1995. An international real estate investment company wants to test this claim against the alternative that the average price did not rise by 49%, at a 0.01 level of significance. H 0 : = 49 H 1 : 49 n = 18 For = 0.01 and (18-1) = 17 df, critical values of t are ±2.898 The test statistic is: Do not reject H 0 if: [-2.898 t 2.898] Reject H 0 if: [t < -2.898] or t 2.898] H 0 : = 49 H 1 : 49 n = 18 For = 0.01 and (18-1) = 17 df, critical values of t are ±2.898 The test statistic is: Do not reject H 0 if: [-2.898 t 2.898] Reject H 0 if: [t < -2.898] or t 2.898] Examples :5

51
3-51 Since the test statistic falls in the rejection region, H 0 is rejected, and we may conclude that the average price has not risen by 49%. Since the test statistic is in the lower rejection region, we may conclude that the average price has risen by less than 49%. Examples 5: Continued

52
3-52 An investment analyst for Goldman Sachs and Company wanted to test the hypothesis made by British securities experts that 70% of all foreign investors in the British market were American. The analyst gathered a random sample of 210 accounts of foreign investors in London and found that 130 were owned by U.S. citizens. At the = 0.05 level of significance, is there evidence to reject the claim of the British securities experts? H 0 : p = 0.70 H 1 : p 0.70 n = 210 For = 0.05 critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z < -1.96] or z 1.96] H 0 : p = 0.70 H 1 : p 0.70 n = 210 For = 0.05 critical values of z are ±1.96 The test statistic is: Do not reject H 0 if: [-1.96 z 1.96] Reject H 0 if: [z < -1.96] or z 1.96] Examples :6

53
3-53 A floodlight is said to last an average of 65 hours. A competitor believes that the average life of the floodlight is less than that stated by the manufacturer and sets out to prove that the manufacturers claim is false. A random sample of 21 floodlight elements is chosen and shows that the sample average is 62.5 hours and the sample standard deviation is 3. Using =0.01, determine whether there is evidence to conclude that the manufacturers claim is false. H 0 : 65 H 1 : 65 n = 21 For = 0.01 an (21-1) = 20 df, the critical value -2.528 The test statistic is: Do not reject H 0 if: [t -2.528] Reject H 0 if: z ] H 0 : 65 H 1 : 65 n = 21 For = 0.01 an (21-1) = 20 df, the critical value -2.528 The test statistic is: Do not reject H 0 if: [t -2.528] Reject H 0 if: z ] Examples :7

54
3-54 Since the test statistic falls in the rejection region, H 0 is rejected, and we may conclude that the manufacturers claim is false, that the average floodlight life is less than 65 hours. 0.95 -2.528 50-5 0.4 0.3 0.2 0.1 0.0 t f(t) Nonrejection Region Rejection Region Critical Point for a Left-Tailed Test -3.82 Examples 7 : Continued

55
3-55 After looking at 1349 hotels nationwide, weve found 13 that meet our standards. This statement by the Small Luxury Hotels Association implies that the proportion of all hotels in the United States that meet the associations standards is 13/1349=0.0096. The management of a hotel that was denied acceptance to the association wanted to prove that the standards are not as stringent as claimed and that, in fact, the proportion of all hotels in the United States that would qualify is higher than 0.0096. The management hired an independent research agency, which visited a random sample of 600 hotels nationwide and found that 7 of them satisfied the exact standards set by the association. Is there evidence to conclude that the population proportion of all hotels in the country satisfying the standards set by the Small Luxury hotels Association is greater than 0.0096? H 0 : p 0.0096 H 1 : p 0.0096 n = 600 For = 0.10 the critical value 1.282 The test statistic is: Do not reject H 0 if: [z 1.282] Reject H 0 if: z ] H 0 : p 0.0096 H 1 : p 0.0096 n = 600 For = 0.10 the critical value 1.282 The test statistic is: Do not reject H 0 if: [z 1.282] Reject H 0 if: z ] Examples :8

56
3-56 Since the test statistic falls in the nonrejection region, H 0 is not rejected, and we may not conclude that proportion of all hotels in the country that meet the associations standards is greater than 0.0096. 0.90 1.282 50-5 0.4 0.3 0.2 0.1 0.0 z f(z) Nonrejection Region Rejection Region Critical Point for a Right-Tailed Test 0.519 Examples 8 : Continued

57
3-57 EXAMPLE 9 A packaging device is set to fill detergent powder packets with a mean weight of 3 kg, with a standard deviation of 0.21 kg. The weight of packets can be assumed to be normally distributed. The weight of packets is known to drift upwards over a period of time due to machine fault which is no tolerable. A sample of 100 packets is taken and weighed. This sample has a mean weight of 5.03 kg. can we conclude that the mean weight produced by the machine has increased? Use a 5 % level of significance

58
3-58 EXAMPLE 9: Continued Let null hypothesis Ho that mean weight has increased Ho : μ 5 and H1 ; μ < 5 Given n=100,x=5.3 kg, σ=0.21, α= 5% Z= (x- μ) / σn = 5.03-5 /0.21/100=1.428 Since calculated value z(cal) = 1.428 is less than its critical value z(α) =1.645 at α = 0.05, the null hypothesis Ho is accepted Hence we conclude that mean weight is likely to be more than 5 kg

59
3-59 HYPOTHESIS TESTING FOR DIFFERENCE BETWEEN TWO POPULATION MEANS Idea is to test whether there is significant difference between the means of these populations Let μ1 and μ2 be mean of two population σ1 and σ2 be std deviation of two population

60
3-60 HYPOTHESIS TESTING FOR POPULATION PARMETERS WITH LARGE SAMPLES Z statistic = [(x1-x2) – (μ1-μ2)]/ σ x1 –x2 = [(x1-x2) – (μ1-μ2)]/ σ²/n1 + σ²/n2 The null and alternate hypothesis are stated as Null hypothesis Ho :μ1 – μ2 = 0

61
3-61 Alternate hypothesis One tailed test H1:(μ1 –μ2) > 0 H1 : (μ1 –μ2) < 0 Two tailed test H1: (μ1 –μ2) 0

62
3-62 HYPOTHESIS TESTING FOR POPULATION PARMETERS WITH LARGE SAMPLES Decision Rule Reject Ho at a specified level of significance when One Tailed Test Z(cal) >z(α) [or when z<-z(α)] Two tailed test Z(cal)>z(α/2) or z(cal) <-z(α/2)

63
3-63 EXAMPLE 10 A firm believes that the tyres produced by process A on an average last longer than tyres produced by process B. To test this belief, random supplies of tyres produced by the two processes were tested and the result are Is there evidence at a 5 % level of significance that the firm is correct in its belief? ProcessSample sizeAverage lifetime(km) Standard deviation(km) A50224001000 B50218001000

64
3-64 EXAMPLE 10 Let us take the null hypothesis that there is no significant difference in the average life of tyres produced by processes A and B Ho : μ1=μ2 or μ1-μ2 = 0 H1 : μ1=μ2 Given x1(mean) = 22400, x2 (mean) =21800 Σ1=σ2=1000, n1=n2=50 Z= [(x1-x2) – (μ1-μ2)]/ σx1 –x2 = [(x1-x2) – (μ1-μ2)]/ σ²/n1 + σ²/n2

65
3-65 EXAMPLE 10 = 22400 – 21800 /(1000)²/50 + (1000)²/50 = 600/ 20000+20000 = 600/200=3 Since the calculated value z(cal )=3 is more than its critical value z (α/2) =+- 1.645at α =0.5 level of significance therefore Ho is rejected Hence we conclude that the tyre produced by process A last longer than that of B

66
3-66 HYPOTHESIS TESTING FOR SINGLE POPULATION PROPORTION The three forms of null hypothesis and alternative hypothesis pertaining to the hypothesized population proportion Po are as follows Null HypothesisAlternative Hypothesis Ho: P = PoH1: P = Po ( two tailed test) Ho :P PoH1: P < Po (left tailed test) Ho : P PoH1 : P >Po (right tailed test

67
3-67 HYPOTHESIS TESTING FOR SINGLE POPULATION PROPORTION Test statistic z = p-po/σp = P- Po /Po(1-Po)/n One tailed testTwo tailed test Z(cal) > z(α) or Z(cal) < - z(α) When H1 ; p< po Z(cal) > z(α/2) or Z(cal) < - z(α/2) P value <α

Similar presentations

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google