The smokers’ proportion in H.K. is 40%. How to testify this claim ?

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Testing a Claim about a Proportion Assumptions 1.The sample was a simple random sample 2.The conditions for a binomial distribution are satisfied 3.Both.
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Binomial Distribution & Hypothesis Testing: The Sign Test
STATISTICAL INFERENCE PART V
Chapter 10: Hypothesis Testing
Making Inferences for Associations Between Categorical Variables: Chi Square Chapter 12 Reading Assignment pp ; 485.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
Major Points An example Sampling distribution Hypothesis testing
Inference about a Mean Part II
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Section 7-2 Hypothesis Testing for the Mean (n  30)
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Probability Population:
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Hypothesis Testing 12/01/2011. Hypothesis-testing is a very important area of statistical inference. It is a procedure which enables us to decide on the.
Overview of Statistical Hypothesis Testing: The z-Test
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Presented by Mohammad Adil Khan
Hypothesis Testing.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Sampling Distributions and Hypothesis Testing. 2 Major Points An example An example Sampling distribution Sampling distribution Hypothesis testing Hypothesis.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Lecture 20: Single Sample Hypothesis Tests: Population Mean and Proportion Devore, Ch
STATISTICAL INFERENCE PART VII
Overview Basics of Hypothesis Testing
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Chapter 20 Testing hypotheses about proportions
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
AP Statistics February Coin Flipping Example  On a scrap paper record the results of my coin flips. 2.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
Lesson Testing Claims about a Population Proportion.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Math 4030 – 9a Introduction to Hypothesis Testing
Welcome to MM570 Psychological Statistics
Statistical Inference Making decisions regarding the population base on a sample.
© Copyright McGraw-Hill 2004
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Statistical Inference Making decisions regarding the population base on a sample.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Section 10.2: Tests of Significance Hypothesis Testing Null and Alternative Hypothesis P-value Statistically Significant.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Statistical Inference Making decisions regarding the population base on a sample.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Review and Preview and Basics of Hypothesis Testing
Hypothesis Testing: Hypotheses
CONCEPTS OF HYPOTHESIS TESTING
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
Confidence Intervals.
Presentation transcript:

The smokers’ proportion in H.K. is 40%. How to testify this claim ?

The smokers’ proportion in H.K. is 40%. What kind of hypothesis ? Hypothesis Testing Statistical hypothesis assumption about parameter of population, e.g. It’s impractical to investigate the whole population, we just select a sample and base on the information yielded to perform testing. This kind of testing is called

Two kinds of Hypothesis John: The coin I am holding is fair. May: I don’t believe. I want to test. Let p = probability that obtaining a head. We have two kinds of hypothesis. (1) The coin is fair, p = 0.5. (2) The coin is not fair, p  0.5. is called the null hypothesis. is called the alternative hypothesis. Denoted as H 0 Denoted as H 1 “null” means nothing, for this e.g., the coin is fair, nothing special. H 0 : p = 0.5 H 1 : p  0.5 We write

Two kinds of Testing H 0 :  =  0 H 1 :    0 Two-tailed test H 0 :  =  0 H 1 :  >  0 One-tailed test H 0 :  =  0 H 1 :  <  0 One-tailed test OR

Decision making Before testing, choose a level of significance,  For this level of significance, we can find a corresponding critical value z c. In AL, we need to test involving normal distribution only. Test statistic z = Estimate: a sample parameter to estimate that of population. e.g. sample mean is an estimate for population mean. Hence estimate is normally distributed in A.L. in general. z ~ N(0,1) What’s the relation between  and z c ?

N(0,1)  zczc  /2 zczc zczc N(0,1)  zczc For One-tailed Test OR For Two-tailed Test If the test statistic lies in the shaded region, we reject H 0, otherwise, accept. H 0 :  =  0 H 1 :  >  0 H 0 :  =  0 H 1 :  <  0 H 0 :  =  0 H 1 :    0

Test statistic z = The test is OK if population data has normal distribution and any sample size, population data has any distribution and sample size is large, (n > 30) OR

E.g. 39 Given : I.Q. of children is H.K. ~ N(100,20). A group of 62 children has mean I.Q. = Is the sample group more intelligent than population? Let x be the sample mean, then Given  (significance level) = 0.05 Estimate = Test statistic z = = 1.02 We want to test whether the group is more intelligent or not. The null hypothesis is: the group is nothing special!In symbol,H 0 : x = 100 H 1 : x > 100The alternative hypothesis is: the group is more intelligent!In symbol, This is a one-tailed test N(0,1)  zczc  = 0.05 By table, z c = Since 1.02 < 1.645, i.e. The test statistic z doesn’t lie in shaded region! z Conclusion: we accept H 0 at a significance level of 0.05 Caution: we are NOT saying that H 0 is true!!

E.g. 41 A lathe is adjusted so that dimension mean = 20 cm. A sample of 40 is selected and sample mean = 20.1 cm and s.d. = 0.2 cm. Do the results, test at 0.05 significance level, indicate that the machine is out of adjustment ? H 0 : the machine is nothing special, not out of adjustment. Let x = sample mean, then H 0 : x = 20 H 1 : the machine is special, it is out of adjustment. H 1 : x  20 Estimate = 20.1 Test statistic z = This is a two-tailed test N(0,1)  /2 zczc zczc = 3.16  = 0.05 By table, z c =1.96 Since 3.16 > 1.96, i.e. The test statistic z does lie in shaded region! z Conclusion: we reject H 0 at a significance level of 0.05 Caution: we are NOT saying that H 0 is untrue!!

x is a random variable, so is the test statistic z. The test statistic z is normally distributed. Reject H 0  H 0 is false Hence z can lie on any region in ( ,  ) z lies in the “rejecting region”, either N(0,1)  /2 zczc zczc z N( ,1) z  You’re lucky! z has small chance to lie in the region and you got it. However, the mean = 0 may be true, i.e. H 0 may be true! A OR The sample selected is ordinary. z doesn’t lie in extreme regions, indicating that the true mean is  (>0) i.e. the mean = 0 is untrue, i.e.H 0 is false! B In hypothesis testing, we adopt the concept B.

E.g. 46 In a chemical plant the acid content of the effluent from the factory is measured frequently. From 400 measurements the acid content in gram per 100 liters of effluent is recorded in the following frequency distribution. Acid content Frequency (a) Find the mean acid content and the standard error of the mean. = 14.2 s= (b) Assuming a normal distribution of acid content, give 95% confidence limits for the mean acid content of the effluent. 95% confidence limits for mean are s.e. = = and 14.28

(c) Is the result consistent with a mean acid content of g per 100 litres obtained from tests over several years ? H 0 :  = H 1 :   Estimate = 14.2 Test statistic z = = 1.72  = 0.05 This is a two-tailed test By table, z c = 1.96 Since 1.72 < 1.96, Conclusion: we accept H 0 at a significance level of 0.05 N(0,1)  /2 zczc zczc z Hence at 5% significant level, the result is consistent. Caution: The level of significance must be stated! It’s meaningless in saying accept or reject H 0 alone.

Test statistic z = The test is OK if sample size is large (n  30), and np  10 and nq  10  for small sample size, use binomial.  in calculation of P s, continuity correction must be made in passing from discrete to continuous variable.

Continuity Correction Sample size = n Success no. = m P s = m/n H 1 : p > p 0 adjusted P s = (m  0.5)/n H 1 : p < p 0 adjusted P s = (m+0.5)/n Believe me, I’ll tell you why later.

E.g. 48 Last month unemployed rate = 7.1 %. This month, someone discovers 350 are unemployed in a random sample of Has the unemployment decreased this month ?  = 0.05 H 0 : p = 7.1% H 1 : p < 7.1% P s = (Continuity correction) Test statistic z = =  =  = 0.05 By table, z c = Since  >  1.645, we accept H 0 at 5% level of significance. i.e. No change in unemployment rate this month. N(0,1)  zczc z

E.g. 49 A standard medication reduces pain in 80% of patients treated. A new medication for the same purpose produces 90 patients relieved among the first 100 tested. Does this new medication relieve more patients than before ? (  = 1%) H 0 : p = 80% H 1 : p > 80% N(0,1)  zczc P s == Test statistic z = =  = 0.01 z c = 2.33 z > z c Thus, reject H 0 at 1% level of significance. i.e. New medication relieves more patients than before.

E.g. 53 A manufacturer claims that less than 2% of the women use his birth control pill suffer from side effects. We have a feeling that this estimate is too low. We decide to test his claim at the 0.01 significance level using a sample of 900 randomly selected women. Find the decision rule. Let p = probability that a randomly selected user has side effect. H 0 : p = 2% H 1 : p > 2% N(0,1)  zczc  = 0.01 z c = 2.33 z = For rejecting H 0, set z > z c, > 2.33 z yielding P s > Let m = no. of “side effect” users in the sample, then m > 28.3 Hence the rule is: if there are more than 28 “side effect” users, we say that the claimed rate is too low, otherwise, fail to reject the claim.

E.g. 55 John tells you that he can control the tosses of fair coin. To see if he is right you take two ordinary coins and give him one. You toss one and ask him to toss the same thing. You repeat this experiment 18 times and John succeeds in tossing the same as you 15 times. Is John usual ?  = Note: the no. of trials = 18 is too small to use normal distribution.Note: we use binomial instead. Let p = probability that tossing the same as you. H 0 : p = 0.5 H 1 : p > 0.5 P(15 or above success) =(0.5) 18 [ 18 C C C C 18 ] = < 0.05 Conclusion: John is usual at 0.05 significance level. (The chance of having 15 or more success is very small!)

E.g. 57 A coin which is tossed n times comes up heads n times. What’s the min. n for which we can conclude at 0.01 significance level that the coin is not fair ?  Consider two cases: one-tailed and two-tailed. H 0 : p = 0.5 H 1 : p > 0.5 One-tailed test Let p = probability of getting head in each toss. X = no. of heads in n tosses, then X ~ B(n,0.5) n heads turn up. And we want to reject H 0 at  = 0.01 X = n should lie in the rejecting region. Thus P(X = n) < 0.01 min. n =7.  = nn-1n-2 X P(X)P(X) Similarly, for Two-tailed test, we set min. n =8.

Remarks On Continuity Correction For one-tailed testing about proportion. H 0 : p = p 0 H 1 : p > p 0 We consider sample size n with no. of success X. To decide whether we reject H 0 or not, Suppose that in a sample, we find m success out of n. we look into the chance that X is m or above, see whether this chance is too small. (i.e. less than  or not.) Hence we concern P(X  m) X is discrete and if we use continuous variable to approximate it, continuity correction must be carried out. That’s why we use the adjusted sample proportion (m  0.5)/n as the test statistic instead of m/n. P(X  m) = Try to “prove” the case for H 1 : p < p 0 on your own.

Sorry for the hurry lessons and I haven’t got much time on preparing these slides, please read the following on you own: E.g. 58, 61, 63, 65 Please do the following as your class work/ homework: 3(c) 4, 9, 11, 15, 22, 24, 26, 28, 31, 33, 36, 39 3(d) 1, 2, 3, 4, 17, 19, 25, 26, 27, 28, 34, 39