4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.


4-1 Statistical Inference
Statistical inference is the process of making decisions or drawing conclusions about a population using the information contained in a sample from that population. Its two major areas are:
1. Parameter estimation
2. Hypothesis testing

4-2 Point Estimation
A point estimator is a statistic; a point estimate is an observed value of a point estimator.

4-2 Interval Estimation - Confidence Interval

Note: L (lower confidence limit) and U (upper confidence limit) are statistics and hence random variables.
Ex) Confidence level = 95% (1 - α = 0.95)
=> P(C.I. will contain the true parameter) = 0.95
=> 95% of all the C.I.s constructed this way will contain the true parameter.
The general formula for all confidence intervals is:
Point Estimate ± (Critical Value)(Standard Error)
- Upper confidence limit: U = Point Estimate + Critical Value × S.E.
- Lower confidence limit: L = Point Estimate - Critical Value × S.E.
- Width of confidence interval: U - L = 2 × Critical Value × S.E.

Confidence Interval for a Mean μ when the variance σ² is known
Assumptions
- Population standard deviation σ is known
- Population is normally distributed
- If the population is not normal, use a large sample (CLT)
100(1 - α)% (two-sided) confidence interval for μ:
$\bar{X} - z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}} \le \mu \le \bar{X} + z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}$, or $\bar{X} \pm z_{\alpha/2}\,\dfrac{\sigma}{\sqrt{n}}$
(where $z_{\alpha/2}$ is the standard normal critical value for a probability of α/2 in each tail)

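100(1 - α)% upper-confidence bound for μ: $\mu \le \bar{X} + z_{\alpha}\,\dfrac{\sigma}{\sqrt{n}}$
100(1 - α)% lower-confidence bound for μ: $\bar{X} - z_{\alpha}\,\dfrac{\sigma}{\sqrt{n}} \le \mu$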

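Critical Value: $z_{\alpha/2}$
Consider a 95% confidence interval: 1 - α = 0.95, so α/2 = 0.025 in each tail.
In Z units, the lower critical value is $z_{1-\alpha/2} = -1.96$ and the upper critical value is $z_{\alpha/2} = 1.96$; in X units these correspond to the lower and upper confidence limits.
Commonly used confidence levels are 90%, 95%, and 99%.
Note: $z_{1-\alpha/2} = -z_{\alpha/2}$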

Example
A sample of 11 circuits from a normal population has a mean resistance of 2.20 ohms. We know from past testing that the population standard deviation is 0.35 ohms. Determine a 95% confidence interval for the true mean resistance of the population.
$\bar{X} \pm z_{0.025}\,\dfrac{\sigma}{\sqrt{n}} = 2.20 \pm 1.96\,\dfrac{0.35}{\sqrt{11}} = 2.20 \pm 0.2068$, which gives (1.9932, 2.4068).
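
The circuit example can be checked numerically. The following is a minimal sketch using only the Python standard library; the function name `z_confidence_interval` is a hypothetical helper introduced here for illustration, not something from the slides.

```python
from math import sqrt
from statistics import NormalDist


def z_confidence_interval(xbar, sigma, n, confidence=0.95):
    """Two-sided confidence interval for a mean when sigma is known."""
    alpha = 1.0 - confidence
    # z_{alpha/2}: upper-tail critical value of the standard normal (about 1.96 for 95%)
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    margin = z_crit * sigma / sqrt(n)  # critical value times standard error
    return xbar - margin, xbar + margin


# Data from the circuit example: n = 11, mean 2.20 ohms, sigma = 0.35 ohms
lower, upper = z_confidence_interval(xbar=2.20, sigma=0.35, n=11)
print(f"95% CI: ({lower:.4f}, {upper:.4f})")
```

The lower limit agrees with the 1.9932 shown on the slide; the width of the interval is twice the margin, as in the general formula.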

Confidence Interval for a Mean μ when the variance σ² is unknown
If the population standard deviation σ is unknown, we can substitute the sample standard deviation, S. This introduces extra uncertainty, since S varies from sample to sample => use the t distribution instead of the normal distribution.
Assumptions
- Population standard deviation σ is unknown
- Population is normally distributed
- If the population is not normal, use a large sample
100(1 - α)% confidence interval for μ:
$\bar{X} - t_{\alpha/2,\,n-1}\,\dfrac{S}{\sqrt{n}} \le \mu \le \bar{X} + t_{\alpha/2,\,n-1}\,\dfrac{S}{\sqrt{n}}$, or $\bar{X} \pm t_{\alpha/2,\,n-1}\,\dfrac{S}{\sqrt{n}}$
(where $t_{\alpha/2,\,n-1}$ is the critical value of the t distribution with n - 1 d.f. and an area of α/2 in each tail)

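100(1 - α)% upper-confidence bound for μ: $\mu \le \bar{X} + t_{\alpha,\,n-1}\,\dfrac{S}{\sqrt{n}}$
100(1 - α)% lower-confidence bound for μ: $\bar{X} - t_{\alpha,\,n-1}\,\dfrac{S}{\sqrt{n}} \le \mu$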

Student's t Distribution
[Figure: density curves of t (df = 5), t (df = 13), and the standard normal (t with df = ∞), all centered at 0]
- t-distributions are symmetric and bell shaped but have heavier tails than the normal distribution.
- The t value depends on the degrees of freedom (d.f.).
- As d.f. goes to infinity, the t-distribution approaches N(0, 1²).

Table of the t-distribution

Example
A random sample of n = 25 has sample mean 50 and sample variance 8. Form a 95% confidence interval for μ.
- d.f. = n - 1 = 24, so $t_{\alpha/2,\,n-1} = t_{0.025,\,24} = 2.0639$
- The confidence interval is $50 \pm 2.0639\,\sqrt{8/25} = 50 \pm 1.168$, that is, (48.832, 51.168)
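
A short sketch of the same calculation follows. The Python standard library has no t-distribution quantile function, so the critical value t(0.025, 24) = 2.0639 is entered by hand from a standard t table; that constant and the helper name `t_confidence_interval` are assumptions of this sketch.

```python
from math import sqrt

# Assumption: critical value read from a standard t table, t_{0.025, 24} = 2.0639
T_CRIT_025_24 = 2.0639


def t_confidence_interval(xbar, sample_var, n, t_crit):
    """Two-sided confidence interval for a mean when sigma is unknown."""
    standard_error = sqrt(sample_var / n)  # S / sqrt(n), written with the variance S^2
    margin = t_crit * standard_error
    return xbar - margin, xbar + margin


# Data from the example: n = 25, sample mean 50, sample variance 8
lower, upper = t_confidence_interval(xbar=50.0, sample_var=8.0, n=25, t_crit=T_CRIT_025_24)
print(f"95% CI: ({lower:.3f}, {upper:.3f})")
```

With a statistics package available, the table lookup could be replaced by a quantile call, but the arithmetic is otherwise identical.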

(16.457, 17.483)

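Confidence Intervals for the variance of a normal population

100(1 - α)% confidence interval for σ²: $\dfrac{(n-1)S^2}{\chi^2_{\alpha/2,\,n-1}} \le \sigma^2 \le \dfrac{(n-1)S^2}{\chi^2_{1-\alpha/2,\,n-1}}$
100(1 - α)% upper-confidence bound for σ²: $\sigma^2 \le \dfrac{(n-1)S^2}{\chi^2_{1-\alpha,\,n-1}}$
100(1 - α)% lower-confidence bound for σ²: $\dfrac{(n-1)S^2}{\chi^2_{\alpha,\,n-1}} \le \sigma^2$
(where $\chi^2_{\alpha,\,n-1}$ is the chi-square critical value with n - 1 d.f. and an area of α in the upper tail)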

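Confidence Intervals for the Population Proportion, p

100(1 - α)% confidence interval for p: $\hat{p} - z_{\alpha/2}\sqrt{\dfrac{\hat{p}(1-\hat{p})}{n}} \le p \le \hat{p} + z_{\alpha/2}\sqrt{\dfrac{\hat{p}(1-\hat{p})}{n}}$
100(1 - α)% upper-confidence bound for p: $p \le \hat{p} + z_{\alpha}\sqrt{\dfrac{\hat{p}(1-\hat{p})}{n}}$
100(1 - α)% lower-confidence bound for p: $\hat{p} - z_{\alpha}\sqrt{\dfrac{\hat{p}(1-\hat{p})}{n}} \le p$
(where $\hat{p}$ is the sample proportion)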

[Example] A random sample of 100 people shows that 25 wear glasses. Form a 95% confidence interval for the true proportion of the population who wear glasses.
$\hat{p} \pm z_{0.025}\sqrt{\dfrac{\hat{p}(1-\hat{p})}{n}} = 0.25 \pm 1.96\sqrt{\dfrac{(0.25)(0.75)}{100}} = 0.25 \pm 0.0849$, which gives (0.1651, 0.3349).
Note: We are 95% confident that the true percentage of people wearing glasses in the population is between 16.51% and 33.49%. Although the interval from 0.1651 to 0.3349 may or may not contain the true proportion, 95% of intervals formed from samples of size 100 in this manner will contain the true proportion.
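
The glasses example can be reproduced with a few lines of standard-library Python. This is a sketch of the large-sample (normal approximation) interval only; the function name `proportion_confidence_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def proportion_confidence_interval(successes, n, confidence=0.95):
    """Large-sample two-sided confidence interval for a population proportion."""
    p_hat = successes / n
    alpha = 1.0 - confidence
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)  # z_{alpha/2}
    standard_error = sqrt(p_hat * (1.0 - p_hat) / n)
    margin = z_crit * standard_error
    return p_hat - margin, p_hat + margin


# 25 of 100 sampled people wear glasses
lower, upper = proportion_confidence_interval(successes=25, n=100)
print(f"95% CI for p: ({lower:.4f}, {upper:.4f})")
```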

4-3 Hypothesis Testing
Statistical Hypotheses
A (statistical) hypothesis is a statement or claim about a population parameter (not about a sample statistic).
Ex) The mean electric bill per household of this city is μ = $132. The proportion of adults in this city with full-time jobs is p = 0.61.
Hypothesis testing is a procedure leading to a decision about a hypothesis based on a random sample.
- The null hypothesis (H0) states the assumption to be tested. A hypothesis test begins with the assumption that H0 is true.
- The alternative hypothesis (H1) is the opposite of the null hypothesis. It is the hypothesis that the researcher is trying to prove.
Ex) H0: The mean age of smart phone users is 28. (H0: μ = 28)
H1: The mean age of smart phone users is not 28. (H1: μ ≠ 28)

Example - Insight into Hypothesis Testing
Suppose that we are interested in the burning rate of a solid propellant used to power aircrew escape systems, and that our interest focuses on the mean burning rate μ (a parameter of the distribution of the burning rate).
Two-sided alternative hypothesis: if we are interested in deciding whether or not the mean burning rate is 50 centimeters per second,
H0: μ = 50 cm/s
H1: μ ≠ 50 cm/s
One-sided alternative hypothesis: if we are trying to prove that the mean burning rate is less than 50 centimeters per second,
H0: μ = 50 cm/s
H1: μ < 50 cm/s
Note: If H1: μ < 50 cm/s, then we can write the null hypothesis as H0: μ = 50 cm/s or H0: μ ≥ 50 cm/s. Both expressions lead to the same testing procedure and the same decision.

4-3.2 Testing Statistical Hypotheses
Hypothesis-testing procedures rely on using the information in a random sample from the population of interest. If this information is consistent with the hypothesis, then we will conclude that the hypothesis is true; if this information is inconsistent with the hypothesis, we will conclude that the hypothesis is false.
Sample the population and find the sample mean. Suppose the sample mean was X̄ = 20. This is much lower than the claimed population mean of 50. If the null hypothesis were true, the probability of getting a sample mean this far from 50 would be very small, so you reject the null hypothesis. In other words, a sample mean of 20 is so unlikely if the population mean were 50 that you conclude the population mean must not be 50.

[Figure: sampling distribution of X̄ if H0 is true (μ = 50), with the observed sample mean X̄ = 20 marked]

The Test Statistic and Rejection Region
If the sample mean is close to the assumed population mean, the null hypothesis is not rejected. If the sample mean is far from the assumed population mean, the null hypothesis is rejected. How far is "far enough" to reject H0?
- A test statistic is a statistic computed from the sample data to make a decision about the hypothesis. Ex) sample mean, sample variance, sample proportion, etc.
- If the test statistic value falls in the rejection region, we will reject H0.
- The boundaries that define the rejection regions are called the critical values.
[Figure: distribution of the test statistic, with the critical values and the rejection region marked]

How to decide the rejection region (critical values)?
- Lower-tail test: H0: μ ≥ 50, H1: μ < 50 (rejection region of area α in the lower tail)
- Upper-tail test: H0: μ ≤ 50, H1: μ > 50 (rejection region of area α in the upper tail)
- Two-tail test: H0: μ = 50, H1: μ ≠ 50 (rejection regions of area α/2 in each tail)
[Figure: three density curves with the critical values marked and the rejection regions shaded]
The critical values are decided by
i) the distribution of the test statistic
ii) the significance level α (see next page)

Errors in Decision Making
The conclusion from a hypothesis test may be an error, since it is based on a random sample (random experiment).
Type I Error
- Rejecting the null hypothesis when it is true.
- The probability of a Type I error is called the significance level or size of the test, denoted by α.
- The significance level is usually set by researchers in advance.
Type II Error
- Failing to reject the null hypothesis when it is false.
- The probability of a Type II error is denoted by β.
- 1 - β is called the power of the test.

Decision           | Actual situation: H0 true      | Actual situation: H0 false
Do not reject H0   | No error (probability 1 - α)   | Type II error (probability β)
Reject H0          | Type I error (probability α)   | No error (probability 1 - β)
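
To make α, β, and power concrete, the sketch below computes them for a two-sided Z-test of H0: μ = 50. The values σ = 2.5, n = 10, and the "true" mean of 52 are hypothetical numbers chosen only for illustration (they do not appear in the slides), and `two_sided_z_test_errors` is a hypothetical helper name.

```python
from math import sqrt
from statistics import NormalDist


def two_sided_z_test_errors(mu0, mu_true, sigma, n, alpha=0.05):
    """Return (beta, power) of a two-sided Z-test when the true mean is mu_true."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1.0 - alpha / 2.0)
    # Under the true mean, Z0 is normal with this mean and variance 1
    shift = (mu_true - mu0) / (sigma / sqrt(n))
    # beta = P(-z_crit <= Z0 <= z_crit), the probability of failing to reject H0
    beta = std_normal.cdf(z_crit - shift) - std_normal.cdf(-z_crit - shift)
    return beta, 1.0 - beta


# Hypothetical setting: H0: mu = 50, sigma = 2.5, n = 10, true mean 52
beta, power = two_sided_z_test_errors(mu0=50.0, mu_true=52.0, sigma=2.5, n=10)
print(f"beta = {beta:.4f}, power = {power:.4f}")

# When H0 is actually true, the chance of not rejecting is 1 - alpha
beta_under_h0, type_i_rate = two_sided_z_test_errors(50.0, 50.0, 2.5, 10)
print(f"P(do not reject | H0 true) = {beta_under_h0:.4f}")
```

The second call illustrates the left column of the decision table: when H0 is true, the test rejects with probability α and does not reject with probability 1 - α.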

Hypothesis Testing Procedure Using the Rejection Region
1. State the null hypothesis, H0, and the alternative hypothesis, H1.
2. Choose the significance level, α.
3. Determine the test statistic to use (convert the sample statistic, e.g. X̄, to a test statistic, e.g. the Z-statistic).
4. Find the critical values and determine the rejection region(s).
5. Collect data and compute the test statistic value from the sample result.
6. Compare the test statistic to the critical value to determine whether it falls in the rejection region, and make the statistical decision: reject H0 if the test statistic falls in the rejection region; otherwise do not reject H0.

4-3.3 P-Values in Hypothesis Testing
The p-value is the probability of obtaining a test statistic equal to or more extreme than the observed sample value when H0 is true. It is sometimes referred to as "the observed level of significance" or "the smallest value of α for which H0 can be rejected".
The p-value measures the plausibility of the null hypothesis, H0: "The smaller the p-value, the less plausible is the null hypothesis."

Hypothesis Testing Procedure Using the P-value
1. State the null hypothesis, H0, and the alternative hypothesis, H1.
2. Choose the significance level, α.
3. Determine the test statistic to use (convert the sample statistic, e.g. X̄, to a test statistic, e.g. the Z-statistic).
4. Collect data and compute the test statistic from the sample result.
5. Obtain the p-value from a distribution table of the test statistic (or by using Excel, Minitab, etc.).
6. Compare the p-value with α: if p-value < α, reject H0; if p-value ≥ α, do not reject H0.

Hypothesis Testing on the Mean

4-4 Inference on the Mean of a Population, Variance Known Assumptions

4-4.1 Hypothesis Testing on the Mean, Variance Known

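Ex: Hypothesis Testing: σ Known, two-sided
H0: μ = μ0 vs. H1: μ ≠ μ0
- Convert the sample statistic (X̄) to the test statistic $Z_0 = \dfrac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}$.
- Determine the critical Z values for the specified level of significance α: $\pm z_{\alpha/2}$.
- Decision rule: if the test statistic falls in the rejection region, reject H0; otherwise do not reject H0.
[Figure: standard normal curve centered at 0 (X̄ scale centered at μ0), with "Reject H0" regions of area α/2 below the lower critical value -z_{α/2} and above the upper critical value +z_{α/2}, and "Do not reject H0" between them]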

Example
To test the claim that the mean weight of chocolate bars manufactured in a factory is 3 ounces, we weighed 100 chocolate bars and the average weight was 2.84. Suppose that, from past records, the standard deviation is known to be 0.8.
1) State the null and alternative hypotheses: H0: μ = 3, H1: μ ≠ 3 (two-sided test)
2) Choose the desired level of significance: suppose that α = 0.05 is chosen for this test.
3) Determine the test statistic: σ is known, so this is a Z-test.
4) Find the critical values and determine the rejection region(s): for α = 0.05, the critical Z-values are ±1.96. Reject H0 if z0 < -1.96 or z0 > 1.96.
5) Compute the test statistic: $z_0 = \dfrac{2.84 - 3}{0.8/\sqrt{100}} = -2.0$
6) Reach a decision and interpret the result: since z0 = -2.0 < -1.96, you reject the null hypothesis. (That is, there is sufficient evidence that the mean weight of chocolate bars is not equal to 3.)
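
The rejection-region decision for the chocolate bar example can be sketched as below, using the sample mean 2.84 and standard deviation 0.8 given for this example. The helper name `z_statistic` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_statistic(xbar, mu0, sigma, n):
    """Test statistic for a mean when sigma is known."""
    return (xbar - mu0) / (sigma / sqrt(n))


alpha = 0.05
# Two-sided test: critical values are -z_crit and +z_crit (about +/-1.96)
z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)

# Chocolate bar data: n = 100, sample mean 2.84, sigma = 0.8, H0: mu = 3
z0 = z_statistic(xbar=2.84, mu0=3.0, sigma=0.8, n=100)
reject_h0 = abs(z0) > z_crit  # falls in the rejection region?

print(f"z0 = {z0:.2f}, critical values = +/-{z_crit:.2f}, reject H0: {reject_h0}")
```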

Example - revisit
To test the claim that the mean weight of chocolate bars manufactured in a factory is 3 ounces, we weighed 100 chocolate bars and the average weight was 2.84. Suppose that, from past records, the standard deviation is known to be 0.8. Test at α = 0.05 using the p-value.
X̄ = 2.84 is translated to a Z score of z0 = -2.0.
p-value = 2P(Z > |z0|) = 2P(Z > 2.0) = 2 × 0.0228 = 0.0456
Since p-value = 0.0456 < α (= 0.05), we reject the null hypothesis.
[Figure: standard normal curve with tail areas of 0.0228 beyond ±2.0, compared with α/2 = 0.025 beyond ±z_{α/2} = ±1.96]
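
The two-sided p-value for the chocolate bar data can be computed directly instead of read from a table. This is a minimal sketch; `two_sided_p_value` is a hypothetical helper name, and the exact result differs slightly from the table-based 2 × 0.0228 because the table value is rounded.

```python
from math import sqrt
from statistics import NormalDist


def two_sided_p_value(xbar, mu0, sigma, n):
    """p-value of a two-sided Z-test: 2 * P(Z > |z0|)."""
    z0 = (xbar - mu0) / (sigma / sqrt(n))
    upper_tail = 1.0 - NormalDist().cdf(abs(z0))
    return 2.0 * upper_tail


alpha = 0.05
p_value = two_sided_p_value(xbar=2.84, mu0=3.0, sigma=0.8, n=100)
reject_h0 = p_value < alpha

print(f"p-value = {p_value:.4f}, reject H0 at alpha = {alpha}: {reject_h0}")
```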

Example
A phone industry manager thinks that customer monthly cell phone bills have increased, and now average more than $52 per month. Past company records indicate that the standard deviation is about $10. He collects a sample of n = 64, and the sample mean is 53.1. Test this claim at α = 0.10.
1) H0: μ ≤ 52 vs. H1: μ > 52
2) Test statistic: Z0 = (53.1 - 52) / (10 / √64) = 1.1 / 1.25 = 0.88
3) Rejection region: critical value = 1.28. If Z0 > 1.28, then reject H0.
4) Since Z0 = 0.88 < 1.28, we cannot reject H0.
5) We cannot say that the mean bill is greater than $52.
[Figure: standard normal curve with the upper-tail rejection region of area α = .10, the remaining area 1 - α = .90, and Z0 = .88 inside the non-rejection region]

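P-value method: let's calculate the p-value and compare it to α.
p-value = P(Z > 0.88) = 1 - 0.8106 = 0.1894
We do not reject H0, since p-value = 0.1894 > α (= 0.10).
[Figure: standard normal curve with the upper-tail area beyond Z = .88 (the p-value) larger than the rejection region of area α = .10]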
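
Both versions of the upper-tail test for the phone bill example (critical value and p-value) can be sketched in a few lines. The function name `upper_tail_z_test` is a hypothetical helper, not part of the slides.

```python
from math import sqrt
from statistics import NormalDist


def upper_tail_z_test(xbar, mu0, sigma, n, alpha):
    """Upper-tail Z-test (H1: mu > mu0). Returns (z0, critical value, p-value)."""
    std_normal = NormalDist()
    z0 = (xbar - mu0) / (sigma / sqrt(n))
    z_crit = std_normal.inv_cdf(1.0 - alpha)  # all of alpha sits in the upper tail
    p_value = 1.0 - std_normal.cdf(z0)        # P(Z > z0)
    return z0, z_crit, p_value


# Phone bill data: n = 64, sample mean 53.1, sigma = 10, H0: mu <= 52, alpha = 0.10
alpha = 0.10
z0, z_crit, p_value = upper_tail_z_test(xbar=53.1, mu0=52.0, sigma=10.0, n=64, alpha=alpha)

reject_by_region = z0 > z_crit
reject_by_p_value = p_value < alpha
print(f"z0 = {z0:.2f}, critical value = {z_crit:.2f}, p-value = {p_value:.4f}")
print(f"reject H0: {reject_by_region} (rejection region), {reject_by_p_value} (p-value)")
```

The two decision rules always agree, since z0 > z_crit exactly when the upper-tail area beyond z0 is smaller than α.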

4-5 Inference on the Mean of a Population, Variance Unknown
Student's t Distribution
- t-distributions are symmetric and bell shaped but have heavier tails than the normal distribution.
- The t value depends on the degrees of freedom (d.f.).
- As d.f. goes to infinity, the t-distribution approaches N(0, 1²).

Hypothesis Testing on the Mean, Variance Unknown
Assumptions
- Population standard deviation is unknown
- Population is normally distributed
- If the population is not normal, use a large sample

Calculating the P-value

Example
The mean cost of a hotel room in LA is said to be $168 per night. A random sample of 25 hotels resulted in X̄ = [...] and S = [...]. Test at the α = 0.05 level, assuming the data are normally distributed.
H0: μ = 168
H1: μ ≠ 168
σ is unknown, so use a t-statistic.
Critical values: t(0.025, 24) = ±2.0639
Reject H0 if t0 > 2.0639 or t0 < -2.0639.
Since t0 does not fall in the rejection region, we cannot reject H0.

Relationship between Tests of Hypotheses and Confidence Intervals
- For H0: μ = μ0 vs. H1: μ ≠ μ0: the test of significance level α will lead to rejection of H0 <=> the hypothesized value μ0 is not in the 100(1 - α) percent confidence interval [l, u].
- For H0: μ = μ0 vs. H1: μ < μ0: the test of significance level α will lead to rejection of H0 <=> the hypothesized value μ0 is not in the 100(1 - α) percent confidence interval (-∞, u].
- For H0: μ = μ0 vs. H1: μ > μ0: the test of significance level α will lead to rejection of H0 <=> the hypothesized value μ0 is not in the 100(1 - α) percent confidence interval [l, ∞).
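
The two-sided equivalence can be checked numerically. The sketch below reuses the chocolate bar data (X̄ = 2.84, σ = 0.8, n = 100) and compares the test decision with the confidence-interval decision for several hypothesized means; the candidate values of μ0 and both function names are hypothetical choices made for illustration.

```python
from math import sqrt
from statistics import NormalDist


def rejects_by_test(xbar, mu0, sigma, n, alpha):
    """Two-sided Z-test decision: reject H0 when |z0| > z_{alpha/2}."""
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z0 = (xbar - mu0) / (sigma / sqrt(n))
    return abs(z0) > z_crit


def rejects_by_interval(xbar, mu0, sigma, n, alpha):
    """Reject H0 when mu0 lies outside the 100(1 - alpha)% interval [l, u]."""
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    margin = z_crit * sigma / sqrt(n)
    lower, upper = xbar - margin, xbar + margin
    return not (lower <= mu0 <= upper)


# Chocolate bar data; the list of hypothesized means is illustrative only
xbar, sigma, n, alpha = 2.84, 0.8, 100, 0.05
decisions = {}
for mu0 in (2.6, 2.7, 2.9, 3.0, 3.1):
    decisions[mu0] = (
        rejects_by_test(xbar, mu0, sigma, n, alpha),
        rejects_by_interval(xbar, mu0, sigma, n, alpha),
    )
    print(f"mu0 = {mu0}: test rejects = {decisions[mu0][0]}, "
          f"outside CI = {decisions[mu0][1]}")
```

For μ0 = 3 both rules reject, which matches the earlier conclusion for the chocolate bar example.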

4-6 Inference on the Variance of a Normal Population Hypothesis Testing on the Variance of a Normal Population

4-7 Inference on Population Proportion Hypothesis Testing on a Binomial Proportion We will consider testing: