Lecture 10 Inference about the difference between population proportions (Chapter 13.6) One-way analysis of variance (Chapter 15.2)

Slides:



Advertisements
Similar presentations
1 Selected Sections of Chapters 22 and 24 Confidence Intervals for p 1 - p 2 and µ 1 - µ 2.
Advertisements

Lecture 11 One-way analysis of variance (Chapter 15.2)
i) Two way ANOVA without replication
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 12 Simple Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Chapter 3 Analysis of Variance
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
1 Inference about Comparing Two Populations Chapter 13.
Analysis of Variance Chapter 15 - continued Two-Factor Analysis of Variance - Example 15.3 –Suppose in Example 15.1, two factors are to be examined:
1 Inference about Comparing Two Populations Chapter 13.
Lecture 9 Inference about the ratio of two variances (Chapter 13.5)
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Inferences About Process Quality
Lecture 13: Tues., Feb. 24 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 13: Inference in Regression
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
Analysis of Variance Chapter 12 Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Analysis of Variance ( ANOVA )
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
Economics 173 Business Statistics Lectures 9 & 10 Summer, 2001 Professor J. Petry.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
Chapter 10 Analysis of Variance.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Chapter 15 Analysis of Variance ( ANOVA ). Analysis of Variance… Analysis of variance is a technique that allows us to compare two or more populations.
Chapter 13 Inference About Comparing Two Populations.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
CHAPTER 4 Analysis of Variance One-way ANOVA
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
1 Confidence Intervals for Two Proportions Section 6.1.
Confidence Intervals for µ 1 - µ 2 and p 1 - p 2 1.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
1 Economics 173 Business Statistics Lectures 5 & 6 Summer, 2001 Professor J. Petry.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Keller: Stats for Mgmt & Econ, 7th Ed Analysis of Variance
Confidence Intervals for p1 - p2 and µ1 - µ2
Inference about Two Populations
Inference about Comparing Two Populations
i) Two way ANOVA without replication
Statistics Analysis of Variance.
Statistics for Business and Economics (13e)
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Econ 3790: Business and Economic Statistics
Chapter 14: Analysis of Variance One-way ANOVA Lecture 8
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Lecture 10 Inference about the difference between population proportions (Chapter 13.6) One-way analysis of variance (Chapter 15.2)

Testing p 1 – p 2 There are two cases to consider: Case 1: H 0 : p 1 -p 2 =0 Calculate the pooled proportion Then Case 2: H 0 : p 1 -p 2 =D (D is not equal to 0) Do not pool the data

Example 13.9 (Revisit Example 13.8) –Management needs to decide which of two new packaging designs to adopt, to help improve sales of a certain soap. –A study is performed in two supermarkets: –For the brightly-colored design to be financially viable it has to outsell the simple design by at least 3%. Testing p 1 – p 2

Solution –The hypotheses to test are H 0 : p 1 - p 2 =.03 H 1 : p 1 - p 2 >.03 –We identify this application as case 2 (the hypothesized difference is not equal to zero). Testing p 1 – p 2 (Case 2)

Compute: Manually The rejection region is z > z  = z.05 = Conclusion: Since 1.15 < do not reject the null hypothesis. There is insufficient evidence to infer that the brightly-colored design will outsell the simple design by 3% or more. Testing p 1 – p 2 (Case 2)

Confidence Interval for confidence interval :

Estimating p 1 – p 2 Estimating the cost of life saved –Two drugs are used to treat heart attack victims: Streptokinase (available since 1959, costs $460) t-PA (genetically engineered, costs $2900). –The maker of t-PA claims that its drug outperforms Streptokinase. –An experiment was conducted in 15 countries. 20,500 patients were given t-PA 20,500 patients were given Streptokinase The number of deaths by heart attacks was recorded.

Experiment results –A total of 1497 patients treated with Streptokinase died. –A total of 1292 patients treated with t-PA died. Estimate the cost per life saved by using t-PA instead of Streptokinase. Estimating p 1 – p 2

Interpretation –We estimate that between.51% and 1.49% more heart attack victims will survive because of the use of t-PA. –The difference in cost per life saved is = $2440. –The cost per life saved by switching to t-PA is estimated to be between 2440/.0149 = $163,758 and 2440/.0051 = $478,431 Estimating p 1 – p 2

15.2 One-way ANOVA Analysis of variance compares two or more populations of interval data. Specifically, we are interested in determining whether differences exist between the population means. We obtain independent samples from each population. Generalization of two sample problem to two or more populations

Examples Compare the effect of three different teaching methods on test scores. Compare the effect of four different therapies on how long a cancer patient lives. Compare the effect of using different amounts of fertilizer on the yield of a crop. Compare the amount of time that ten different tire brands last.

Example 15.1 –An apple juice manufacturer is planning to develop a new product -a liquid concentrate. –The marketing manager has to decide how to market the new product. –Three strategies are considered Emphasize convenience of using the product. Emphasize the quality of the product. Emphasize the product’s low price. One Way Analysis of Variance

Example continued –An experiment was conducted as follows: In three cities an advertisement campaign was launched. In each city only one of the three characteristics (convenience, quality, and price) was emphasized. The weekly sales were recorded for twenty weeks following the beginning of the campaigns. One Way Analysis of Variance

See file Xm Weekly sales

Solution –The data are interval. –The problem objective is to compare sales in three cities. –We hypothesize that the three population means are equal. One Way Analysis of Variance

H 0 :  1 =  2 =  3 H 1 : At least two means differ To build the statistic needed to test the hypotheses use the following notation: Solution Defining the Hypotheses

Independent samples are drawn from k populations (treatments). 12k X 11 x 21. X n1,1 X 12 x 22. X n2,2 X 1k x 2k. X nk,k Sample size Sample mean First observation, first sample Second observation, second sample X is the “response variable”. The variables’ value are called “responses”. Notation

Terminology In the context of this problem… Response variable – weekly sales Responses – actual sale values Experimental unit – weeks in the three cities when we record sales figures. Factor – the criterion by which we classify the populations (the treatments). In this problems the factor is the marketing strategy. Factor levels – the population (treatment) names. In this problem factor levels are the marketing strategies.

Rationale Behind Test Statistic Two types of variability are employed when testing for the equality of population means –Variability of the sample means –Variability within samples Test statistic is essentially (Variability of the sample means)/(Variability within samples)

The rationale behind the test statistic – I If the null hypothesis is true, we would expect all the sample means to be close to one another (and as a result, close to the grand mean). If the alternative hypothesis is true, at least some of the sample means would differ. Thus, we measure variability between sample means.

The variability between the sample means is measured as the sum of squared distances between each mean and the grand mean. This sum is called the Sum of Squares for Treatments SST In our example treatments are represented by the different advertising strategies. Variability between sample means

There are k treatments The size of sample j The mean of sample j Sum of squares for treatments (SST) Note: When the sample means are close to one another, their distance from the grand mean is small, leading to a small SST. Thus, large SST indicates large variation between sample means, which supports H 1.

Solution – continued Calculate SST = 20( ) ( ) ( ) 2 = = 57, The grand mean is calculated by Sum of squares for treatments (SST)

Is SST = 57, large enough to reject H 0 in favor of H 1 ? Large compared to what? Sum of squares for treatments (SST)

Treatment 1Treatment 2 Treatment Treatment 1Treatment 2Treatment The sample means are the same as before, but the larger within-sample variability makes it harder to draw a conclusion about the population means. A small variability within the samples makes it easier to draw a conclusion about the population means.

Large variability within the samples weakens the “ability” of the sample means to represent their corresponding population means. Therefore, even though sample means may markedly differ from one another, SST must be judged relative to the “within samples variability”. The rationale behind test statistic – II

The variability within samples is measured by adding all the squared distances between observations and their sample means. This sum is called the Sum of Squares for Error SSE In our example this is the sum of all squared differences between sales in city j and the sample mean of city j (over all the three cities). Within samples variability

Solution – continued Calculate SSE Sum of squares for errors (SSE)  (n 1 - 1)s (n 2 -1)s (n 3 -1)s 3 2 = (20 -1)10, (20 -1)7, (20-1)8, = 506,983.50

Is SST = 57, large enough relative to SSE = 506, to reject the null hypothesis that specifies that all the means are equal? Sum of squares for errors (SSE)

mean squares To perform the test we need to calculate the mean squares as follows: The mean sum of squares Calculation of MST - M ean S quare for T reatments Calculation of MSE M ean S quare for E rror

Calculation of the test statistic with the following degrees of freedom: v 1 =k -1 and v 2 =n-k Required Conditions: 1. The populations tested are normally distributed. 2. The variances of all the populations tested are equal.

And finally the hypothesis test: H 0 :  1 =  2 = …=  k H 1 : At least two means differ Test statistic: R.R: F>F ,k-1,n-k The F test rejection region

The F test H o :  1 =  2 =  3 H 1 : At least two means differ Test statistic F= MST  MSE= 3.23 Since 3.23 > 3.15, there is sufficient evidence to reject H o in favor of H 1, and argue that at least one of the mean sales is different than the others.