Five types of statistical analysis

Slides:

Advertisements

Similar presentations

Five types of statistical analysis

Advertisements

Descriptive Statistics

EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.

1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.

Example 10.1a Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Hypothesis Tests for a Population Mean.

Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.

Inferences About Process Quality

Today Concepts underlying inferential statistics

5-3 Inference on the Means of Two Populations, Variances Unknown

Hypothesis Testing Using The One-Sample t-Test

Hypothesis Testing.

Inferential Statistics

1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.

Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.

Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.

Inferential Statistics

Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.

Introduction to Hypothesis Testing

Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.

AM Recitation 2/10/11.

Hypothesis Testing:.

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.

Overview of Statistical Hypothesis Testing: The z-Test

Chapter 10 Hypothesis Testing

Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.

Confidence Intervals and Hypothesis Testing - II

Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,

Fundamentals of Hypothesis Testing: One-Sample Tests

Hypothesis testing – mean differences between populations

Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.

Statistical Analysis Statistical Analysis

Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.

1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.

Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.

Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.

Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.

Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.

Chapter 15 Data Analysis: Testing for Significant Differences.

Chapter 9: Testing Hypotheses

1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:

t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.

Testing of Hypothesis Fundamentals of Hypothesis.

Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.

Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.

Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?

Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.

Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.

McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.

1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.

1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.

Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.

Information Technology and Decision Making Information Technology and Decision Making Example 10.1 Experimenting with a New Pizza Style at the Pepperoni.

Chapter 8 Parameter Estimates and Hypothesis Testing.

Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.

© Copyright McGraw-Hill 2004

Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.

1 Testing Statistical Hypothesis The One Sample t-Test Heibatollah Baghi, and Mastee Badii.

Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.

Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.

Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.

 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.

Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.

Chapter 10: The t Test For Two Independent Samples.

Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.

Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.

Presentation transcript:

Five types of statistical analysis Descriptive What are the characteristics of the respondents? Inferential What are the characteristics of the population? Differences Are two or more groups the same or different? Associative Are two or more variables related in a systematic way? Predictive Can we predict one variable if we know one or more other variables?

General Procedure for Hypothesis Test Formulate H0 (null hypothesis) and H1 (alternative hypothesis) Select appropriate test Choose level of significance Calculate the test statistic (SPSS) Determine the probability associated with the statistic. Determine the critical value of the test statistic.

General Procedure for Hypothesis Test a) Compare with the level of significance,  b) Determine if the critical value falls in the rejection region. (check tables) Reject or do not reject H0 Draw a conclusion

1. Formulate H1and H0 The hypothesis the researcher wants to test is called the alternative hypothesis H1. The opposite of the alternative hypothesis is the null hypothesis H0 (the status quo)(no difference between the sample and the population, or between samples). The objective is to DISPROVE the null hypothesis. The Significance Level is the Critical probability of choosing between the null hypothesis and the alternative hypothesis

2. Select Appropriate Test The selection of a proper Test depends on: Scale of the data nominal interval the statistic you seek to compare Proportions (percentages) means the sampling distribution of such statistic Normal Distribution T Distribution 2 Distribution Number of variables Univariate Bivariate Multivariate Type of question to be answered

Testing for Differences Between Mean of the Sample and Mean of the Population The manager of Pepperoni Pizza Restaurant has recently begun experimenting with a new method of baking its pepperoni pizzas. He believes that the new method produces a better-tasting pizza, but he would like to base a decision on whether to switch from the old method to the new method on customer reactions. Therefore he performs an experiment.

The Experiment For 40 randomly selected customers who order a pepperoni pizza for home delivery, he includes both an old style and a free new style pizza in the order. All he asks is that these customers rate the difference between pizzas on a -10 to +10 scale, where -10 means they strongly favor the old style, +10 means they strongly favor the new style, and 0 means they are indifferent between the two styles. -10 +10 Old pizza New pizza

One-Tailed Versus Two-Tailed Tests 1. Formulate H1and H0 One-Tailed Versus Two-Tailed Tests The form of the alternative hypothesis can be either a one-tailed or two-tailed, depending on what you are trying to prove. A one-tailed hypothesis is one where the only sample results which can lead to rejection of the null hypothesis are those in a particular direction, namely, those where the sample mean rating is positive. A two-tailed test is one where results in either of two directions can lead to rejection of the null hypothesis.

One-Tailed Versus Two-Tailed Tests -- continued 1. Formulate H1and H0 One-Tailed Versus Two-Tailed Tests -- continued Once the hypotheses are set up, it is easy to detect whether the test is one-tailed or two-tailed. One tailed alternatives are phrased in terms of “>” or “<“ whereas two tailed alternatives are phrased in terms of “” The real question is whether to set up hypotheses for a particular problem as one-tailed or two-tailed. There is no statistical answer to this question. It depends entirely on what we are trying to prove.

1. Formulate H1and H0 As the manager you would like to observe a difference between both pizzas If the new baking method is cheaper, you would like the preference to be for it. Null Hypothesis Alternative H0 =0 (there is no difference between the old style and the new style pizzas) (The difference between the mean of the sample and the mean of the population is zero) H1 0 or H1  >0 Two tail test One tail test = mu=population mean

2. Select Appropriate Test What we want to test is whether consumers prefer the new style pizza to the old style. We assume that there is no difference (i.e. the mean of the population is zero) and want to know whether our observed result is significantly (i.e. statistically) different. The one-sample t test is used to test whether the mean of the sample is equal to a hypothesized value of the population from which the sample is drawn.

Type I Error Rejecting the null hypothesis that the pizzas are equal, (and saying that they are different or the new style is better) when they really are perceived equal by the customers of the entire population. Type II error Accepting the null hypothesis that the pizzas are equal, when they are really perceived to be different by the customers of the entire population.

3. Choose Level of Significance Significance Level selected is typically .05 or .01 i.e 5% or 1%

The ratings of 40 randomly selected customers produces the following table and statistics From the summary statistics, we see that the sample mean is 2.10 and the sample standard deviation is 4.717 The positive sample mean suggests a slight preference for the new pizza, (alternative hypothesis) but there is a fair degree of variation. What we don’t know is whether this preference is significant

4. Calculate the Test Statistic X- 0 s/n  T(n-1)

5. Determine the Probability-value (Critical Value) We use the right tail because the alternative is one-tailed of the “greater than” variety The probability beyond this value in the right tail of the t distribution with n-1 = 39 degrees of freedom is approximately 0.004 The probability, 0.004, is the p-value for the test. It indicates that these sample results would be very unlikely if the null hypothesis is true.

7. Reject or do not reject H0 6. Compare with the level of significance,  (.05)and determine if the critical value falls in the rejection region Do not Reject H0 1- 2.816 2.023 -2.023 /2 Reject H0 Reject H0 7. Reject or do not reject H0 Since the statistic falls in the rejection area we reject Ho and conclude that the perceived difference between the pizzas is significantly different from zero.

8 Conclusion the sample evidence is fairly convincing that customers, on average, prefer the new-style pizza. Should the manager switch to the new-style pizza on the basis of these sample results? Depends. There is no indication that the new-style pizza costs any more to make than the old-style pizza. Therefore, unless there are reasons for not switching (for example, costs) then we recommend the switch.

Comparing Means Suppose you are the brand manager for Tylenol, and a recent TV ad tells the consumers that Advil is more effective (quicker) at treating headaches than Tylenol. An independent random sample of 400 people with a headache is given Advil, and 260 people report they feel better within an hour. Another independent sample of 400 people is taken and 252 people that took Tylenol reported feeling better. Is the TV ad correct? Or, in other words, is there a difference between the means of the two samples

Hypothesis Test for Two Independent Samples Test for mean difference: Null Hypothesis Alternative H0 1= 2 H1 1 2 Under H0 1- 2 = 0. So, the test concludes whether there is a difference between the means or not.

Comparison of means: Graphically Are the means equal? Or are the differences simply due to chance?

2. Select Appropriate Test In this example we have two independent samples Other examples populations of users and non-users of a brand differ in perceptions of the brand high income consumers spend more on the product than low income consumers The proportion of brand-loyal users in Segment 1 (eg males) is more than the proportion in segment II (e.g. females) The proportion of households with Internet in Canada exceeds that in USA Can be used for examining differences between means and proportions

2. Select Appropriate Test The two populations are sampled and the means and variances computed based on the samples of sizes n1 and n2 If both populations are found to have the same variance then a t-statistic is calculated. The comparison of means of independent samples assumes that the variances are equal. If the variances are not known an F-test is conducted to test the equality of the variances of the two populations.  F f

Unequal variances: The problem

Tylenol vs Advil H0: A - T = 0; H1: A - T  0 We would need to test if the difference is zero or not. H0: A - T = 0; H1: A - T  0 pA = 260/400= 0.65 pT = 252/400= 0.63 Variability of random means mean 1 – mean 2 t = z = .65 - .63 = 0.66 (.65)(.35)/400+ (.63)(.37)/400 For large samples the t-distribution approaches the normal distribution and so the t-test and the z-test are equivalent.

Differences Between Groups when Comparing Means Ratio scaled dependent variables t-test When groups are small When population standard deviation is unknown z-test When groups are large

Degrees of Freedom d.f. = n - k n = n1 + n2 k = number of groups where: n = n1 + n2 k = number of groups The degrees of freedom is (n1 + n2 –2)

Tylenol vs Advil  = 0.10 Critical value = 1.64  -1 /2 /2 - -1.64 0.66  Since 0.66 is less than the critical value of 1.64 we accept the null hypothesis: there is no difference between Advil and Tylenol users

Test for Means Difference on Paired Samples What is a paired sample? When two sets of observations relate to the same respondents When you want to measure brand recall before and after an ad campaign. Shoppers consider brand name to be more important than price Households spend more money on pizza than on hamburgers The proportion of a bank’s customers who have a checking account exceeds the proportion who have a savings account Since it is the same population that is being sampled the observations are not independent. The appropriate test is a paired-t-test

Example Q1. When purchasing golf clubs rate the importance 1-5 of price Q2. When purchasing golf clubs rate the importance 1-5 of brand H0 H1 One tailed H1 Two Tailed There is no difference in importance between brand and price Price is more important than brand There is a difference in importance between brand and price

What is an ANOVA? One-way ANOVA stands for Analysis of Variance Purpose: Extends the test for mean difference between two independent samples to multiple samples. Employed to analyze the effects of manipulations (independent variables) on a random variable (dependent).

What does ANOVA test? The null hypothesis tests whether the mean of all the independent samples is equal H0 1= 2 = 3 …..= n H1 1 2  3 …..  n The alternative hypothesis specifies that all the means are not equal

Definitions Dependent variable: the variable we are trying to explain, also known as response variable (Y). Independent variable: also known as explanatory variables or Factors (X). Research normally involves determining whether the independent variable has an effect on the variability of the dependent variable

Comparing Antacids Non comparative ad: Explicit Comparative ad: The maker of Acid-off, an antacid stomach remedy wants to know which type of ad results in the most positive brand attitude among consumers. Non comparative ad: Acid-off provides fast relief Explicit Comparative ad: Acid-off provides faster relief than Tums Non explicit comparative ad Acid-off provides the fastest relief Three groups of people are exposed to one type of ad and asked to rate their attitude towards the ad.

Comparing Antacids Brand Attitude Means Non Comparative Explicit Non Explicit Comparative Type of Ad

The dependent variable (denoted by Y) is called the response variable and in this case it is brand attitude (I.e. we want to know what effect ad type has on attitude toward the brand) The independent variables are called factors, in this case type of ad: non-comparative, explicit comparative, non-explicit comparative The different levels of the factor are called treatments. In this case the treatments are the different ratings for each of the three types of ads. There will be two sources of variation. Variation within the treatment (e.g. within the non-comparative ad etc.) Variation between the treatments (I.e. between the three types of ads)

The whole idea behind the analysis of variance is to compare the ratio of between group variance to within group variance. If the variance caused by the interaction between the samples is much larger when compared to the variance that appears within each group, then it is because the means are different. Degrees of Freedom The F statistic has DF for both numerator (between group) and denominator (within group) DF between group = (c-1) where c=number of groups DF within group = (N-c) where N is sample size

Decomposition of the Total Variation Independent Variable X Categories Total Sample X1 X2 X3 …. Xc Y1 Y1 Y1 …. Y1 Y1 Y2 Y2 Y2 …. Y2 Y2 Yn Yn Yn …. Yn Yn Y1 Y2 Y3 Yc Y Within Category Variation SSwithin Total Variation SSy Category Mean Grand Mean Between Category Variation SSbetween

ANOVA Test The null hypothesis would be tested with the F distribution Reject H0  df(c-1)/(N-c)

One way ANOVA investigates: Main effects factor has an across-the-board effect e.g., type of ad Or age or involvement

A TWO-WAY ANOVA investigates: INTERACTIONS effect of one factor depends on another factor e.g., larger advertising effects for those with no experience importance of price depends on income level and involvement with the product