Testing the Differences between Means Statistics for Political Science Levin and Fox Chapter Seven 1.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Hypothesis Testing W&W, Chapter 9.
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Introduction to Hypothesis Testing
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Introduction to Hypothesis Testing
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
8-2 Basics of Hypothesis Testing
BCOR 1020 Business Statistics
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
Introduction to Hypothesis Testing
Inferential Statistics & Test of Significance
Hypothesis testing is used to make decisions concerning the value of a parameter.
Hypothesis Testing II The Two-Sample Case.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Chapter 6 Preview – Generalizing from Samples to Populations  Sampling error always occurs.  Standard error of the mean allows us to construct confidence.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Chapter 9 Large-Sample Tests of Hypotheses
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistics and Quantitative Analysis U4320
Copyright ©2011 Nelson Education Limited Large-Sample Tests of Hypotheses CHAPTER 9.
Individual values of X Frequency How many individuals   Distribution of a population.
Introduction to Probability and Statistics Thirteenth Edition Chapter 9 Large-Sample Tests of Hypotheses.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Chapter 6 USING PROBABILITY TO MAKE DECISIONS ABOUT DATA.
Chapter 7 Testing Differences between Means. 2 Testing Differences Between Means  Establish hypothesis about populations, collect sample data, and see.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Chapter 9 Probability. 2 More Statistical Notation  Chance is expressed as a percentage  Probability is expressed as a decimal  The symbol for probability.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Chapter 20 Testing Hypothesis about proportions
Chapter 8 Parameter Estimates and Hypothesis Testing.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Nonparametric Tests of Significance Statistics for Political Science Levin and Fox Chapter Nine Part One.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
Statistical Techniques
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
GOVT 201: Statistics for Political Science
Hypothesis Testing I The One-sample Case
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Hypothesis Testing Review
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.
Presentation transcript:

Testing the Differences between Means Statistics for Political Science Levin and Fox Chapter Seven 1

What is hypothesis testing? When we evaluate sample data collected about a particular population and see how likely the sample results are, given our hypothesis about the population. If the sample results are plausible under the hypothesis about the population, we accept the hypothesis. If the sample results are unlikely (less than 5 chances in 100) we then reject the hypothesis (or retain the null) and attribute any departure form our expected results to be pure chance based on sampling error. 2

The Null Hypothesis Null Hypothesis: It is the hypothesis that says that two samples have been drawn from equivalent populations. Any observed difference between samples is a result of chance occurrence resulting from sampling error alone. The difference in sample means does not imply a difference in population means. To conclude that sampling error is responsible for obtaining a difference between sample means is to retain the null hypothesis: µ 1 = µ 2 Where µ 1 = mean of the first population µ 2 = mean of the second population To Retain: Does not imply that we have proven the population means are equal, but rather that we lack sufficient evidence to say otherwise (that is, to say they there is a difference between the populations). 3

The Research Hypothesis for Means Difference Research Hypothesis: If we reject the null hypothesis, then we automatically accept the research hypothesis that a true population difference does exist. The difference between sample means is too large to be accounted for by sampling error. The research hypothesis for mean differences is symbolized by (the population means are not equal): µ 1 ≠ µ 2 4

Null and Research Hypothesis Hypothesis: Example Men are more permissive than women with regards to disciplining children. Null Hypothesis: The null hypothesis holds that there is no difference between Men and Women (as populations) when it comes to disciplining children. Any observed difference is the result of sampling error (rather than actual difference). 5

Null and Research Hypothesis Hypothesis: Example Men are more permissive than women with regards to disciplining children. Research Hypothesis: The research hypothesis holds that there IS a difference between between Men and Women (as populations) when it comes to disciplining children. 6

Sampling Distribution of Differences between Means Sampling Distribution of Differences between Means: Recall from our long-distance phone calling example, that if a researcher was to take multiple samples, he/she could get a sampling distribution of means (rather than raw scores). Paired Samples: What if the researcher, while gathering samples, studies or compares two samples at a time. 7

Sampling Distribution of Differences between Means Example: Child Rearing: Comparing Males and Females To test the difference, a researcher constructs a scale of permissiveness from 1 (Strict: not very permissive) to 100 (very permissive). Then they study two random samples of 30 men and 30 women. Results: Women: (sample mean) = 58.0 (more permissive) Men: (sample mean) = 54.0 (less permissive) Difference Between Means: (58.0 – 54.0) = Is the difference the result of chance alone/sampling error (Null hypothesis)? Or is there a difference between men and women (as populations) (Research Hypothesis)? 8

Sampling Distribution of Differences between Means Example: Child Rearing: Comparing Males and Females What if the researcher continued to take samples, and took 70 additional pairs of samples, each containing 30 women and 30 men. This would give us, as it did with the first two paired samples a difference between the means. Sampling Distribution of Differences between Means: And, just as we did first with raw scores, and then sample means, once we have a distribution of mean differences we can construct a Sampling Distribution of Differences between Means. 9

Sampling Distribution of Differences between Means Example: Child Rearing: Comparing Males and Females What if the researcher continued to take samples, and took 70 additional pairs of samples, each containing 30 women and 30 men. 10 Population: µ = ? Population: µ = ? 57 Samples: Women (30 in each)Samples: Men (30 in each) … _ _ _ = + 3 = - 1 = + 2 Note: You always subtract the second sample mean (men) from the first sample mean (women).

The Purpose and Function of a Sampling Distribution of Differences between Means Child Rearing: Males and Females Here is what it looks like as a frequency distribution. Mean Difference f N = 35 11

Testing Hypotheses with the Distribution of Differences between Means Sampling Distribution of Differences between Means: 1) It assumes that all sample pairs differ only by virtue of sample error and not as a function of true population differences. 2) The mean of the difference between means equals zero (this is so because the resulting positive and negative numbers tend to cancel each other out. 3) Approximates the normal curve (most of the mean differences fall near zero, which is expected since any difference between means is a product of sampling error.) 12

Testing Hypotheses with the Distribution of Differences between Means Probability and Sampling Distribution of Differences between Means: Since Sampling Distribution of Differences between Means approximates the normal curve, we can use the properties of the normal curve to make statements of probability about mean differences, specifically whether it is likely or not that the mean difference is a result of chance/sampling error or true population differences. 13

Testing Hypotheses with the Distribution of Differences between Means 14 Closer to zero, more likely to be sample error Further from zero, less likely to be sample error Null Research

Probability and Sampling Distribution of Differences between Means: If the obtained difference between means lies so far from a difference of zero that it has only a small probability of occurrence in the sampling distribution of differences between means, we reject the null hypothesis. If our sample mean difference falls so close to zero that its probability of occurrence is large, we must retain the null hypothesis and treat the obtained difference as a sampling error. 15

Testing Hypotheses with the Distribution of Differences between Means Example: Child Rearing: Comparing Males and Females What if the researcher examines one pair (as opposed to 70 pairs) containing 30 men and 30 women. (Subtract second mean from the first.) Results: Women: (sample mean) = 45.0 Men: (sample mean) = 40.0 Difference Between Means: (45.0 – 40.0) = How far does fall from the mean of zero? 16

So, to determine how far our obtained difference betweens lies from the mean difference of zero, we must translate our obtained difference into units of standard deviation. Step 1: Recall this formula for standardizing units of deviation: Raw Score 17 Z = X - µ σ Wherex = raw score µ = mean of the distribution of raw scores σ = standard deviation of the distribution of raw scores Child Rearing: Comparing Males and Females

Step 2a: Use this formula as step in translating the mean scores in a distribution of sample means into units of standard deviation. 18 Z = - µ σ Where = sample mean µ = population mean (mean of means) σ = standard error of the mean (standard deviation of the distribution of means) Child Rearing: Comparing Males and Females

Step 2b: Translate our sample mean difference into units of standard deviation. 19 Z = ( 1 – 2 ) - 0 Where = mean of the first sample = mean of the second sample 0 = zero, the value of the mean of the sampling distribution of differences between means (we assume that µ 1 - µ 2 = 0) = standard error of the mean (standard deviation of the distribution of the difference between means) We can reduce this equation down to the following: Child Rearing: Comparing Males and Females

Result: (assuming equals 2) 20 Z = ( 45 – 40) Thus, a difference of 5 between the means of the two samples (women and men) falls 2.5 standard deviations from a mean of zero. 2 Z =+ 2.5 Child Rearing: Comparing Males and Females

z = % P =.4938 P=.006 P =.4938 P =.012 What is the probability that a difference of 5 between sample means could be caused by sampling error? The probability of getting 5 or move (above or below the mean) because of sample error is roughly P =.01 (1 in a 100). 5 and above P =.006 (.06 in 100) %.62 %

The α (alpha) value is the level of probability at which the null hypothesis can be rejected with confidence and the research hypothesis accepted with confidence. We decide to reject the null hypothesis if the probability is very small. This is symbolized as P ≤.05 (P is less than or equal to.05) 22 Levels of Significance Is a mean difference of 5, which has a P =.01 chance of resulting from sample error statistically significant, that is, does it result from population difference? Levels of Significance: We need to establish this to determine whether or not our obtained sample difference is statistically significant.

Things to Know about Levels of Significance: A small probability is symbolized by – P ≤.05 Alpha is generally defined as (95 % Confidence Interval) – α =.05 level of significance This means that we are willing to reject the null hypothesis if an obtained sample difference occurs by chance less than 5 times out of 100. Thus, a mean difference of 5, between men women with regards to their approach to child-rearing is statistically significant, and is not the result of sampling error but differences between the populations. 23

Child Rearing: Comparing Males and Females Thus, a mean difference of 5, between men women with regards to their approach to child-rearing is statistically significant, and is not the result of sampling error but differences between the populations. 24

In this case, the z scores are called critical values. With α =.05, the z score ±1.96 is a critical value. If we obtain a z score that exceeds 1.96 (z>1.96 or z<-1.96), it is statistically significant. Critical or rejection regions are those areas beyond the z score to the tail of the normal curve and scores within these areas lead us to reject the null hypothesis. 25 Critical Values

z = % 47.5 % Critical Values: Z Score 2.50 % z = %

Critical Values: Z Score 2.50 % z= z= If we obtain a z score that exceeds 1.96 it is called statistically significant. 0 0 Statistically Significant: reject Null Hypothesis Statistically Significant: reject Null Hypothesis Statistically Insignificant: accept Null Hypothesis

Significance levels can be set up for any degree of probability. NOTE: Levels of significance do not give us an absolute statement as to the correctness of the null hypothesis. We can choose to accept or reject the null hypothesis anyway. 28 Significance levels

Type I Errors Type 1 Error: Rejecting the hypothesis when it should have been retained For example, if we reject the the null hypothesis at the.05 level of significance and conclude that there are gender differences in child-rearing attitudes, then there are 5 chances out of 100 that we are wrong. Or P =.05 that we committed a Type I error and that gender actually has no effect. The more stringent our level of significance (the farther out in the tail it lies), the less likely we are to make a Type 1 error. The probability of a Type I error is represented by α or alpha. 29

Type II Errors Type II Error: Accepting the null hypothesis when it should have been rejected The farther out in the tail of the curve that our critical value falls, the greater the risk of a Type II error. The research hypothesis may still be correct, despite the decision to reject it and retain the null hypothesis. One method for reducing the risk of a Type II error is to increase the size of the sample so that the true population difference is more likely to be represented. The probability of a Type II error is β or beta. 30

Error Types: Type I z= z= Type I: Reject Null, when we should have retained it % The larger the significance level, and thus % on the tail, the more likely we are to mistakenly reject the null. Example: 95% confidence interval, α =.05, z =1.96

Error Types: Type I z= +258 z= Type I: Reject Null, when we should have retained it 0 0.5% The smaller the significance level, and thus % on the tail, the less likely we are to mistakenly reject the null. Example: 99% confidence interval, α =.01, z =2.58

Error Types: Type II z= +258 z= Type II: Accept Null, when we should have rejected it 0 0.5% The smaller the significance level, and thus % on the tail, the more likely we are to mistakenly accept the null. Example: 99% confidence interval, α =.01, z =2.58

Some notes on Type I and Type II Errors The probabilities of Type I and Type II errors are inversely related. The larger the level of significance, the larger the chance of a Type I error. We predetermine our level of significance for a hypothesis test depending on which error is the least damaging or costly. If it would be far worse to reject a true null hypothesis (that is, suggest statistically significance, or different populations where there is none) (Type I error) than to retain a false null hypothesis (to suggest there is no population difference where there is difference) (Type II error), we should use a use a smaller level of significance: α =.01 34

The Difference between P and α P is the exact probability that the null hypothesis is true in light of some sample data. Alpha is the threshold below which is considered so small that we decide to reject the null hypothesis. We reject the null hypothesis if the P value is less than the alpha value. 35

The Difference between P and α z= z= Example: a mean difference of 5 has a P of.006 x2 = roughly.01 (1 in a 100), whereas α =.05 cuts off the null hypothesis at.025 x2 =.05 (5 chances in 100) % α =.05/z =1.96: 95.0% z = % P = % P= P =.025%

The Difference between P and α z= z= Example: a mean difference of 5 has a P of.006 x2 = roughly.01 (1 in a 100), whereas α =.05 cuts off the null hypothesis at.025 x2 =.05 (5 chances in 100). Any mean difference below 5 chances in 100 Supports the research hypothesis % 5 5 P =.025% 2.50% P =.006 Statistically Significant