Difference Between Means Test (“t” statistic) Analysis of Variance (F statistic)

Slides:



Advertisements
Similar presentations
Hypothesis Testing Steps in Hypothesis Testing:
Advertisements

Bivariate Analyses.
Bivariate Analysis Cross-tabulation and chi-square.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Hypothesis testing Week 10 Lecture 2.
Lecture 9: One Way ANOVA Between Subjects
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Cross-Tabulations.
Today Concepts underlying inferential statistics
Hypothesis Testing Using The One-Sample t-Test
Relationships Among Variables
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Presentation 12 Chi-Square test.
AM Recitation 2/10/11.
Hypothesis Testing:.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
HYPOTHESIS TESTING Dr. Aidah Abu Elsoud Alkaissi
Hypothesis Testing II The Two-Sample Case.
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
Statistical Analysis Statistical Analysis
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Single Sample Inferences
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Chapter 9: Testing Hypotheses
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Hypothesis Testing Using the Two-Sample t-Test
CATEGORICAL VARIABLES Testing hypotheses using. Independent variable: Income, measured categorically (nominal variable) – Two values: low income and high.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Preparing for the final - sample questions with answers.
Difference Between Means Test (“t” statistic) Analysis of Variance (“F” statistic)
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
DIRECTIONAL HYPOTHESIS The 1-tailed test: –Instead of dividing alpha by 2, you are looking for unlikely outcomes on only 1 side of the distribution –No.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CATEGORICAL VARIABLES Testing hypotheses using. When only one variable is being measured, we can display it. But we can’t answer why does this variable.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Psych 230 Psychological Measurement and Statistics
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
© Copyright McGraw-Hill 2004
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
Chapter 13 Understanding research results: statistical inference.
T-tests Chi-square Seminar 7. The previous week… We examined the z-test and one-sample t-test. Psychologists seldom use them, but they are useful to understand.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
SAMPLING. Basic concepts Why not measure everything? – Practical reason: Measuring every member of a population is too expensive or impractical – Mathematical.
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
Final exam practice questions (answers at the end)
Chi-Square X2.
Difference Between Means Test (“t” statistic)
Inferential Statistics
Statistical Inference about Regression
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Presentation transcript:

Difference Between Means Test (“t” statistic) Analysis of Variance (F statistic)

Difference Between Means (“t”) Test So far we’ve examined several statistics that can be used to test hypotheses: – Chi-Square (X 2 ), which requires all variables be categorical – Regression (R 2 ), which requires all variables be continuous – Logistic regression (b and Exp b), which requires a nominal dependent variable The difference between the means test (t) is used to test hypotheses with categorical independent and continuous dependent variables – Gender  height – Gender  cynicism (1-5 scale) We compare the means of two randomly drawn samples – The null hypothesis can be rejected if the difference, reflected in the magnitude of the t statistic, goes so far beyond what would be expected by sampling error, that there are less than five chances in 100 (p>.05) that the relationship between the variables is due to chance – This sampling error is called the “standard error of the difference between means” – the difference between all possible pairs of means, due to chance alone When using the t table we must know whether the hypothesis is 1-tailed (direction of effect predicted) or 2-tailed (direction not predicted) Major advantage: Remember that weak real-life effects can produce significant results? – When comparing means, we know their actual values. This lets us recognize situations where differences are, in the real world, trivial.

Calculating t 1. Obtain the “pooled sample variance” S p 2 (Simplified method – midpoint between the two sample variances) 2.Compute the S.E. of the Diff. Between Means 3.Compute the t statistic 4.Compute the “degrees of freedom” df = n 1 + n (total number of cases in both samples minus 2) Actual (“obtained”) difference between means Predicted difference due to sampling error The t is a ratio: the greater the difference between means, the smaller the predicted error, the larger the t coefficient The larger the t, the more likely we are to reject the null hypothesis. According to the null, any relationship between variables, any difference between means, is due to chance. Our actual difference between means must be “significantly” larger than the difference we would obtain by chance. We use a table to determine whether the t is large enough to reject the null hypothesis (see next slide). We can reject the null if the probability that the difference between means is due to chance is less than five in one-hundred (p<.05). If the probability that the difference between the means is due to chance is five in one-hundred or larger (p>.05), the null hypothesis is true.

Classroom exercise - Jay’s police department H 1 : Male officers more cynical than females (1 - tailed) H 2 : Officer gender determines cynicism (2 - tailed) 1.Draw one sample of male officers, one of females 2.Compute each sample’s variance, then obtain the pooled sample variance 3.Compute the S.E. of the Difference Between Means 4.Calculate the t coefficient 5. C ompute the degrees of freedom df = n 1 + n Check the t table (next slide) for significance. Confirm the (working) hypothesis if there are less than 5 chances in 100 that the null is true. Be sure to use the correct significance row (1-tailed or 2-tailed). Use the one-tailed test if the working hypothesis predicted the direction of the difference (H 1 ) - that is, that one group, male or female, would be significantly more cynical than the other. Use the two-tailed test if you predicted that cynicism would be significantly different, but not which group would be more cynical (H 2 ). One-tailed hypotheses require a smaller t to reach statistical significance. t-table

1. Is hypothesis one-tailed (direction of change in the DV predicted) or two-tailed (direction not predicted)? H 1 : Males more cynical than females. This is one-tailed, so use the top row. H 2 : Males and females differ in cynicism. This is two-tailed, so use the second row. 2. df, “Degrees of Freedom” represents sample size – add the numbers of cases in both samples, then subtract two: df = n 1 + n 2 – 2 3. To call a t “significant” (thus reject the null hypothesis) the coefficient must be as large or larger than what is required at the.05 level; that is, we cannot take more than 5 chances in 100 that the difference between means is due to chance. For a one-tailed test, use the top row, then slide over to the.05 column. For a two-tailed test, use the second row, then slide to.05 column. If the t is smaller than the number at the intersection of the.05 column and the appropriate df row, it is non-significant. If the t is that size or larger, it is significant. Slide to the right to see if it is large enough to be significant at a more stringent level.

Parking lot exercise Higher income  More expensive car 1.Transfer your panel’s data from the other coding sheet 2.Compute each sample’s variance, then calculate the pooled sample variance = = Compute the S.E. of the Difference Between Means  1.49 ( )  1.49 (.2) = .3 =.55 …continued on next slide

4.Calculate the t coefficient = = -3.5 Note: the sign, + or -, indicates the direction of the difference between groups. Keep that in mind! It turns out that, consistent with the hypothesis, the faculty lot has the more expensive cars. We had arbitrarily placed it second, so subtracting yields a negative t. 5.Compute the df (degrees of freedom) df = n 1 + n = = 18 6.Check the t table for significance. Note: Use the one-tailed test if the working hypothesis predicts the direction of the difference (which parking lot would have more expensive cars). Use the two-tailed test if the hypothesis predicts there will be a difference in car values between the lots, but not which lot would have the more expensive cars df = 18 t = -3.5 Yes! There are less than five chances in one-thousand that the null hypothesis is true (one-tailed) or less than one in one-hundred that it is true (two-tailed)

More complex mean comparisons: Analysis of Variance

Dependent variable: continuous Independent variable(s): categorical Example: does officer professionalism vary between cities? (scale 1-10) Calculate the “F” statistic, look up the table. An “F” statistic that is sufficiently large can overcome the null hypothesis that the differences between the means are due to chance. When there are more than two groups: Analysis of Variance CityL.A.S.F.S.D. Mean853

Stratified independent variable(s) F statistic is a ratio of “between-group” to “within” group differences. To overcome the null hypothesis, the differences in scores between groups (between cities and, overall, between genders) should be much greater than the differences in scores within cities Between group variance (error + systematic effects of ind. variable) Within group variance (how scores disperse within each city) “Two-way” Analysis of Variance CityL.A.S.F.S.D. Mean – M1075 Mean - F632 Between Withi n

Homework

Two random samples of 10 patrol officers from the XYZ Police Department, each officer tested for cynicism (continuous variable, scale 1-5) Sample 1 scores: Variance =.99 Sample 2 scores: Variance =.93 Homework assignment

Pooled sample variance S p 2 Simplified method: midpoint between the two sample variances S p 2 = Standard error of the difference between means   x1 -  x2   x 1 -  x 2 =  S p 2 ( ) T-Test for significance of the difference between means  x 1 -  x 2 t =   x 1 -  x 2 s s n 1 n 2 +

CALCULATIONS Pooled sample variance:.96 Standard error of the difference between means:.44 t statistic: 1.14 df – degrees of freedom: (n1 + n2) – 2 = 18 Would you use a ONE-tailed t-test OR a TWO-tailed t- test? Depends on the hypothesis Two-tailed (does not predict direction of the change): Gender  cynicism One-tailed (predicts direction of the change): Males more cynical than females Can you reject the NULL hypothesis? (probability that the t coefficient could have been produced by chance must be less than five in a hundred) NO – For a ONE-tailed test need a t of or higher NO – For a TWO-tailed test need a t of or higher

Final exam practice

You will be given scores and variances for two samples and asked to decide whether their means are significantly different. You will be asked to state the null hypothesis. You will then compute the t statistic. You be given formulas, but should know the methods by heart. Please refer to week 15 slide show. To compute the t you will compute the pooled sample variance and the standard error of the difference between means. You will then compute the degrees of freedom (adjusted sample size) and use the t table to determine whether the coefficient is sufficiently large to reject the null hypothesis. – Print and bring to class: – Use the one-tailed test if the direction of the effect is specified, or two-tailed if not You will be asked to express using words what the t-table conveys about the significance (or non-significance) of the t coefficient Sample question: Are male CJ majors significantly more cynical than female CJ majors? We randomly sampled five males and five females. Males: 4, 5, 5, 3, 4 Females: 4, 3, 4, 4, 5 – Null hypothesis: No significant difference between cynicism of males and females – Variance for males (provided): 0.7 Variance for females (provided): 0.5 – Pooled sample variance =.6 SE of the difference between means =.49 t =.41 df = 8 – Check the “t” table. Can you reject the null hypothesis? NO – Describe conclusion using words: The t must be at least 1.86 (one-tailed test) to reject the null hypothesis of no significant difference in cynicism, with only five chances in 100 that it is true.