Comparing Two Means Prof. Andy Field.

Slides:



Advertisements
Similar presentations
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.
Advertisements

Comparing Two Means Dr. Andy Field.
Comparing Two Means.
Hypothesis Testing Steps in Hypothesis Testing:
Statistical Decision Making
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
T-Tests.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
T-tests Part 2 PS1006 Lecture 3
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Today Concepts underlying inferential statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Between Group & Within Subjects Designs Mann-Whitney Test.
5-3 Inference on the Means of Two Populations, Variances Unknown
Simple Linear Regression Analysis
Relationships Among Variables
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Statistical Significance R.Raveendran. Heart rate (bpm) Mean ± SEM n In men ± In women ± The difference between means.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Comparing Two Means Prof. Andy Field.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHAPTER 7 COMPARING TWO MEANS Difference between two groups.
Experimental Design and Statistics. Scientific Method
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Chapter 10 The t Test for Two Independent Samples
General Linear Model.
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
T tests comparing two means t tests comparing two means.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Chapter 13 Understanding research results: statistical inference.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Inferential Statistics
T-Tests and ANOVA I Class 15.
Logic of Hypothesis Testing
Dependent-Samples t-Test
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
Psych 706: stats II Class #4.
Non-Parametric Tests 12/1.
Comparing Two Means t-tests: Rationale for the tests t-tests as a GLM
Comparing several means: ANOVA (GLM 1)
Estimation & Hypothesis Testing for Two Population Parameters
Non-Parametric Tests 12/1.
Virtual COMSATS Inferential Statistics Lecture-26
Non-Parametric Tests 12/6.
Linear Regression Prof. Andy Field.
Non-Parametric Tests.
Inferential statistics,
STAT120C: Final Review.
Comparing Groups.
Ass. Prof. Dr. Mogeeb Mosleh
Association, correlation and regression in biomedical research
Non-parametric tests, part A:
CS 594: Empirical Methods in HCC Experimental Research in HCI (Part 2)
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
What are their purposes? What kinds?
InferentIal StatIstIcs
Type I and Type II Errors
Presentation transcript:

Comparing Two Means Prof. Andy Field

Aims t-tests Rationale for the tests Interpretation Reporting results Dependent (aka paired, matched) Independent Rationale for the tests Assumptions Interpretation Reporting results Calculating an Effect Size Categorical predictors in the linear model.

Experiments The simplest form of experiment that can be done is one with only one independent variable that is manipulated in only two ways and only one outcome is measured. More often than not the manipulation of the independent variable involves having an experimental condition and a control. E.g., Is the movie Scream 2 scarier than the original Scream? We could measure heart rates (which indicate anxiety) during both films and compare them. This situation can be analysed with a t-test

t-test Dependent t-test Independent t-test Significance testing Compares two means based on related data. E.g., Data from the same people measured at different times. Data from ‘matched’ samples. Independent t-test Compares two means based on independent data E.g., data from different groups of people Significance testing Testing the significance of Pearson’s correlation coefficient Testing the significance of b in regression.

Rationale to Experiments Group 1 Group 2 Lecturing Skills Variance created by our manipulation Removal of brain (systematic variance) Variance created by unknown factors E.g. Differences in ability (unsystematic variance) Slide 5

Rationale for the t-test Two samples of data are collected and the sample means calculated. These means might differ by either a little or a lot. If the samples come from the same population, then we expect their means to be roughly equal. Although it is possible for their means to differ by chance alone, we would expect large differences between sample means to occur very infrequently.

Rationale for the t-test (2) We compare the difference between the sample means that we collected with the difference between the sample means that we would expect to obtain if there were no effect (i.e. if the null hypothesis were true). We use the standard error as a gauge of the variability between sample means. If the difference between the samples we have collected is larger than what we would expect based on the standard error then we can assume one of two: There is no effect and sample means in our population fluctuate a lot and we have, by chance, collected two samples that are atypical of the population from which they came. The two samples come from different populations but are typical of their respective parent population. In this scenario, the difference between samples represents a genuine difference between the samples (and so the null hypothesis is incorrect).

Rationale for the t-test (3) As the observed difference between the sample means gets larger, the more confident we become that the second explanation is correct (i.e. that the null hypothesis should be rejected). If the null hypothesis is incorrect, then we gain confidence that the two sample means differ because of the different experimental manipulation imposed on each sample.

Rationale to the t-test (4) = observed difference between sample means − expected difference between population means (if null hypothesis is true) estimate of the standard error of the difference between two sample means

The Dependent t-test

The Independent t-test 1 2 Equal sample sizes: Denominator, unequal sample sizes:

Assumptions of the t-test Both the independent t-test and the dependent t-test are parametric tests based on the normal distribution. Therefore, they assume: The sampling distribution is normally distributed. In the dependent t­-test this means that the sampling distribution of the differences between scores should be normal, not the scores themselves. Data are measured at least at the interval level. The independent t-test, because it is used to test different groups of people, also assumes: Variances in these populations are roughly equal (homogeneity of variance). Scores in different treatment conditions are independent (because they come from different people).

Independent t-test Example Are invisible people mischievous? 24 Participants Manipulation Placed participants in an enclosed community riddled with hidden cameras. 12 participants were given an invisibility cloak. 12 participants were not given an invisibility cloak. Outcome measured how many mischievous acts participants performed in a week.

Independent t-test using SPSS

Independent t-test Output I

Independent t-test Output II Bootstrapping Output

Calculating the Effect Size

Reporting the independent t-test On average, participants given a cloak of invisibility apparently engaged in apparently more acts of mischief (M = 5, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). However, this difference, 1.25, BCa 95% CI [2.606, 0.043], was not significant t(22) = −1.71, p = .101; nevertheless, it did represent a medium-sized effect d = .65.

Matched-samples t-test Example Are invisible people mischievous? 24 Participants Manipulation Placed participants in an enclosed community riddled with hidden cameras. For first week participants normal behaviour was observed. For the second week, participants were given an invisibility cloak. Outcome measured how many mischievous acts participants performed in week 1 and week 2.

Paired- samples t-test Output

Paired- samples t-test Output Continued

Calculating an Effect Size

Reporting the paired-samples t-test On average, participants given a cloak of invisibility engaged in more acts of mischief (M = 5, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). This difference, 1.25, BCa 95% CI [−1.67, −0.83], was significant t(11) = −3.80, p = .003 and represented a medium-sized effect d = .65.

Categorical predictors in the linear model

No-cloak group The group variable = 0 Intercept = mean of baseline group

Cloak Group The group variable = 1 b1 = Difference between means

Output from a Regression

When Assumptions are Broken Independent t-test Mann-Whitney Test (Wilcoxon rank-sum test) Dependent t-test Wilcoxon Signed-Rank Test Robust Tests Bootstrapping Trimmed means

Conclusion Simple experimental designs Parametric data Related t-test Between subjects Within subjects Parametric data Related t-test Unrelated t-test Effect sizes: d and r Unrelated t-test as regression Dummy variables Bootstrapping Trimmed means