Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.

Slides:



Advertisements
Similar presentations
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.
Advertisements

Comparing Two Means Dr. Andy Field.
Comparing Two Means.
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
PSY 307 – Statistics for the Behavioral Sciences
T-tests Part 2 PS1006 Lecture 3
The t-test:. Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means.
Lecture 9: One Way ANOVA Between Subjects
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Copyright © 2010 Pearson Education, Inc. Chapter 24 Comparing Means.
Today Concepts underlying inferential statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Chapter 14 Inferential Data Analysis
Non-parametric statistics
Relationships Among Variables
Bootstrapping applied to t-tests
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Chapter Eleven Inferential Tests of Significance I: t tests – Analyzing Experiments with Two Groups PowerPoint Presentation created by Dr. Susan R. Burns.
Choosing and using statistics to test ecological hypotheses
Chapter 15 Data Analysis: Testing for Significant Differences.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
Inferential Statistics 2 Maarten Buis January 11, 2006.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Comparing Two Means Prof. Andy Field.
TAUCHI – Tampere Unit for Computer-Human Interaction ERIT 2015: Data analysis and interpretation (1 & 2) Hanna Venesvirta Tampere Unit for Computer-Human.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Stats Lunch: Day 4 Intro to the General Linear Model and Its Many, Many Wonders, Including: T-Tests.
 Slide 1 Two-Way Independent ANOVA (GLM 3) Chapter 13.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
Introduction to Quantitative Research Analysis and SPSS SW242 – Session 6 Slides.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHAPTER 7 COMPARING TWO MEANS Difference between two groups.
ANOVA: Analysis of Variance.
Experimental Design and Statistics. Scientific Method
+ Comparing several means: ANOVA (GLM 1) Chapter 11.
AP Statistics Chapter 24 Comparing Means.
Review. Statistics Types Descriptive – describe the data, create a picture of the data Mean – average of all scores Mode – score that appears the most.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
The Independent- Samples t Test Chapter 11. Quick Test Reminder >One person = Z score >One sample with population standard deviation = Z test >One sample.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Stats Lunch: Day 8 Repeated-Measures ANOVA and Analyzing Trends (It’s Hot)
Assumptions 5.4 Data Screening. Assumptions Parametric tests based on the normal distribution assume: – Independence – Additivity and linearity – Normality.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 13.
Analysis of Variance STAT E-150 Statistical Methods.
Comparing Means Chapter 24. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Exploring Group Differences
Comparing Two Means Prof. Andy Field.
Comparing Two Means t-tests: Rationale for the tests t-tests as a GLM
Applied Statistical Analysis
Psych 231: Research Methods in Psychology
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Comparing Two Means Chapter 9

Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For example, manipulation of the independent variable involves having an experimental condition and a control. This situation can be analysed with a t-test

The t-test In the book, this is related to regression/correlation. – We will cover regression later, as it is traditionally used with more continuous type predictors (IVs). – However, one thing to know is that t-tests and ANOVAs are all special types of regression with categorical predictors. – Why not do it all regression in SPSS? The output is much easier using t-tests/ANOVAs.

Experiments Median splits are bad. – You separate the people who are close together and lump them with people who are not really like them. – Effect sizes get smaller. – Power/Type 2 problems.

Experiments Reminder: when you manipulate the levels of the IV, you are working with experimental research. If you just are examining two naturally occurring categories, then quasi- experimental/correlational.

Experiments Between subjects / Independent designs – When the people are in separate levels and only do one of the manipulations Repeated measures / within subjects / dependent designs – When the people get all of the levels

The t-test Independent t-test – Compares two means based on independent data – E.g., data from different groups of people Dependent t-test – Compares two means based on related data. – E.g., Data from the same people measured at different times. – Data from ‘matched’ samples.

Slide 8 Rationale to Experiments Variance created by our manipulation – Removal of brain (systematic variance) Variance created by unknown factors – E.g. Differences in ability (unsystematic variance) Lecturing Skills Group 1 Group 2

Independent t-test Example Are invisible people mischievous? – 24 Participants Manipulation – Placed participants in an enclosed community riddled with hidden cameras. – 12 participants were given an invisibility cloak. – 12 participants were not given an invisibility cloak. Outcome – measured how many mischievous acts participants performed in a week.

Rational for the t-test Let’s use the invisibility cloak data. – Two groups (levels!): – Cloak versus no cloak. What would our null hypothesis be? What would our research hypothesis be?

Rational for the t-test Two samples of data are collected and the sample means calculated. These means might differ by either a little or a lot. – Let’s calculate the two means. – Review of split file.

Rational for the t-test If the samples come from the same population, then we expect their means to be roughly equal. Although it is possible for their means to differ by chance alone, we would expect large differences between sample means to occur very infrequently.

Rational for the t-test We compare the difference between the sample means that we collected to the difference between the sample means that we would expect to obtain if there were no effect (i.e. if the null hypothesis were true). – So Mean Cloak versus Mean No Cloak – If the null is true, what does that imply for the means?

Rational for the t-test We use the standard error as a gauge of the variability between sample means. If the difference between the samples we have collected is larger than what we would expect based on the standard error then we can assume one of two: – Draw histograms here (use a stacked histogram).

Rational for the t-test 1.There is no effect and sample means in our population fluctuate a lot and we have, by chance, collected two samples that are atypical of the population from which they came. 2.The two samples come from different populations but are typical of their respective parent population. In this scenario, the difference between samples represents a genuine difference between the samples (and so the null hypothesis is incorrect).

Rational for the t-test As the observed difference between the sample means gets larger, the more confident we become that the second explanation is correct (i.e. that the null hypothesis should be rejected). If the null hypothesis is incorrect, then we gain confidence that the two sample means differ because of the different experimental manipulation imposed on each sample.

Rationale to the t-test t = observed difference between sample means − expected difference between population means (if null hypothesis is true) estimate of the standard error of the difference between two sample means

The Independent t-test

Let’s try this with the descriptives we collected earlier. – What are all these parts in the things we’ve talked about? DF Standard deviation Standard error

The Independent t-test Ok, I’ve got t, now what? – The t table. – Why should I care? SPSS gives me p? – Many Type 1 corrections require you to use t or q tables.

Independent t-test using SPSS

All that data screening still applies. – With this data, univariate = multivariate because we only have one continuous measure to screen. – I usually always screen multivariate (until problems arise) because they are equal with one variable (so it’s one set of rules to remember).

Independent t-test using SPSS Analyze > Compare means > Independent samples t-test

Independent t-test using SPSS IV (categorical) goes in grouping variable. DV (continuous) goes in test variables.

Independent t-test using SPSS What to do about those ? ? Marks. Hit define groups > enter your numbers you had for the value labels. – (This is why you can’t just use text).

Independent t-test using SPSS Hit ok, get some output. First box = means.

Independent t-test using SPSS Second box = t-test.

Calculating the Effect Size

Reporting the independent t-test On average, participants given a cloak of invisibility engaged in more acts of mischief (M = 5.00, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). This difference,  1.25, BCa 95% CI [  2.61, 0.04], was not significant t(22) = −1.71, p =.10; however, it did represent a medium-sized effect d =.65. (two decimals!).

Dependent t-test Example Are invisible people mischievous? – 24 Participants Manipulation – Placed participants in an enclosed community riddled with hidden cameras. – For first week participants normal behaviour was observed. – For the second week, participants were given an invisibility cloak. Outcome – measured how many mischievous acts participants performed in week 1 and week 2.

Dependent t-test The logic of this test is roughly the same, but you have to consider the matched nature of dependent results. – So now we are going to use the standard error of the differences rather than standard error. – Compute difference scores in SPSS (practice with tranforms).

The Dependent t-test

Dependent t-test in SPSS Analyze > Compare means > Paired samples t-test

Dependent t-test in SPSS Move both over to the right, hit ok.

Dependent t-test in SPSS

Effect Size Please note if you use r, then you cannot use the paired samples correlation … we are correcting for the matched nature of the experiment.

Calculating an Effect Size

Reporting the paired-samples t-test On average, participants given a cloak of invisibility engaged in more acts of mischief (M = 5.00, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). This difference,  1.25, 95% CI [−1.67, −0.83], was significant t(11) = −3.80, p =.003 and represented a medium-sized effect d =.65.

Back to graphs Create these using bar charts! How to deal with the stupid error bar problem.

Error Bar Solution Step 1) Calculate the mean for each participant (i.e. average cloak and no cloak). – Transform > compute – MEAN(variables) – Hit ok.

Error Bar Solution Step 2) Calculate the grand mean. Analyze > descriptives > frequencies. Move over the new mean you just calculated.

Error Bar Solution Step 3) Calculate the adjustment (i.e. we are dealing with the repeated nature business). Transform > compute. Your adjusted variable = Grand mean – Mean of participants.

Error Bar Solution Step 4) Create new adjusted original scores. Transform > compute. – Cloak_adj = cloak + adjustment (from last step). – Nocloak_adj = nocloak + adjustment (from last step).

Error Bar Solution Now graph those!

Assumptions of the t-test Both the independent t-test and the dependent t-test are parametric tests based on the normal distribution. Therefore, they assume: – The sampling distribution is normally distributed. In the dependent t­-test this means that the sampling distribution of the differences between scores should be normal, not the scores themselves. – Data are measured at least at the interval level.

Assumptions of the t-test The independent t-test, because it is used to test different groups of people, also assumes: – Variances in these populations are roughly equal (homogeneity of variance). – Scores in different treatment conditions are independent (because they come from different people).

Assumptions of the t-test In plain old English, – Independence – (accuracy, missing, outliers are dealt with). – Normality – Linearity – Homogeneity/Homoscedasticity – (basically not multicollinearity because there’s only one DV).

When Assumptions are Broken Dependent t-test – Mann-Whitney Test – Wilcoxon rank-sum test Independent t-test – Wilcoxon Signed-Rank Test Robust Tests: – Bootstrapping – Trimmed means