The t-test:. Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means.

Slides:



Advertisements
Similar presentations
The independent-means t-test:
Advertisements

Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Evaluation (practice). 2 Predicting performance  Assume the estimated error rate is 25%. How close is this to the true error rate?  Depends on the amount.
The Two Sample t Review significance testing Review t distribution
PSY 307 – Statistics for the Behavioral Sciences
Behavioural Science II Week 1, Semester 2, 2002
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE.
Lecture 8 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Lecture 11 Psyc 300A. Null Hypothesis Testing Null hypothesis: the statistical hypothesis that there is no relationship between the variables you are.
Inferential Stats for Two-Group Designs. Inferential Statistics Used to infer conclusions about the population based on data collected from sample Do.
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Hypothesis testing & Inferential Statistics
Overview of Lecture Parametric Analysis is used for
Don’t spam class lists!!!. Farshad has prepared a suggested format for you final project. It will be on the web
Lecture 9: One Way ANOVA Between Subjects
T-Tests Lecture: Nov. 6, 2002.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
DTC Quantitative Research Methods Statistical Inference II: Statistical Testing Thursday 7th November 2014  
Chapter 11: Inference for Distributions
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Inferential Statistics
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing:.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Sampling Distributions and Hypothesis Testing. 2 Major Points An example An example Sampling distribution Sampling distribution Hypothesis testing Hypothesis.
Statistical Analysis Statistical Analysis
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Education 793 Class Notes T-tests 29 October 2003.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Comparing Means: t-tests Wednesday 22 February 2012/ Thursday 23 February 2012.
From last lecture (Sampling Distribution): –The first important bit we need to know about sampling distribution is…? –What is the mean of the sampling.
Hypothesis Testing CSCE 587.
Inferential Statistics 2 Maarten Buis January 11, 2006.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Independent Samples 1.Random Selection: Everyone from the Specified Population has an Equal Probability Of being Selected for the study (Yeah Right!)
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Experimental Design Experiment: A type of research study that tests the idea that one variable causes an effect on another variable.
Test for Significant Differences T- Tests. T- Test T-test – is a statistical test that compares two data sets, and determines if there is a significant.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Quick and Simple Statistics Peter Kasper. Basic Concepts Variables & Distributions Variables & Distributions Mean & Standard Deviation Mean & Standard.
DIRECTIONAL HYPOTHESIS The 1-tailed test: –Instead of dividing alpha by 2, you are looking for unlikely outcomes on only 1 side of the distribution –No.
Smith/Davis (c) 2005 Prentice Hall Chapter Nine Probability, the Normal Curve, and Sampling PowerPoint Presentation created by Dr. Susan R. Burns Morningside.
Psych 230 Psychological Measurement and Statistics
Chapter 8 Parameter Estimates and Hypothesis Testing.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Chapter 10 The t Test for Two Independent Samples
Student’s t test This test was invented by a statistician WS Gosset ( ), but preferred to keep anonymous so wrote under the name “Student”. This.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
T-tests Chi-square Seminar 7. The previous week… We examined the z-test and one-sample t-test. Psychologists seldom use them, but they are useful to understand.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 9 Introduction to the t Statistic
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Central Limit Theorem, z-tests, & t-tests
Hypothesis Testing.
Statistics for the Social Sciences
Some statistics questions answered:
Presentation transcript:

The t-test:

Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means t-test: (”Matched-pairs" or "one-sample" t-test). Same subjects do both experimental conditions e.g., two conditions A and B: half subjects do A then B; rest do B then A. (Randomly allocated to one order or the other).

(b) “Independent-means t-test: (“Two-sample" t-test). Different subjects do each experimental condition. e.g., two conditions A and B: half subjects do A; rest do B. (Randomly allocated to A or B).

Both types of t-test have one independent variable, with two levels (the two different conditions of our experiment). There is one dependent variable (the thing we actually measure). Effects of alcohol on reaction-time performance. I.V. is "alcohol consumption". Two levels - drunk and sober. D.V. is RT. Use a repeated-measures t-test: measure each subject's RT twice, once while drunk and once while sober. Effects of personality type on a memory test. I.V. is "personality type". Two levels - introversion and extraversion. D.V. is memory test score. Use an independent-measures t-test: measure each subject's memory score once, then compare introverts and extraverts.

Rationale behind the t-test: Experiment on the effects of alcohol on RT. Measure RT for subjects when drunk, and when sober. Null hypothesis: alcohol has no effect on RT: variation between the drunk sample mean and the sober sample mean is due to sampling variation. The drunk and sober scores are samples from the same population (sober RTs).

If the difference between the sober and drunk means is large, we might prefer to believe that alcohol has affected RT: the difference is not due to sampling variation, but arose because the drunk and sober scores are samples from two different populations - the population of sober RTs and the population of drunk RTs. How large is “large”? The t-test enables us to decide.

Both types of t-test are similar in principle to the z-score. Observed difference between sample means Predicted difference between sample means (that there will be no difference at all) measure of the extent to which pairs of sample means might differ

t-distribution becomes progressively more like the normal distribution as sample size (n) increases:

1. We have two sample means, which differ. 2. Null hypothesis is that the two samples come from the same population; if so, ideally the sample means should be identical. RT for drunk sample: 800 ms RT for sober sample: 300 ms Difference: 500 ms “Sober” sample RT really reflects sober population RT: say, 600 ms “Drunk” sample RT also really reflects sober population RT: 600 ms Difference: 0 The difference between 500 ms and 0 ms is due to chance (sampling variation).

3. Alternative hypothesis is that our experimental manipulation has affected our subjects. The two samples (drunk and sober) are samples from different populations with different means. If so, the samples might well have different means. (e.g. the sober sample mean of 300 ms might reflect a sober population mean of 300 ms; the drunk sample mean of 800 ms might reflect a drunk population mean of 800 ms).

A big difference between our two sample means therefore suggests that either: (a) the two sample means are poor reflections of the mean of the single population that they are supposed to represent (i.e., our samples are atypical ones). OR (b) The two sample means are actually from two different parent populations, and our initial assumption that the samples both come from the same population is wrong. The bigger the difference between our two sample means, the less plausible (a) becomes, and the more likely that (b) is true.

Does Prozac affect driving ability? Ten subjects have their driving performance tested twice on a sheep farm: test A after they have taken Prozac (= experimental condition); test B while they are drug-free (= control condition). Each subject thus provides two scores (one for each condition). Five do A then B, five do B then A. Repeated Measures t-test, step-by-step:

Number of sheep hit during a 30-minute driving test: Subject: Test A Score Test B Score Difference, D  D = 6

the average difference between scores in our two samples (should be close to zero if there is no difference between the two conditions) the predicted average difference between scores in our two samples (usually zero, since we assume the two samples don’t differ ) estimated standard error of the mean difference (a measure of how much the mean difference might vary from one occasion to the next).

1. Add up the differences:  D = 6 2. Find the mean difference: 3. Estimate of the population standard deviation (the standard deviation of the differences):

4. Estimate of the population standard error (the standard error of the differences between two sample means):

5. Hypothesised difference between the sample means. Our null hypothesis is usually that there is no difference between the two sample means. (In statistical terms, that they have come from two identical populations):  D (hypothesised) = 0 6. Work out t: 7. "Degrees of freedom" (d.f.) are the number of subjects minus one:d.f. = n - 1 = = 9

8. Find the critical value of t from a table (at the back of many statistics books; also on my website). (a) “Two-tailed test”: if we are predicting a difference between tests A and B find the critical value of t for a "two-tailed" test. With 9 d.f., critical value = (b) “One-tailed test”: if we are predicting that A is bigger than B, or A is smaller than B, find the critical value of t for a "one-tailed" test. For 9 d.f., critical value = critical values of t (two-tailed):

If obtained t is bigger or equal to the critical t-value, "reject the null hypothesis" - the difference between our sample means is probably too large to have arisen by chance. Here, obtained t = This is less than There was no significant difference between performance on the two tests; the observed difference is so small, it probably arose by chance. Conclusion: Prozac does not significantly affect driving ability

Results of analysis using SPSS: (Analyze >Compare Means > Paired samples t-test)