T-test. Tests ParametricNon-parametric between / independent / unrelated Independent t- test Mann-Whitney U within / dependent / related / repeated measures.

Slides:



Advertisements
Similar presentations
Chapter 16 Introduction to Nonparametric Statistics
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
Lecture 6 Outline – Thur. Jan. 29
Objective: To test claims about inferences for two sample means from matched-pair tests or blocked samples, under specific conditions.
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
IB Psychology Internal Assessment Higher Level 20% of final grade
statistics NONPARAMETRIC TEST
INDEPENDENT SAMPLES T Purpose: Test whether two means are significantly different Design: between subjects scores are unpaired between groups.
T-Tests.
t-Tests Overview of t-Tests How a t-Test Works How a t-Test Works Single-Sample t Single-Sample t Independent Samples t Independent Samples t Paired.
T-Tests.
QUANTITATIVE DATA ANALYSIS
What z-scores represent
The t-test:. Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means.
Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study –Two sample t-test –Confidence Intervals Testing.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Non-parametric equivalents to the t-test Sam Cromie.
Final Review Session.
Lecture 9: One Way ANOVA Between Subjects
Non-parametric tests, part B:. Non-parametric tests for comparing three or more groups or conditions: (a) Kruskal-Wallis test: Similar to the Mann-Whitney.
Data measurement, probability and statistical tests
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Non-parametric statistics
The Research Skills exam: The four horsemen of the apocalypse: pestilence, war, famine and the RS1 exam.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Comparing Means: T-Test PSYC 301 /SPSS SPRING 2014.
ABOUT TWO DEPENDENT POPULATIONS
Measures of Central Tendency
Nonparametric or Distribution-free Tests
Chapter 9 Comparing Means
Basic Statistics Michael Hylin. Scientific Method Start w/ a question Gather information and resources (observe) Form hypothesis Perform experiment and.
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Statistics in psychology Describing and analyzing the data.
Comparing Means: t-tests Wednesday 22 February 2012/ Thursday 23 February 2012.
Special Topics 504: Practical Methods in Analyzing Animal Science Experiments The course is: Designed to help familiarize you with the most common methods.
T-TEST Statistics The t test is used to compare to groups to answer the differential research questions. Its values determines the difference by comparing.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Ordinally Scale Variables
Last Time Today we will learn how to do t-tests using excel, and we will review for Monday’s exam.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Experimental Design and Statistics. Scientific Method
Experimental Psychology PSY 433 Appendix B Statistics.
Section 10.1 Estimating with Confidence AP Statistics February 11 th, 2011.
Research Methods: 2 M.Sc. Physiotherapy/Podiatry/Pain Inferential Statistics.
The Critical Period for Language Acquisition: Evidence from Second Language Learning CATHERINE E. SNOW AND MARIAN HOEFNAGEL-HÖHLE UNIVERSITY OF AMSTERDAM.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
ONE-WAY BETWEEN-GROUPS ANOVA Psyc 301-SPSS Spring 2014.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Stats/Methods II JEOPARDY. Jeopardy Chi-Square Single-Factor Designs Factorial Designs Ordinal Data Surprise $100 $200$200 $300 $500 $400 $300 $400 $300.
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Statistics Statistics Data measurement, probability and statistical tests.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
 Kolmogor-Smirnov test  Mann-Whitney U test  Wilcoxon test  Kruskal-Wallis  Friedman test  Cochran Q test.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Chapter 11: Test for Comparing Group Means: Part I.
Non-parametric Tests Research II MSW PT Class 8. Key Terms Power of a test refers to the probability of rejecting a false null hypothesis (or detect a.
Dr Hidayathulla Shaikh. Objectives At the end of the lecture student should be able to – Discuss normal curve Classify parametric and non parametric tests.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 7 l Hypothesis Tests 7.1 Developing Null and Alternative Hypotheses 7.2 Type I & Type.
Effects of Word Concreteness and Spacing on EFL Vocabulary Acquisition 吴翼飞 (南京工业大学,外国语言文学学院,江苏 南京211816) Introduction Vocabulary acquisition is of great.
Statistics in psychology
Some statistics questions answered:
InferentIal StatIstIcs
Inferential testing.
PSYCHOLOGY AND STATISTICS
Presentation transcript:

t-test

Tests ParametricNon-parametric between / independent / unrelated Independent t- test Mann-Whitney U within / dependent / related / repeated measures Paired t-testWilcoxon

Exercise 24 people were involved in an experiment to determine whether background noise (e.g. music) affects short-term memory (recall of words). Half of the sample was randomly allocated to the NOISE condition, and half to the NO NOISE condition. The participants in the NOISE condition tried to memorize a list of 20 words in two minutes, while listening to pre-recorded noise through earphones. The other participants wore earphones but heard no noise as they attempted to memorize the words. Immediately after this, they were tested to see how many words they recalled. Table 1 shows the raw results.

NOISE (group 1)NO NOISE (group 2)  =87 X=7.3 SD=2.5  =166 X=13.8 SD=2.8

NOISE (group 1)NO NOISE (group 2) Between group variance (difference between M1 and M2) Within group variance

Effect size 7.3 – 13.8 = –6.5 Mean SD = 2 SD 1 + SD = 2.65 = d = – 13.8 = 2.45 Mean SD M1 – M2

Effect size dPercentage of overlap Small Medium Large

Interval data For small samples (N < 15) the data must be normally distributed. Homogeneity-of-variance (Levene’s test ) Preconditions for the t-test

Exercise The word that is ambiguous. Among other things, it can be a demonstrative (e.g. That’s my car) or a complementizer (e.g. I regret that I didn’t go). The two categories tend to occur in different contexts. At the beginning of a sentence, that tends to be a demonstrative and is only rarely a complementizer, but after verbs that is usually a complementizer and only rarely a demonstrative. A psycholinguist wants to know if the different frequencies of the demonstrative and complementizer affect the interpretation of that in different contexts. In order to test this hypothesis, he measures the reading times (i.e. the time it takes to move from one word to another while reading a sentence) of the complementizer and the

demonstrative after verbs that frequently occur with sentential complements but may also occur with an NP including a demonstrative (e.g. find, know, regret). (1)Peter knows that she was coming. (2)Peter knows that guy. Since the complementizer is more frequent in this context than the demonstrative, it is reasonable to assume that the complementizer has shorter reading times than the demonstrative. Twenty subjects were tested: 10 subjects listened to sentences in which the verbs were followed by a that-clause, and 10 subjects listened to sentences that were followed by an NP including a that-determiner. Table 1 shows the reading times.

Group 1That-NPsGroup 2That-clause

Group 1That-NPsGroup 2That-clause

Mann-Whitney U-Test When children begin to speak, there is often great variation in the pronunciation of particular speech sounds. A researcher wants to find out if a two-year old child pronounces /g/ and /k/ differently, or if the two speech sounds are basically pronounced in the same way at this age. In adult language, /g/ and /k/ are primarily distinguished by voice onset time VOT. In order to decide if two-year old children pronounce /g/ and /k/ differently, the researcher collects a pitiful corpus of 13 words, six words including /g/ in adult language and seven words including /k/ in adult language. The words were selected such that /g/ and /k/ are surrounded by the same speech sounds (i.e. they occur in the same phonetic environment). For each word, the researcher measured the voice onset time in milliseconds.

Speech soundVOT in mscRanks /g/ /k/

Wilcoxon Nurses were asked to rate their sympathy on a scale between 1 and 10 for MS patients before and after discussing with these patients. Table 1 shows the nurses’ sympathy scores before and after the discussion.

Before discussionAfter discussion Mean: 4.8 SD: 1.48 Median: 5 Mean: 6.5 SD: 1.58 Median: 7

Difference between scores from individual subjects –2 0 –1 –4 –1 +1 –4 –3 0 –3 7 minuses, 1 plus

Difference between scores from individual subjects Ranking of the scores in the left-hand column –2 0 –1 –4 –1 +1 –4 –3 0 –3 4 – – minuses, 1 plus

Interval + ordinal data In order to find out if the is a difference in the acquisition of subject- and object-relative clauses, a reseacher designed a repetition experiment, in which children had to repeat four different instances of each type of relative clause. The responses were assigned a score: 1 = correct, 0.5 = minor error, 0 = false. 0.0 – 0.5 – 1.0 – 1.5 – 2.0 – 2.5 – 3.0 – 3.5 – – 0 ­– 0.5 – 1 = 2.5/4 = (mean)

Exercise 1. Determine if the A and P relatives are normally distributed in the Diessel&Tomasello study. 2.Determine if the children in that study performed differently on A and P relative clauses.

One-sample t-test Previous research has shown that English-speaking children have an MLU of 301 at age 3;2. A researcher wants to know if SLI children (i.e. children with a specific language impairment) have a lower (or higher MLU) at this age. We know that SLI children have difficulties in processing morphological units, but it is unclear, if their MLUs are lower than in normally developing children. In order to test this hypothesis, the researcher collected data from 24 SLI children aged 3;1 to 3;3 and determined the MLU for each child.

ChildMLU ,7 3,0 2,8 2,9 3,1 3,0 3,1 2,5 3,2 3,1 2,9 2,8 3,1 3,2 2,4 2,3 2,8 3,1 2,5 2,7 2,9 3,0