Comparing Means From Two Sets of Data

Slides:

Advertisements

Similar presentations

Introduction to Hypothesis Testing

Advertisements

Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.

PTP 560 Research Methods Week 9 Thomas Ruediger, PT.

Statistical Issues in Research Planning and Evaluation

Statistical Decision Making

Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.

Chapter Seventeen HYPOTHESIS TESTING

PSY 307 – Statistics for the Behavioral Sciences

Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.

Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~

Understanding Research Results. Effect Size Effect Size – strength of relationship & magnitude of effect Effect size r = √ (t2/(t2+df))

Lecture 9: One Way ANOVA Between Subjects

Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.

Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.

UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.

PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.

© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.

Today Concepts underlying inferential statistics

Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,

Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.

The t Tests Independent Samples.

Chapter 14 Inferential Data Analysis

Richard M. Jacobs, OSA, Ph.D.

Inferential Statistics

Chapter 12 Inferential Statistics Gay, Mills, and Airasian

AM Recitation 2/10/11.

Overview of Statistical Hypothesis Testing: The z-Test

Inferential Statistics & Test of Significance

Chapter 8 Introduction to Hypothesis Testing

Statistical Power The ability to find a difference when one really exists.

The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.

T tests comparing two means t tests comparing two means.

RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.

Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.

Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.

Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.

Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.

Step 3 of the Data Analysis Plan Confirm what the data reveal: Inferential statistics All this information is in Chapters 11 & 12 of text.

Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.

Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.

Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.

Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.

1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.

Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.

Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.

METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.

KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.

Chapter 10 The t Test for Two Independent Samples

Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.

Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,

PART 2 SPSS (the Statistical Package for the Social Sciences)

Chapter 13 Understanding research results: statistical inference.

HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.

Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,

Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.

Inferential Statistics Psych 231: Research Methods in Psychology.

Chapter 10: The t Test For Two Independent Samples.

Chapter 9 Introduction to the t Statistic

Dependent-Samples t-Test

The ability to find a difference when one really exists.

Two-Sample Hypothesis Testing

Inference and Tests of Hypotheses

Hypothesis Testing: Hypotheses

Kin 304 Inferential Statistics

Issues in Inferential Statistics

UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE

What are their purposes? What kinds?

Inferential Statistics & Test of Significance

Psych 231: Research Methods in Psychology

Presentation transcript:

Comparing Means From Two Sets of Data t-Test Comparing Means From Two Sets of Data

Steps For Comparing Groups

Assumptions of t-Test Dependent variables are interval or ratio. The population from which samples are drawn is normally distributed. Samples are randomly selected. The groups have equal variance (Homogeneity of variance). The t-statistic is robust (it is reasonably reliable even if assumptions are not fully met.

Computing Confidence Intervals We can determine the probability that a population mean lies between certain limits using a sample mean. With inferential statistics we reverse this process and determine the probability that a random sample drawn from a specific population would differ by an observed result.

t Values Critical value decreases if N is increased. Critical value decreases if alpha is increased. Differences between the means will not have to be as large to find sig if N is large or alpha is increased.

Probability that a sample came from a population? Using the standard error we compute the probability that two means come from the same population. If Z or t exceed the level of significance we conclude that the sample was Not drawn from the population or Has been modified so that it no longer represents the population

Relationship between t Statistic and Power To increase power: Increase the difference between the means. Reduce the variance Increase N Increase α from α = .01 to α = .05

Does Volleyball Serve Training Improve Serving Ability? Population mean = 31, sd = 7.5. 30 students given serve training. Following training mean = 35, sd = 8.3. Critical Z = 1.96 Probability is greater than 99 to 1 that the mean did not come from original population. The training was effective.

Volleyball Example Using t-statistic Critical value of t(29)= 2.045, p = 0.05 Since obtained t > critical value these means are statistical different.

Comparing Two Independent Samples Independent samples (males, females), (swimmers, runners). Must be different subjects in each group.

If the t statistic is greater than the critical value we Independent t Test If the t statistic is greater than the critical value we Conclude the independent variable had a significant effect And we reject chance as the cause of the mean difference.

Effects of Verbal Lesson of Basketball Shooting Skill Critical value of t(120) = 1.98, p = 0.05 Since our obtained t(98) = -1.36 is NOT greater than the critical value we ACCEPT the Null Hypothesis. The training had no effect upon shooting skill. Note: The sign +/- of t does not matter.

Does Positive Reinforcement Affect Bowling? Critical value t(40) = 2.201, p = 0.05 Since obtained t > critical t We reject the Null and state that positive reinforcement significantly improves bowling ability.

Summary Table for Effects of Praise on Bowling

The t-test With Unequal N When you have unequal numbers of subjects in each group the statistic uses a different equation to estimate the standard error of the differences between groups.

The t-test With Unequal N Critical value of t(16) = 2.120, p = .05. The groups are significantly different.

Dependent or Paired t-test Note that the equation uses the correlation between pre and post samples. The Dependent t-test is more powerful that the Independent Groups t-test.

Dependent or Paired t-test The same subjects are in each group (DEPENDENT or PAIRED t-test). Critical value t(29) = 2.045, p = 0.05 The groups ARE SIGNIFICANTLY Different. Note: the correction formula adjusts the variance between groups. Since the same subjects are in each group you can expect less variance. Repeated Measures experiments are more powerful than independent groups

Does a Bicycle Tour Affect Self-Esteem Does a Bicycle Tour Affect Self-Esteem? Are these differences MEANINGFUL???? Critical value of t(60) = 2.000, p = 0.05, so there is a significant difference. BUT DOES IT MEAN ANYTHING???

The Magnitude of the Difference (Size of Effect) Omega squared can be used to determine the importance, or usefulness of the mean difference. ω2 is the percentage of the variance (diff between means) that can be explained by the independent variable. In this case the low-back and hip study explains 21% of variance between the means (pre & post).

Effect size of .2 is small, .5 moderate, .8 large Cohen’s Effect Size Effect size of .2 is small, .5 moderate, .8 large The control group is used to compute SD because it is not contaminated by the treatment effect.

The Percent Change is also useful in evaluating if a change is meaningful. Before doing an experiment you should know what Percent Change would be considered meaningful. For an Olympic athlete, a 1% (meaningful) improvement can be the difference between winning and losing. For an untrained individual a 1% improvement would probably be meaningless.

Practical & Meaningful Significance If two means are significantly different, that does not imply that they are practical. If two means are NOT statistically significant, that does not imply that their differences are not practical. Use ω2, Effect Size and Percent Change to evaluate the meaningfulness of an outcome.

Type I and Type II Errors Type I Error: Stating that there is a difference when there isn’t. Type II Error: Stating there is no difference when there is one.

We can never know if we have made a Type I or II error. Statistics only provide the probability of making a Type I or II error. The critical factor in this decision is the consequence of being wrong. The confidence level should be set to protect against the most costly error. Which is worse: to accept the null hypothesis when it is really false or to reject it when it is really true?

Two Tailed Test: Null No Difference.

One Tail Test: Null A > B. More Powerful, easier to find differences.

Power: the ability to detect differences if they exist.

Power ( 1 - β ) depends upon: Alpha [Zα (.10) = 1.65, Zα (.05) = 1.96] Statistical Power Power ( 1 - β ) depends upon: Alpha [Zα (.10) = 1.65, Zα (.05) = 1.96] Difference between the means. Standard deviations between the two groups. Sample size N.

To Increase Power Increase alpha, Power for α = .10 is greater than power for α = .05 Increase the difference between means. Decrease the sd’s of the groups. Increase N.

In this example Power (1 - β ) = 70.5% Calculation of Power From Table A.1 Zβ of .54 is 20.5% Power is 20.5% + 50% = 70.5% In this example Power (1 - β ) = 70.5%

Calculation of Sample Size to Produce a Given Power Compute Sample Size N for a Power of .80 at p = 0.05 The area of Zβ must be 30% (50% + 30% = 80%) From Table A.1 Zβ = .84 If the Mean Difference is 5 and SD is 6 then 22.6 subjects would be required to have a power of .80

Calculation of Sample Sized Need to Obtain a Desired Level of Power PSD 30 Newtons Alpha 1.96 this is p=.05 Beta 80 0.84 these are beta values 90 1.28 95 1.645 Power Stdev 16 21 26 These values in red are the N needed based on your PSD. 20 7 9 12 10 2 3 The boxed values are values you must input, based on previous literature. PSD = Practical Significant Difference

Power Research performed with insufficient power may result in a Type II error, Or waste time and money on a study that has little chance of rejecting the null. In power calculation, the values for mean and sd are usually not known beforehand. Either do a PILOT study or use prior research on similar subjects to estimate the mean and sd.

Independent t-Test For an Independent t-Test you need a grouping variable to define the groups. In this case the variable Group is defined as 1 = Active 2 = Passive Use value labels in SPSS

Independent t-Test: Defining Variables Be sure to enter value labels. Grouping variable GROUP, the level of measurement is Nominal.

Independent t-Test

Independent t-Test: Independent & Dependent Variables

Independent t-Test: Define Groups

Independent t-Test: Options

Independent t-Test: Output Assumptions: Groups have equal variance [F = .513, p =.483, YOU DO NOT WANT THIS TO BE SIGNIFICANT. The groups have equal variance, you have not violated an assumption of t-statistic. Are the groups different? t(18) = .511, p = .615 NO DIFFERENCE 2.28 is not different from 1.96

Dependent or Paired t-Test: Define Variables

Dependent or Paired t-Test: Select Paired-Samples

Dependent or Paired t-Test: Select Variables

Dependent or Paired t-Test: Options

Dependent or Paired t-Test: Output Is there a difference between pre & post? t(9) = -4.881, p = .001 Yes, 4.7 is significantly different from 6.2