Thursday, September 12, 2013 Effect Size, Power, and Exam Review.

Slides:

Advertisements

Similar presentations

Lecture 2: Null Hypothesis Significance Testing Continued Laura McAvinue School of Psychology Trinity College Dublin.

Advertisements

Statistics 101 Class 8. Overview Hypothesis Testing Hypothesis Testing Stating the Research Question Stating the Research Question –Null Hypothesis –Alternative.

Copyright © 2011 by Pearson Education, Inc. All rights reserved Statistics for the Behavioral and Social Sciences: A Brief Course Fifth Edition Arthur.

Confidence Intervals, Effect Size and Power

Statistical Issues in Research Planning and Evaluation

COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.

Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?

HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.

Statistics for the Social Sciences

Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.

Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.

Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~

Statistics for the Social Sciences Psychology 340 Fall 2006 Review For Exam 1.

Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.

1 Statistical Inference Note: Only worry about pages 295 through 299 of Chapter 12.

Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.

Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.

8-2 Basics of Hypothesis Testing

PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.

PSY 307 – Statistics for the Behavioral Sciences

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.

Introduction to Hypothesis Testing

Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.

Statistics for the Social Sciences

AM Recitation 2/10/11.

Hypothesis Testing:.

Overview of Statistical Hypothesis Testing: The z-Test

Lecture Slides Elementary Statistics Twelfth Edition

Overview Definition Hypothesis

Tuesday, September 10, 2013 Introduction to hypothesis testing.

Chapter 8 Introduction to Hypothesis Testing

1/2555 สมศักดิ์ ศิวดำรงพงศ์

Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.

The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.

Chapter 8 Introduction to Hypothesis Testing

Chapter 9: Hypothesis Testing 9.1 Introduction to Hypothesis Testing Hypothesis testing is a tool you use to make decision from data. Something you usually.

LECTURE 19 THURSDAY, 14 April STA 291 Spring

Statistics (cont.) Psych 231: Research Methods in Psychology.

Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.

Chapter 20 Testing hypotheses about proportions

Last Time Today we will learn how to do t-tests using excel, and we will review for Monday’s exam.

1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.

1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.

1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.

1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.

ERRORS: TYPE 1 (  ) TYPE 2 (  ) & POWER Federal funds are to be allocated for financial aid to students for the cost of books. It is believe the mean.

Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.

Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 5 Introduction to Hypothesis.

1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.

6 Making Sense of Statistical Significance.

Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 7 Making Sense of Statistical.

© Copyright McGraw-Hill 2004

© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.

Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.

Statistics (cont.) Psych 231: Research Methods in Psychology.

Hypothesis test flow chart

SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.

Hypothesis Testing and Statistical Significance

Reasoning in Psychology Using Statistics Psychology

Chapter 9 Introduction to the t Statistic

Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.

Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.

Statistics for the Social Sciences

Central Limit Theorem, z-tests, & t-tests

Reasoning in Psychology Using Statistics

Reasoning in Psychology Using Statistics

Reasoning in Psychology Using Statistics

Psych 231: Research Methods in Psychology

Psych 231: Research Methods in Psychology

1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.

Presentation transcript:

Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Last Time:

Which tail? Above or Below? In hypothesis testing, we are interested in extreme values of our test statistics. We are interested in the tails where p-values are very low. Probability is always <.50 in the tail (by definition) For 2-tailed tests, we reject H 0 when we encounter values more extreme than +/-Z (higher than +Z or lower than –Z) For 1-tailed tests, we reject H 0 when we encounter extreme values consistent with H A

One-Tailed

One-tailed vs. 2-tailed One-tailed – if the results are in the direction your theory predicts, then an extreme value of Z (in the tail) will lead you to reject H 0 regardless of whether Z is positive or negative. Two-tailed – It does not matter whether Z is positive or negative. You reject H 0 if the absolute value of Z is sufficiently extreme (in either tail)

Finishing up chapter 8 (some loose ends) Error types (quick review from last time) Effect size: Cohen’s d Statistical Power Analysis

Performing your statistical test Real world (‘truth’) H 0 is correct H 0 is wrong Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error So there is only one distribution Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error So there are two distributions The original (null) distribution The new (treatment) distribution The original (null) distribution

Performing your statistical test Real world (‘truth’) H 0 is correct H 0 is wrong So there is only one distribution Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error So there are two distributions The original (null) distribution The new (treatment) distribution The original (null) distribution

Effect Size Hypothesis test tells us whether the observed difference is probably due to chance or not It does not tell us how big the difference is H 0 is wrong Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error So there are two distributions The original (null) distribution The new (treatment) distribution –Effect size tells us how much the two populations don’t overlap

Effect Size Figuring effect size The original (null) distribution The new (treatment) distribution –Effect size tells us how much the two populations don’t overlap But this is tied to the particular units of measurement

Effect Size The original (null) distribution The new (treatment) distribution Standardized effect size –Effect size tells us how much the two populations don’t overlap –Puts into neutral units for comparison (same logic as z- scores) Cohen’s d

Effect Size The original (null) distribution The new (treatment) distribution Effect size conventions –smalld =.2 –mediumd =.5 –larged =.8 –Effect size tells us how much the two populations don’t overlap

Error types Real world (‘truth’) H 0 is correct H 0 is wrong Experimenter’s conclusions Reject H 0 Fail to Reject H 0 I conclude that there is an effect I can’t detect an effect There really isn’t an effect There really is an effect

Error types Real world (‘truth’) H 0 is correct H 0 is wrong Experimenter’s conclusions Reject H 0 Fail to Reject H 0 Type I error Type II error Type I error (α): concluding that there is a difference between groups (“an effect”) when there really isn’t. Type I error (α): concluding that there is a difference between groups (“an effect”) when there really isn’t. Type II error (β): concluding that there isn’t an effect, when there really is. Type II error (β): concluding that there isn’t an effect, when there really is.

Statistical Power The probability of making a Type II error is related to Statistical Power – Statistical Power: The probability that the study will produce a statistically significant results if the research hypothesis is true (there is an effect) So how do we compute this?

Statistical Power H 0 : is true (is no treatment effect) Real world (‘truth’) Fail to reject H 0 Reject H 0 Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error α = 0.05 The original (null) distribution

Statistical Power H 0 : is false (is a treatment effect) α = 0.05 Reject H 0 Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error The original (null) distribution Real world (‘truth’) The new (treatment) distribution Fail to reject H 0

Statistical Power Fail to reject H 0 Reject H 0 Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error β = probability of a Type II error The new (treatment) distribution H 0 : is false (is a treatment effect) The original (null) distribution Real world (‘truth’) α = 0.05 Failing to Reject H 0, even though there is a treatment effect Failing to Reject H 0, even though there is a treatment effect

Statistical Power Fail to reject H 0 Reject H 0 Real world (‘truth’) H 0 is correct H 0 is wrong Type I error Type II error Power = 1 -  The new (treatment) distribution H 0 : is false (is a treatment effect) The original (null) distribution Real world (‘truth’)  = probability of a Type II error α = 0.05 Failing to Reject H 0, even though there is a treatment effect Failing to Reject H 0, even though there is a treatment effect Probability of (correctly) Rejecting H 0 Probability of (correctly) Rejecting H 0

Statistical Power –α -level – Sample size – Population standard deviation σ – Effect size – 1-tail vs. 2-tailed Factors that affect Power:

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0  Power = 1 - β Factors that affect Power:  -level Change from  = 0.05 to 0.01

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:  -level Change from α = 0.05 to 0.01 α = 0.01

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:α-level Change from α = 0.05 to 0.01 α = 0.01

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 -β Factors that affect Power:α-level Change from α = 0.05 to 0.01 α = 0.01

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:α-level Change from α = 0.05 to 0.01 α = 0.01

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:α-level Change from α = 0.05 to 0.01 α = 0.01 So as the α level gets smaller, so does the Power of the test So as the α level gets smaller, so does the Power of the test

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Sample size Change from n = 25 to 100 Recall that sample size is related to the spread of the distribution

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Sample size Change from n = 25 to 100

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Sample size Change from n = 25 to 100

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Sample size Change from n = 25 to 100

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Sample size Change from n = 25 to 100 As the sample gets bigger, the standard error gets smaller and the Power gets larger

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Population standard deviation Change from σ = 25 to 20 Recall that standard error is related to the spread of the distribution

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Population standard deviation Change from σ = 25 to 20

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Population standard deviation Change from σ = 25 to 20

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Population standard deviation Change from σ = 25 to 20

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power: As the σ gets smaller, the standard error gets smaller and the Power gets larger Population standard deviation Change from σ = 25 to 20

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect μ treatment μ no treatment

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect μ treatment μ no treatment

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 -β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect

Statistical Power α= 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:Effect size Compare a small effect (difference) to a big effect As the effect gets bigger, the Power gets larger

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from α = 0.05 two-tailed to  = 0.05 two- tailed

Statistical Power α = 0.05 Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from  = 0.05 two-tailed to  = 0.05 two-tailed p = 0.025

Statistical Power Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from  = 0.05 two-tailed to α = 0.05 two- tailed p = α = 0.05

Statistical Power Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from  = 0.05 two-tailed to α = 0.05 two- tailed p = α = 0.05

Statistical Power Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from  = 0.05 two-tailed to  = 0.05 two-tailed p = α = 0.05

Statistical Power Fail to reject H 0 Reject H 0 β Power = 1 - β Factors that affect Power:1-tail vs. 2-tailed Change from  = 0.05 two-tailed to α = 0.05 two- tailed p = Two tailed functionally cuts the α-level in half, which decreases the power. p = α = 0.05

Statistical Power Factors that affect Power: – α -level: So as the  level gets smaller, so does the Power of the test – Sample size: As the sample gets bigger, the standard error gets smaller and the Power gets larger – Population standard deviation: As the population standard deviation gets smaller, the standard error gets smaller and the Power gets larger – Effect size: As the effect gets bigger, the Power gets larger – 1-tail vs. 2-tailed: Two tailed functionally cuts the α -level in half, which decreases the power

Why care about Power? Determining your sample size – Using an estimate of effect size, and population standard deviation, you can determine how many participants need to achieve a particular level of power When a result if not statistically significant – Is is because there is no effect, or not enough power When a result is significant – Statistical significance versus practical significance

Ways of Increasing Power

Exam I (Tuesday 9/17) Covers first 8 chapters Closed book (materials will be provided) Bring calculator Emphasis on material covered in class No need to use SPSS during the test

Exam I (Tuesday 9/17) Ways to describe distributions (using tables, graphs, numeric summaries of center and spread) How to enter data into SPSS Z-scores Probability Characteristics of the normal distribution Normal approximation of the binomial distribution Use of the unit normal table Central Limit Theorem & distribution of sample means Standard error of the mean (σ M ) Hypothesis testing – Four steps of hypothesis testing – Error Types – One-Sample Z-test – Effect sizes and Power