Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing making decisions using sample data.
Binomial Distribution & Hypothesis Testing: The Sign Test
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Understanding Statistics in Research
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
8-2 Basics of Hypothesis Testing
BCOR 1020 Business Statistics
PSY 307 – Statistics for the Behavioral Sciences
Using Statistics in Research Psych 231: Research Methods in Psychology.
Probability Population:
Inferential Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Review As sample size increases, the distribution of sample means A.Becomes wider and more normally shaped B.Becomes narrower and more normally shaped.
Inferential Statistics
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Presented by Mohammad Adil Khan
Descriptive statistics Inferential statistics
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Introduction to Biostatistics and Bioinformatics
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 10 Hypothesis Testing
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Individual values of X Frequency How many individuals   Distribution of a population.
Exam 1 Median: 74 Quartiles: 68, 84 Interquartile range: 16 Mean: 74.9 Standard deviation: 12.5 z = -1: 62.4 z = -1: 87.4 z = -1z = +1 Worst Question:
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Chapter 20 Testing hypotheses about proportions
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are M male = 15 and M female = 17. The mean square.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Welcome to MM570 Psychological Statistics
© Copyright McGraw-Hill 2004
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Chapter Ten McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. One-Sample Tests of Hypothesis.
Inferential Statistics Introduction to Hypothesis Testing.
Introduction to Hypothesis Testing: The Binomial Test
Putting Things Together
Hypothesis Testing.
Putting Things Together
Review Ordering company jackets, different men’s and women’s styles, but HR only has database of employee heights. How to divide people so only 5% of.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Three Views of Hypothesis Testing
Introduction to Hypothesis Testing: The Binomial Test
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Presentation transcript:

Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject the null hypothesis because t is smaller than expected by chance Keep the null hypothesis because t is bigger than expected by chance Keep the null hypothesis because t is smaller than expected by chance

Review Your null hypothesis states µ = 50, and your sample has a mean of M = 58. If your t statistic equals 4, what is the standard error of the mean (sM)? Depends on the sample size 0.5 4 2

Review Your null hypothesis predicts the population mean should be µ0 = 100. You measure a sample of 25 people and calculate statistics of M = 94 and s = 10. What is the value of your t statistic? 5 -0.6 -3 2.4

Hypothesis Testing 10/9

Where Am I? Wake up after a rough night in unfamiliar surroundings Still in Boulder? Expected if in Boulder (large likelihood) Couldn’t happen IF in Boulder (likelihood near zero)  Can’t be in Boulder Surprising but not impossible (moderate likelihood)

Steps of Hypothesis Testing State clearly the two hypotheses Determine which is the null hypothesis (H0) and which is the alternative hypothesis (H1) Compute a relevant test statistic from the sample Find the likelihood function of the test statistic according to the null hypothesis Choose alpha level (a): how willing you are to abandon null (usually .05) Find the critical value: cutoff with probability  of being exceeded under H0 Compare the actual result to the critical value Less than critical value  retain null hypothesis Greater than critical value  reject null hypothesis; accept alternative hypothesis

Specifying Hypotheses Both hypotheses are statements about population parameters Null Hypothesis (H0) Always more specific, e.g. 50% chance, mean of 100 Usually the less interesting, "default" explanation Alternative Hypothesis (H1) More interesting – researcher’s goal is usually to support the alternative hypothesis Less precise, e.g. > 50% chance,  > 100

Test Statistic Statistic computed from sample to decide between hypotheses Relevant to hypotheses being tested Based on mean if hypotheses are about means Based on number correct (frequency) if hypotheses are about probability correct Sampling distribution according to null hypothesis must be fully determined Can only depend on data and on values assumed by H0 Often a complex formula with little intuitive meaning Inferential statistic: Only used in testing reliability

Likelihood Function Probability distribution of a statistic according to a hypothesis Gives probability of obtaining any possible result Usually interested in distribution of test statistic according to null hypothesis Same as sampling distribution, assuming the population is accurately described by the hypothesis Test statistic chosen because we know its likelihood function Binomial test: Binomial distribution t-test: t distribution

Critical Value Cutoff for test statistic between retaining and rejecting null hypothesis If test statistic is beyond critical value, null will be rejected Otherwise, null will be retained Before collecting data: What strength of evidence will you require to reject null? How many correct outcomes? How big a difference between M and m0, relative to sM? Critical region Range of values that will lead to rejecting null hypothesis All values beyond critical value Frequency Probability t Probability

Types of Errors Goal: Reject null hypothesis when it’s false; retain it when it’s true Two ways to be wrong Type I Error: Null is correct but you reject it Type II Error: Null is false but you retain it Type I Error rate IF H0 is true, probability of mistakenly rejecting H0 Proportion of false theories we conclude are true E.g., proportion of useless treatments that are deemed effective Logic of hypothesis testing is founded on controlling Type I Error rate Set critical value to give desired Type I Error rate

Alpha Level Choice of acceptable Type I Error rate Usually .05 in psychology Higher  more willing to abandon null hypothesis Lower  require stronger evidence before abandoning null hypothesis Determines critical value Under the sampling distribution of the test statistic according to the null hypothesis, the probability of a result beyond the critical value is  Test Statistic Sampling Distribution from H0 Critical Value a

Doping Analogy Measure athletes' blood for signs of doping Cheaters have high RBCs, but even honest people vary What rule to use? Must set some cutoff, and punish anyone above it Will inevitably punish some innocent people H0 likelihood function is like distribution of innocent athletes’ RBCs Cutoff determines fraction of innocent people that get unfairly punished This fraction is alpha Distribution of Innocent Athletes Don’t Punish Punish RBC

Power H0 H0 H1 H1 Type II Error rate Power IF H0 is false, probability of failing to reject it E.g., fraction of cheaters that don’t get caught Power IF H0 is false, probability of correctly rejecting it Equal to one minus Type II Error rate E.g., fraction of cheaters that get caught Power depends on sample size Choose sample size to give adequate power Researchers must make a guess at effect size to compute power H0 Type I error rate (a) H0 H1 H1 Type II error rate Power

Two-Tailed Tests Sometimes want to detect effects in either direction Drugs that help or drugs that hurt Formalized in alternative hypothesis m < m0 or m > m0 Two critical values, one in each tail Type I error rate is sum from both critical regions Need to divide errors between both tails Each gets a/2 (2.5%) t tcrit -tcrit M m0 Reject H0 a/2

One-Tailed vs. Two-Tailed Tests tcrit One-tailed a Two-tailed a/2 a/2 -tcrit tcrit t

An Alternative View: p-values Reversed approach to hypothesis testing After you collect sample and compute test statistic How big must a be to reject H0 p-value Measure of how consistent data are with H0 Probability of a value equal to or more extreme than what you actually got Large p-value  H0 is a good explanation of the data Small p-value  H0 is a poor explanation of the data p > : Retain null hypothesis p < : Reject null hypothesis; accept alternative hypothesis Researchers generally report p-values, because then reader can choose own alpha level E.g. “p = .03” If willing to allow 5% error rate, then accept result as reliable If more stringent, say 1% (a = .01), then remain skeptical tcrit for a = .05 tcrit for a = .03 tcrit for a = .01 t t t = 2.15  p = .03

Review Later this semester, we’ll learn about hypothesis tests for distributions of nominal variables. For example, we’ll poll everyone on their favorite colors and count the frequency for each color. What would be a reasonable null hypothesis? Each color is chosen by the same number of people in the class Each color would be chosen by the same number of people in the population Some colors are more popular than others among people in this class Some colors are more popular than others among the population

Review If you run a 1-tailed t-test with a sample size of n = 10 and a = .05, the critical value is tcrit = 1.81. Now imagine you ran the same test, but 2-tailed. Which of the following are the new critical values? (You should be able to rule out all wrong answers.) 1.21, 2.41 -2.01, 2.45 -2.23, 2.23 -1.67, 1.67

Review You run a t-test and get a result of t = 0.56 and p = .32. If your chosen alpha level was 5%, what do you conclude? Retain the null hypothesis, because p > a Reject the null hypothesis, because p > a Retain the null hypothesis, because p < t Reject the null hypothesis, because p < t