1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Inference Sampling distributions Hypothesis testing.
Introduction to Statistics
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
1 SOC 3811 Basic Social Statistics. 2 Announcements  Assignment 2 Revisions (interpretation of measures of central tendency and dispersion) — due next.
Chapter Sampling Distributions and Hypothesis Testing.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Inference about a Mean Part II
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Richard M. Jacobs, OSA, Ph.D.
Testing Hypotheses.
AM Recitation 2/10/11.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Descriptive statistics Inferential statistics
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
14. Introduction to inference
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Single Sample Inferences
From last lecture (Sampling Distribution): –The first important bit we need to know about sampling distribution is…? –What is the mean of the sampling.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 8 Introduction to Hypothesis Testing
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
PSY 307 – Statistics for the Behavioral Sciences Chapter 9 – Sampling Distribution of the Mean.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Inferential Statistics Introduction to Hypothesis Testing.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
INTRODUCTION TO HYPOTHESIS TESTING
Confidence Intervals.
Testing Hypotheses I Lesson 9.
Presentation transcript:

1 Statistical Inference Greg C Elvers

2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample may not be representative of the population Inferential statistics allow us to decide if our sample results are probably true for the population Inferential statistics also allow us to decide if a treatment probably had an effect

3 Point Estimates One of our fundamental questions is: “How well does our sample statistic estimate the value of the population parameter?” Equivalently, we may ask “Is our point estimate good?” A point estimate is a statistic (e.g. X) that is calculated from sample data in order to estimate the value of the population parameter (e.g.  )

4 Point Estimates What makes a point estimate “good”? First, we must define “good” A good estimate is one that is close to the actual value What statistic is used to calculate how close a value is to another? A difference score, or deviate score (X -  ) What statistic should we use to measure the average “goodness?” Standard deviation

5 Sampling Distribution Draw a sample from the population Calculate the point estimate Repeat the previous two steps many times Draw a frequency distribution of the point estimates That distribution is called a sampling distribution

6 Standard Error of the Mean The standard error of the mean is the standard deviation of the sampling distribution Thus, it is measure of how good our point estimate is likely to be The symbol s X represents the standard error of the mean

7 Which Sampling Distribution Is Better? Which sampling distribution is better? Why?

8 Factors Influencing s X What influences the size of the standard error of the mean? That is, what can you do to make the sample mean closer to the population mean (on average)? Increase sample size! A sample mean based on a single observation will not be as accurate as a sample mean based on 10 or 100 observations

9 Standard Error of the Mean The standard error of the mean can be estimated from the standard deviation of the sample:

10 Central Limit Theorem The central limit theorem states that the shape of a sampling distribution will be normal (or Gaussian) as long as the sample size is sufficiently large The mean of the sampling distribution will equal the mean of the population The standard deviation of the sampling distribution (I.e. the standard error of the mean) will equal the standard deviation of the samples divided by the  n

11 Confidence Intervals How confident are we in our point estimate of the population mean? The population mean almost always is larger or smaller than the sample mean Given the sample mean and standard deviation, we can infer an interval, or range of scores, that probably contain the population mean This interval is called the confidence interval

12 Confidence Intervals Because of the central limit theorem, the sampling distribution of means is normally distributed, as long as the sample size is sufficiently large We can use the table of areas under the normal curve to find a range of numbers that probably contain the population mean

13 Confidence Intervals The area under the normal curve between z- scores of -1 and +1 is.68 Thus, the 68% confidence interval is given by X ± 1 standard deviation of the sampling distribution E.g., X = 4.32 s X =.57, n = 32 X ± z x s X /  n /  32 to /  to 4.42

14 Confidence Intervals The area under the normal curve between z- scores of and is.95 Thus, the 95% confidence interval is given by X ± 1.96 standard deviation of the sampling distribution E.g., X = 4.32 s X =.57, n = 32 X ± z X s X /  n X.57 /  32 to X.57 /  to 4.52

15 Hypothesis Testing Hypothesis testing is the procedure by which we infer if two (or more) groups are different from each other The first step is to write the statistical hypotheses which are expressed in precise mathematical terms The statistical hypotheses always come in pairs -- the null hypothesis and the alternative hypothesis

16 H 0 : The Null Hypothesis The null hypothesis usually takes the following form: H 0 :  1 =  2 This is read as: “The null hypothesis is that the mean of condition one equals the mean of condition two” Notice that the null hypothesis always deals with population parameters and not the sample statistics

17 H0H0 The null hypothesis must contain an equal sign of some sort (=, ,  ) Statistical tests are designed to reject H 0, never to accept it

18 H 1 : The Alternative Hypothesis The alternative hypothesis usually takes the following form: H 1 :  1   2 This is read as: “The alternative hypothesis states that the mean of condition one does not equal the mean of condition two” As is true for the null, the alternative hypothesis deals with the population parameter and not the sample statistic

19 H 0 and H 1 Together, the null and alternative hypotheses must be mutually exclusive and exhaustive Mutual exclusion implies that H 0 and H 1 cannot both be true at the same time Exhaustive implies that each of the possible outcomes of the experiment must make either H 0 or H 1 true

20 Directional vs Non-Directional Hypotheses The hypotheses we have been talking about are called non-directional hypotheses because they do not specify how the means should differ That is, they do not say that the mean of condition 1 should be larger than the mean of condition 2 They only state that the means should differ Non-directional hypotheses are sometimes called two-tailed tests

21 Directional vs Non-Diretional Hypotheses Directional hypotheses include an ordinal relation between the means That is, they state that one mean should be larger than the other mean For directional hypotheses, the H 0 and H 1 are written as: H 0 :  1   2 H 1 :  1 >  2 Directional hypotheses are sometimes called one-tailed tests

22 Converting Word Hypotheses into Statistical Hypotheses Convert the following hypothesis into statistical hypotheses: Frequently occurring words are easier to recall than words that occur infrequently Is this hypothesis directional or non- directional? Directional

23 Converting Word Hypotheses into Statistical Hypotheses Write the relation that we hope to demonstrate. This will be the alternative hypothesis: H 1 :  frequent >  infrequent Write a hypothesis that covers all possibilities that are not covered by the alternative hypothesis. This will be H 0 : H 0 :  frequent   infrequent

24 Converting Word Hypotheses into Statistical Hypotheses Convert the following hypotheses into statistical hypotheses: People who eat breakfast will run a race faster or slower than those who do not eat breakfast People who own cats will live longer than those who do not own cats People who earn an A in statistics are more likely to be admitted to graduate school than those who do not earn an A

25 Inferential Reasoning Statistical inference can never tell us if two means are equal; it can only tell us if the two means are not equal Why? Statistical inference never proves that two means are not equal; it only tells us if they probably are not equal

26 Inferential Reasoning If two sample means are different from each other, does that imply that the null hypothesis is false? NO! Why? Sample means are point estimates of the population mean; thus, they are not precise predictors of the population and they change from sample to sample

27 Inferential Reasoning How different do two sample means need to be before we are willing to state that the population means are probably different? The answer depends on the distribution of sampling means The more variable the sampling distribution is, the more different the sample means need to be

28 Inferential Reasoning The answer also depends on how willing you are to make an error and incorrectly reject H 0 when, in fact, H 0 is true The less willing you are to make such an error, then the larger the difference needs to be This type of error is called a Type-I or an  error

29  The Type-I or  error occurs when you reject H 0 when in fact H 0 is true We are free to decide how likely we want to be in making an  error The probability of making an  error is given by  Psychologists usually set  to either.05 or.01

30 Inferential Reasoning At some point, the sample means are sufficiently different from each other that we are comfortable in concluding that the population means are probably different That is, an inferential statistic has told us that the probability of making an  error is less than the  value that we arbitrarily selected

31 Inferential Reasoning When we decide that H 0 is probably not true, we reject H 0 If H 0 is not tenable, then H 1 is the only remaining alternative Technically, we never accept H 1 as true; we only reject H 0 as being likely

32 Inferential Reasoning We never accept H 0 as true either We only fail to reject H 0 It is always possible that the population means are different, but that the sample means are not sufficiently different

33  Error (Type-II Error) A second type of error can occur in statistical inference A  error or Type-II error occurs when we fail to reject H 0 when H 0 really is false

34 Type-I and Type-II Errors Ideally, we would like to minimize both Type-I and Type-II errors This is not possible for a given sample size When we lower the  level to minimize the probability of making a Type-I error, the  level will rise When we lower the  level to minimize the probability of making a Type-II error, the  level will rise

35 Type-I and Type-II Errors Probability of rejecting H 0 when H 0 is true Probability of failing to reject H 0 when H 0 is false

36