Single Sample Inferences

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Comparing One Sample to its Population
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
1 Matched Samples The paired t test. 2 Sometimes in a statistical setting we will have information about the same person at different points in time.
PSY 307 – Statistics for the Behavioral Sciences
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
T-Tests Lecture: Nov. 6, 2002.
Chapter 9 Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
The one sample t-test November 14, From Z to t… In a Z test, you compare your sample to a known population, with a known mean and standard deviation.
PSY 307 – Statistics for the Behavioral Sciences
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
Hypothesis Testing Using The One-Sample t-Test
Getting Started with Hypothesis Testing The Single Sample.
Probability Population:
Chapter 9: Introduction to the t statistic
PSY 307 – Statistics for the Behavioral Sciences
1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing:.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Chapter Eleven Inferential Tests of Significance I: t tests – Analyzing Experiments with Two Groups PowerPoint Presentation created by Dr. Susan R. Burns.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Statistical Analysis Statistical Analysis
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Statistics and Quantitative Analysis U4320
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
1 Statistical Inference. 2 The larger the sample size (n) the more confident you can be that your sample mean is a good representation of the population.
Statistics and Research methods Wiskunde voor HMI Bijeenkomst 3 Relating statistics and experimental design.
Mid-Term Review Final Review Statistical for Business (1)(2)
Hypothesis Testing Using the Two-Sample t-Test
Ordinally Scale Variables
Chapter 6 USING PROBABILITY TO MAKE DECISIONS ABOUT DATA.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Psych 230 Psychological Measurement and Statistics
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
_ z = X -  XX - Wow! We can use the z-distribution to test a hypothesis.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Chapter 13 Understanding research results: statistical inference.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Testing and Statistical Significance
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Inferential Statistics Introduction to Hypothesis Testing.
Chapter 9 Introduction to the t Statistic
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing.
Monday, October 19 Hypothesis testing using the normal Z-distribution.
Reasoning in Psychology Using Statistics
Statistical Inference about Regression
Reasoning in Psychology Using Statistics
Hypothesis Testing in the Real World
Presentation transcript:

Single Sample Inferences Greg C Elvers

Single Samples A single sample implies that you have collected data from one group of people or objects You have not collected data from a comparison, or control, group Rather, you will compare your data to pre-existing data, perhaps from the census or other archive

Single Sample Inferences The basic questions that is asked is: Is the sample mean different from a pre-existing population mean? E.g. Is the mean IQ of the students in this class different from the mean IQ of people in general? E.g. Is the mean number of cigarettes smoked per hour higher in a sample of people with schizophrenia than in the population in general?

Steps To Follow When trying to answer these types of questions, there are several steps that you should follow: Write the null and alternative hypotheses Are they one or two tailed? Specify the a level (usually .05) Calculate the appropriate test statistic Determine the critical value from a table Decide whether to reject H0 or not

Write the Hypotheses Lets consider the first example: The mean IQ of the people in a statistics class is 103. Is this value different from the population mean (100)? H0: m = 100 H1: m ¹ 100

1 vs 2 Tailed? The hypothesis does not state whether the sample mean should be larger (or smaller) than the population mean It only states that the sample mean should be different from the population mean Thus, this should be a two-tailed test

Specify the a Level The a level is the probability of making a Type-I error The a level specifies how willing we are to reject H0 when in fact H0 is true While a can take on any value between 0 and 1 inclusive, psychologists usually adopt an a level of either .05, .01, or .001 .05 is the most common a = .05

Calculate the Appropriate Test Statistic First, you must decide what the appropriate test statistic is If the mean and standard deviation of the population are known, and the sampling distribution is normally distributed, then the appropriate test statistic is the z-score for the sampling distribution

Calculate the Appropriate Test Statistic IQs are normally distributed with a mean of 100, and a standard deviation of 15 Thus, we are safe in using the z-score of the sampling distribution

z-scores of the Sampling Distribution The standard error of the mean is given by the population standard deviation (s = 15) divided by the square root of the sample size (n = 225) sX = 15 / Ö225 = 1

z-scores of the Sampling Distribution The z-score is the difference of the sample and population means divided by the standard error of the mean z = (103 - 100) / 1 z = 3

Determine the Critical Value There are two ways of determining the critical value One way is used when calculating the statistic by hand The other way is used when calculating the statistic with statistical software such as SPSS, SAS, or BMDP

Determining the Critical Value by Hand (One-Tailed) When determining the critical value by hand, you want to determine the z-score beyond which is your a level The diagram to the right shows this for a one-tailed test Area beyond = a

Determining the Critical Value by Hand (One-Tailed) In this example, find the z-score whose area above the z-score equals a (.05) Consult the table of areas under the unit normal curve A z-score of 1.65 has an area of .05 above it; 1.65 is our critical z Area beyond = a

Determining the Critical Value by Hand (Two-Tailed) When determining the critical value by hand, you want to determine the z-score beyond which is your a level The diagram to the right shows this for a two-tailed test Area beyond = a / 2

Determining the Critical Value by Hand In this example, find the z-score whose area above the z-score equals .5 X a (.025) Consult the table of areas under the unit normal curve A z-score of 1.96 has an area of .025 above it; 1.96 is our critical z Area beyond = a / 2

Decide Whether to Reject H0 When the absolute value of the observed z is larger than the critical z, you can reject H0 That is, when | observed | > critical, the sample is different from the population This is the 2-tailed rule; 1-tailed rule is slightly different Observed z = 3 Critical z (two tailed) = 1.96 Reject H0 The sample is probably different from the population

Deciding To Reject H0 when Using the Computer When you use SPSS or similar software, the program will print the observed statistic and the probability that of observing a sample that large due to chance The probability is called the p value When the p value is less than or equal to a, you can reject H0 Thus, the sample is probably different from the population

Problem A professor gives a test in statistics. Based on the 81 students who took the test, the class average on the test is 75. From students who took the test in previous classes, the professor knows that the mean grade is 80 with a standard deviation of 27. Is the current class performing more poorly than the average?

Student’s t Test When the population mean and / or standard deviation are not known, a different inferential statistical procedure should be used: Student’s t test Student’s t, or just t, test is, conceptually, very similar to the z-score test we have been using The t test is used to determine if a sample is different from the population

The t Test When the standard deviation of the population is not known, as is usually the case, we must estimate the standard deviation of the population We use the standard deviation of the sample to estimate the population standard deviation:

Sample and Population Standard Deviations The sample standard deviation consistently underestimates the value of the population standard deviation It is biased An unbiased estimate of the population standard deviation is given by:

Sample and Population Standard Deviations Even the unbiased estimate of the population standard deviation will be inexact when the sample size is small (< 30) The smaller the sample size is, the less precise the unbiased estimate of the population standard deviation will be Because of this imprecision, it is inappropriate to use the normal distribution

The t Distributions William Gossett created a series of distributions known as the t distributions The t distributions are similar to the unit normal distributions, but account for the imprecision in the estimation of the population mean Because the imprecision in the estimate depends on sample size, there are multiple t distributions, depending on the degrees of freedom

Degrees of Freedom Degrees of freedom correspond to the number of scores that are free to take on any value after restrictions are placed on the set of data E.g., if the mean of 5 data points is 0, then how many data points can take on any value and still have the mean equal 0?

Degrees of Freedom 4 of the 5 numbers can take on any value But the fifth number must equal -1 times the sum of the other four for the mean to equal 0 Thus n - 1 scores are free to vary In this case df = n - 1

The t Test The t test is used to decide if a sample is different from a population when the population standard deviation is unknown

Example On average, do people with schizophrenia smoke more cigarettes (X = 9 per day) than the population (m0 = 6 per day) Step 1: Write the hypotheses: H0: m £ 6 H1: m > 6

1 vs 2 Tailed? What is a? The hypothesis asks if people with schizophrenia smoke more cigarettes than average; thus we have a 1 tailed test We will adopt our standard a level, .05 a is the probability of making a Type - I error a is the probability of rejecting H0 when H0 is true

Calculate the Appropriate Test Statistic Because we do not know the population standard deviation, we will estimate it from the sample standard deviation s = 5, n = 10

Calculate the Appropriate Test Statistic Plug and chug the t test value Determine the degrees of freedom: df = n - 1 = 10 - 1 = 9

Determine the Critical Value To determine the critical t value, consult a table of critical t values Find the column that is labeled with your a level Make sure you select the right number of tails (1 vs 2) Find the row that is labeled with your degrees of freedom The critical t value is at the intersection

Determine the Critical Value If the table does not contain the desired degrees of freedom, use the critical t value for the next smallest degrees of freedom With a = .05, one tailed, and df = 9, the critical t value is 1.833

Decide Whether to Reject H0 If the observed t (the value you calculated) is larger than the critical t, then you can reject H0 Because our observed t (1.80) is not larger than the critical t (1.833), we fail to reject H0 that people with schizophrenia smoke less than or equal to the population

Decide Whether to Reject H0 That is, there is no statistically reliable difference in the average number of cigarettes smoked by the population and by people with schizophrenia This does not claim that there is no difference, but rather that we failed to observe the difference if it did exist

Problem Do indoor cats weigh a different amount than outdoor cats which weigh an average of 11 pounds? Xindoor = 13 pounds sindoor = 3.75 pounds nindoor = 16

Other Uses of t The t distributions can also be used to determine if a correlation coefficient is probably different from 0. The correlation between your scores on the first and second exam is r = .7840, n = 23 Is this correlation probably different from 0?

Write Hypotheses; Specify a H0: r = 0 H1: r ¹ 0 This is a two tailed hypothesis as it does not state whether the correlation should be positive or negative a = .05

Calculate the Appropriate Statistic The formula for t given r and n is to the right: The degrees of freedom is the number of pairs of scores minus 2

Calculate the Appropriate Statistic Plug and chug the t formula Calculate the degrees of freedom

Determine the Critical t Value Consult a table to determine the critical t with a = .05, two-tailed, and df = 21 The critical t value is 2.080

Decide Whether to Reject H0 If the observed t (the calculated value) is larger than the critical t, we can reject H0 that the correlation does not exist The observed t (5.79) is larger than the critical t (2.080), so we reject H0 This implies that a relation probably does exist between the two exam scores

Problem The correlation between how many cats you own and how introverted you are is r = 0.6 (made-up) The sample size was 102 Is this correlation reliably different from r = 0?