The t-test With small samples, the z-test has to be modified. Statisticians use the t-test.

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Hypothesis Testing Math Overview Introduction One-Sample z Tests and t Tests Two-Sample z Tests Chi-Squared Tests Summary.
Chapter 10: Hypothesis Testing
BHS Methods in Behavioral Sciences I April 25, 2003 Chapter 6 (Ray) The Logic of Hypothesis Testing.
Hypothesis Testing Using a Single Sample
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Overview of Lecture Parametric Analysis is used for
Lecture 9: One Way ANOVA Between Subjects
Hypothesis Testing for Population Means and Proportions
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
T-Tests Lecture: Nov. 6, 2002.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 11: Inference for Distributions
PSY 307 – Statistics for the Behavioral Sciences
Statistics 270– Lecture 25. Cautions about Z-Tests Data must be a random sample Outliers can distort results Shape of the population distribution matters.
1 Confidence Interval for Population Mean The case when the population standard deviation is unknown (the more common case).
Confidence Intervals: Estimating Population Mean
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing: Two Sample Test for Means and Proportions
Getting Started with Hypothesis Testing The Single Sample.
Normal Distributions Section 4.4. Normal Distribution Function Among all the possible probability density functions, there is an important class of functions.
Chapter 12 Section 1 Inference for Linear Regression.
PSY 307 – Statistics for the Behavioral Sciences
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 8 Introduction to the t Test.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
Confidence Intervals and Hypothesis Testing
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Education 793 Class Notes T-tests 29 October 2003.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Statistics Introduction to Statistic [stuh-tis-tik] noun. A numerical fact or datum, especially one computed from a sample.
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
RDPStatistical Methods in Scientific Research - Lecture 11 Lecture 1 Interpretation of data 1.1 A study in anorexia nervosa 1.2 Testing the difference.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
CHAPTER 11 DAY 1. Assumptions for Inference About a Mean  Our data are a simple random sample (SRS) of size n from the population.  Observations from.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Test for Significant Differences T- Tests. T- Test T-test – is a statistical test that compares two data sets, and determines if there is a significant.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Sociology 5811: Lecture 11: T-Tests for Difference in Means Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Aim: How do we use a t-test?
Chapter 10 The t Test for Two Independent Samples
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Hypothesis test flow chart
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Review Statistical inference and test of significance.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Measurement Error In practice, if the same thing is measured several times, each result is thrown off by chance error, and the error changes from measurement.
Review Law of averages, expected value and standard error, normal approximation, surveys and sampling.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Chapter 10: The t Test For Two Independent Samples.
Introduction to the t Test
Presentation transcript:

The t-test With small samples, the z-test has to be modified. Statisticians use the t-test.

Example In Los Angeles, many studies have been conducted to determine the concentration of CO (carbon monoxide) near freeways with various conditions of traffic flow. Some machines are used to measure concentrations up to about 100 ppm (parts per million by volume) with errors on the order of 10 ppm. The machines are so delicate that they have to be calibrated every day. This involves measuring CO concentration in a manufactured gas sample, called span gas, where the concentration is precisely controlled at 70 ppm. If the machine reads close to 70 ppm on the span gas, then it’s ready for use; otherwise, it has to be adjusted. The size of the measurement errors varies from day to day. We assume that the errors are independent and follow the normal curve; the SD is unknown and changes from day to day.

Example One day, a technician makes five readings on span gas: Four out of five are higher than 70, and some of them by quite a bit. Can this be explained on the basis of chance variation? Or does it show bias, perhaps from improper adjustment of the machine? A test of significance is needed. First, we need to set up the equation: Measurement = true value + bias + chance error That is, observed value = 70 ppm + bias + error.

Set up a test We know that the errors will in general even out when repeating measurements. By the previous assumption, it follows the normal curve. What is the expected average of the errors? The errors should average out to 0, and the SD is unknown. According to the equation, how do we set up the null hypothesis? The null hypothesis says that, the difference between the true value and the average of the measurements is due to chance variation. That is, there is no bias. Hence, the difference follows the normal curve.

Set up a test

Calculation

Modify the test There could be a problem in the previous calculation. Why? The SD of the measurements is only an estimate for the SD of the errors. This estimate is good only when the number of measurements is reasonably large. In the example, the number (5) of measurements is so small that the estimate could be way off. So we have to modify the test in two steps.

Step 1

Step 2 Now we are ready to find the P-value. But with a small number of measurements, we use a different curve, called Student’s curve, rather than the normal curve. Using Student’s curve requires some work: there is one of the curves for each number of degrees of freedom. (The Student’s curves are a bunch of curves, the degrees of freedom tells you which to use.) To calculate the degree, we have the following formula: Degrees of freedom = number of measurements – one. In our example, the t-statistic is 2.2. From the table, we see that the P-value is less than 5%. That is quite a bit more than the 1% from the normal curve. (We still get rejection to the null hypothesis by significance, but not that strong.)

Student’s curves The dashed line above is Student’s curve for 4 degrees of freedom. The solid line is a normal curve, for comparison.

Student’s curves Here the dashed line is Student’s curve for 9 degrees of freedom. Again, we compare it with the normal curve.

Student’s curves Student’s curves look quite a lot like the normal curve: They are symmetric, bell-shaped, and centered at 0. But they are less piled up in the middle and more spread out. As the number of degrees of freedom goes up, the curves get closer and closer to the normal, reflecting the fact that the SD of the measurements is getting closer and closer to the SD of the error box. All the curves are symmetric around 0, and have the total area 100% for each.

To read the table The rows are labeled by degrees of freedom. The columns are labeled by the percentages of the areas. The entries are the t-statistics.

To read the table For example, if we look at the row for 4 degrees of freedom. The first entry is 1.53, in the column headed 10%. This means the area to the right of 1.53 under Student’s curve with 4 degrees of freedom is about 10%.

To find the P-value In our previous example, with 5 measurements, there are 5 – 1 = 4 degrees of freedom. The t-statistic is about 2.2, which is greater than 0. So we look at the area to the right of 2.2 under Student’s curve with 4 degrees of freedom. From the table, this is about 5%. (In fact, less than 5%.)

Remarks The t-test should be used under the following circumstances: The data are like draws from a box. The SD of the box is unknown. The number of observations is small, so the SD of the box cannot be estimated very accurately. The histogram for the errors does not look too different from the normal curve. With 25 or more measurements, the normal curve would ordinarily be used. If the SD of the box is known, and the errors follow the normal curve, then the normal curve can be used even for small samples.

Another example On another day, 6 readings on span gas turn out to be: Is the machine properly calibrated? Or do the measurements show bias?

Solution

Notes

Summary