The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote.

Slides:



Advertisements
Similar presentations
The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote.
Advertisements

“Students” t-test.
Student’s t test This test was invented by a statistician working for the brewer Guinness. He was called WS Gosset ( ), but preferred to keep.
Statistical tests for Quantitative variables (z-test & t-test) BY Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Today Today: Chapter 10 Sections from Chapter 10: Recommended Questions: 10.1, 10.2, 10-8, 10-10, 10.17,
BCOR 1020 Business Statistics
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
T-Tests Lecture: Nov. 6, 2002.
Testing the Difference Between Means (Small Independent Samples)
S519: Evaluation of Information Systems
 What is t test  Types of t test  TTEST function  T-test ToolPak 2.
© 2004 Prentice-Hall, Inc.Chap 10-1 Basic Business Statistics (9 th Edition) Chapter 10 Two-Sample Tests with Numerical Data.
Basic Business Statistics (9th Edition)
Chapter 9 Hypothesis Testing.
Hypothesis Testing Using The One-Sample t-Test
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
AM Recitation 2/10/11.
Hypothesis Testing:.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Statistical inference: confidence intervals and hypothesis testing.
Hypothesis testing – mean differences between populations
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Education 793 Class Notes T-tests 29 October 2003.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 9: Testing Hypotheses
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Reasoning in Psychology Using Statistics Psychology
Chapter 8 Parameter Estimates and Hypothesis Testing.
T test-origin Founder WS Gosset Wrote under the pseudonym “Student” Mostly worked in tea (t) time ? Hence known as Student's t test. Preferable when the.
Business Statistics: A First Course (3rd Edition)
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Student’s t test This test was invented by a statistician WS Gosset ( ), but preferred to keep anonymous so wrote under the name “Student”. This.
Statistical tests for Quantitative variables (z-test, t-test & Correlation) BY Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community.
T tests comparing two means t tests comparing two means.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
Chapter 10: The t Test For Two Independent Samples.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
HYPOTHESIS TESTING.
Lecture Nine - Twelve Tests of Significance.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Math 4030 – 10a Tests for Population Mean(s)
Chapter 8 Hypothesis Testing with Two Samples.
“Students” t-test Prof Dr Najlaa Fawzi
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Hypothesis Tests for a Standard Deviation
What are their purposes? What kinds?
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Presentation transcript:

The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote under the name "Student".

t test-origin Founder WS Gosset Wrote under the pseudonym “Student” Mostly worked in tea (t) time ? Hence known as Student's t test. Certainly if n < 30

Is there a difference? between you…means, who is meaner?

Types One sample compare with population Unpaired compare with control Paired same subjects: pre-post

T-test 1.Test for single mean Whether the sample mean is equal to the predefined population mean ?. Test for difference in means 2. Test for difference in means Whether the CD4 level of patients taking treatment A is equal to CD4 level of patients taking treatment B ? Test for paired observation 3. Test for paired observation Whether the treatment conferred any significant benefit ?

Test direction One tailed t test Two tailed test

Developing the Pooled-Variance t Test (Part 1) Setting Up the Hypothesis: H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 H 0 :  1 =  2 H 1 :  1   2 OR Two Tail

Developing the Pooled-Variance t Test (Part 1) Setting Up the Hypothesis: H 0 :  1   2 H 1 :  1 >  2 H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 H 0 :  1 =  2 H 1 :  1   2 H 0 :  1 -  2  0 H 1 :  1 -  2 > 0 OR Right Tail Two Tail

Developing the Pooled-Variance t Test (Part 1) Setting Up the Hypothesis: H 0 :  1   2 H 1 :  1 >  2 H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 H 0 :  1 =  2 H 1 :  1   2 H 0 :  1  2 H 0 :  1 -  2  0 H 1 :  1 -  2 > 0 H 0 :  1 -  2  H 1 :  1 -  2 < 0 OR Left Tail Right Tail Two Tail  H 1 :  1 <  2

Mean systolic BP in nephritis is significantly higher than of normal person

Mean systolic BP in nephritis is significantly different from that of normal person Mean systolic BP in nephritis is significantly different from that of normal person

Statistical Analysis control group mean treatment group mean Is there a difference?

What does difference mean? medium variability high variability low variability The mean difference is the same for all three cases

What does difference mean? medium variability high variability low variability Which one shows the greatest difference?

What does difference mean? a statistical difference is a function of the difference between means relative to the variability a small difference between means with large variability could be due to chance like a signal-to-noise ratio low variability Which one shows the greatest difference?

So we estimate low variability signal noise difference between group means variability of groups = X T - X C SE(X T - X C ) = = t-value __ __

Two sample t-test Difference between means Sample size Variability of data t-test t t t t + +

Probability - p With t we check the probability Reject or do not reject Null hypothesis You reject if p < 0.05 or still less Difference between means (groups) is more & more significant if p is less & less

Area =.025 Area =.005 Z Area =.025 Area = Determining the p-Value

.9 5 t 0 f(t)  red area = rejection region for 2-sided test

Assumptions Normal distribution Equal variance Random sampling

t-Statistic When the sampled population is normally distributed, the t statistic is Student t distributed with n-1 degrees of freedom.

T- test for single mean The following are the weight (mg) of each of 20 rats drawn at random from a large stock. Is it likely that the mean weight of these 20 rats are similar to the mean weight ( 24 mg) of the whole stock ?

Steps for test for single mean 1.Questioned to be answered Is the Mean weight of the sample of 20 rats is 24 mg? N=20, =21.0 mg, sd=5.91, =24.0 mg 2. Null Hypothesis The mean weight of rats is 24 mg. That is, The sample mean is equal to population mean. 3. Test statistics --- t (n-1) df 4. Comparison with theoretical value if tab t (n-1) < cal t (n-1) reject Ho, if tab t (n-1) > cal t (n-1) accept Ho, 5. Inference

t –test for single mean Test statistics n=20, =21.0 mg, sd=5.91, =24.0 mg n=20, =21.0 mg, sd=5.91, =24.0 mg t  = t.05, 19 = Accept H 0 if t = Inference : There is no evidence that the sample is taken from the population with mean weight of 24 gm There is no evidence that the sample is taken from the population with mean weight of 24 gm

Given below are the 24 hrs total energy expenditure (MJ/day) in groups of lean and obese women. Examine whether the obese women’s mean energy expenditure is significantly higher ?. Lean Lean T-test for difference in means Obese Obese

Null Hypothesis Obese women’s mean energy expenditure is equal to the lean women’s energy expenditure. Obese women’s mean energy expenditure is equal to the lean women’s energy expenditure. Data Summary lean Obese lean Obese N S

H 0 :  1 -  2 = 0 (  1 =  2 ) H 1 :  1 -  2  0 (  1   2 )  = 0.05 df = = 20 Critical Value(s): t Reject H Solution

t XX S nSnS nn dfnn P               Hypothesized Difference (usually zero when testing for equal means) Compute the Test Statistic: ()) ( ()() ()() n1n1 n2n2 __ Calculating the Test Statistic:

Developing the Pooled-Variance t Test Calculate the Pooled Sample Variances as an Estimate of the Common Populations Variance: = Pooled-Variance = Variance of Sample 1 = Variance of sample 2 = Size of Sample 1 = Size of Sample 2

S nSnS nn P              (( (( ( ( ( ( ) ) ) ) ) )) ) First, estimate the common variance as a weighted average of the two sample variances using the degrees of freedom as weights

t XX S nn P       8.1. Calculating the Test Statistic: ( (())) tab t =20 dff = t 0.05,20 =2.086

T-test for difference in means Inference : The cal t (3.82) is higher than tab t at 0.05, 20. ie This implies that there is a evidence that the mean energy expenditure in obese group is significantly (p<0.05) higher than that of lean group

Example Suppose we want to test the effectiveness of a program designed to increase scores on the quantitative section of the Graduate Record Exam (GRE). We test the program on a group of 8 students. Prior to entering the program, each student takes a practice quantitative GRE; after completing the program, each student takes another practice exam. Based on their performance, was the program effective?

Each subject contributes 2 scores: repeated measures design StudentBefore ProgramAfter Program

Can represent each student with a single score: the difference (D) between the scores Student Before ProgramAfter Program D

Approach: test the effectiveness of program by testing significance of D Null hypothesis: There is no difference in the scores of before and after program Alternative hypothesis: program is effective → scores after program will be higher than scores before program → average D will be greater than zero H 0 : µ D = 0 H 1 : µ D > 0

Student Before Program After ProgramDD2D ∑D = 180∑D 2 = 7900 So, need to know ∑D and ∑D 2 :

Recall that for single samples: For related samples: where: and

Standard deviation of D: Mean of D: Standard error:

Under H 0, µ D = 0, so: From Table B.2: for α = 0.05, one-tailed, with df = 7, t critical = > → reject H 0 The program is effective.

t-Value t is a measure of: How difficult is it to believe the null hypothesis? High t Difficult to believe the null hypothesis - accept that there is a real difference. Low t Easy to believe the null hypothesis - have not proved any difference.

In Conclusion ! Student ‘s t-test will be used: Student ‘s t-test will be used: --- When Sample size is small --- When Sample size is small and for the following situations: and for the following situations: (1) to compare the single sample mean (1) to compare the single sample mean with the population mean with the population mean (2) to compare the sample means of (2) to compare the sample means of two indpendent samples two indpendent samples (3) to compare the sample means of paired samples (3) to compare the sample means of paired samples