Lunch & Learn Statistics By Jay. Goals Introduce / reinforce statistical thinking Understand statistical models Appreciate model assumptions Perform simple.

Slides:



Advertisements
Similar presentations
Chapter 9 Introduction to the t-statistic
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
1 Analysis of Variance This technique is designed to test the null hypothesis that three or more group means are equal.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Chapter 6 Hypotheses texts. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Hypothesis : Statement about a parameter Hypothesis testing : decision making procedure about the hypothesis Null hypothesis : the main hypothesis H 0.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Inference about a Mean Part II
Chapter 2 Simple Comparative Experiments
Horng-Chyi HorngStatistics II41 Inference on the Mean of a Population - Variance Known H 0 :  =  0 H 0 :  =  0 H 1 :    0, where  0 is a specified.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
BCOR 1020 Business Statistics
Today Concepts underlying inferential statistics
5-3 Inference on the Means of Two Populations, Variances Unknown
“There are three types of lies: Lies, Damn Lies and Statistics” - Mark Twain.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Statistical Hypothesis Testing. Suppose you have a random variable X ( number of vehicle accidents in a year, stock market returns, time between el nino.
Descriptive statistics Inferential statistics
More About Significance Tests
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Unit 8 Section : z Test for a Mean  Many hypotheses are tested using the generalized statistical formula: Test value = (Observed Value)-(expected.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
Hypothesis testing Summer Program Brian Healy. Last class Study design Study design –What is sampling variability? –How does our sample effect the questions.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
CHAPTER 9 Testing a Claim
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
© Copyright McGraw-Hill 2004
Chapter 9 Day 2 Tests About a Population Proportion.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
A.P. STATISTICS EXAM REVIEW TOPIC #2 Tests of Significance and Confidence Intervals for Means and Proportions Chapters
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Virtual University of Pakistan
Chapter 2 Simple Comparative Experiments
Presentation transcript:

Lunch & Learn Statistics By Jay

Goals Introduce / reinforce statistical thinking Understand statistical models Appreciate model assumptions Perform simple statistical tests

What topics will we cover? Statistical concepts. Probability. Definitions Descriptive statistics. Hypothesis Formulation Hypothesis testing. Normal Distribution.  and  errors. Student’s t distribution Paired and unpaired t tests. Analysis of variance Regression Categorical data. Sensitivity / specificity. Chi square tests.

Session #1 Review Observations vary: sample v. population Observational vs. experimental data Graphing data

Session #2 Review Statistics are functions of the data Useful statistics have known distributions Statistical inference Estimation Testing hypotheses Tests seek to disprove a “null” hypothesis

Session #3 Review Tests involve a NULL hypothesis (H 0 ) an ALTERNATIVE hypothesis (H A ) Try to disprove H 0 4 steps in hypothesis testing –Identify the test statistic –State the null and alternative hypotheses –Identify the rejection region –State your conclusion

Session #4 Concepts  (type I) and  (type II) errors. Normal Distribution and the z- statistic: a mathematical construct. The Central Limit Theorem: a divine gift for statistical inference? We will use the Normal distribution to: … perform hypothesis tests … calculate power and sample size

Type I error …rejecting the null hypothesis when in fact, it is true = P (reject H 0 | H 0 true) =  Generally,  =0.05 For one-sided tests, it is conservative to choose  =0.025

Type II error …accepting H 0 when in fact H A is true. = P (accepting H 0 | H A true) =  Often we pick  =0.20 or 0.10 and calculate the sample size to achieve this goal. In drug development, it is wasteful (but less expensive) to choose  >0.10

What is Power? Power is 1- , that is… = 1 - P (accepting H 0 | H A true) = P (rejecting H 0 | H A true) …rejecting H 0 when in fact H A is true. This is something we want to happen: our goal!

Normal Distribution Parametric Mean  and Variance  2 Mean = point of symmetry  2 = “spread” of bell curve Complicated mathematical formula f(x) = exp[-(x-  ) 2 /2  2 ]/ [  (2  )] Looks like a bell centered at  Distance from midline to inflection point =  Importance: Central Limit Theorem

Central Limit Theorem Averages of random variables ~ N( ,  2 /n) Proof: uses Taylor and MacLauren expansions of infinite series and other nasty mathematical tricks When n ~ teens  averages ~Normal for reasonable distributions When n>30  averages ~Normal no matter how weird the original distribution

Z- statistic If xbar ~ N( ,  2 /n), then… (xbar-  )/(  /  n) ~ N(0,1) [Note change in the denominator] We say that “xbar has been normalized”

Why the z-statistic is useful Hypothesis tests require a test statistic with a known distribution The z-statistic distribution is known Averages of anything (if n>30) can use the z-statistic

Why the z-statistic is useful Hypothesis tests require a test statistic with a known distribution The z-statistic distribution is known Averages of anything (if n>30) can use the z-statistic ASSUMPTION: WE KNOW THE VARIANCE  2 OF THE POPULATION STUDIED

Example: C-section data Test if initial SBP is too low, I.e., < 85 mm Hg Four steps in testing: 1.Identify test statistic 2.State hypotheses 3.Identify rejection region 4.State conclusions

Example: C-section data Identify a test statistic: … minimum value? How is it distributed? … median value? How is it distributed? … average value? How is it distributed? N( ,  2 / n )

Example: C-section data Identify a test statistic: xbar ~ N( ,  2 / n ) Therefore… : z = (xbar-  )/(  /  n) is the test statistic. We know it’s distribution: N(0,1)

Example: C-section data Identify a test statistic: z State null and alternative hypotheses: H 0 :  >=85 (remember, put = with H 0 ) H A :  < 85

Example: C-section data Identify a test statistic: z for average SBP State the null and alternative hypotheses H 0 :  >=85 H A :  < 85 Identify the rejection region Under H 0, we use  0 for population mean: z = (xbar-  0 )/(  /  n) Here  0 is 85 mmHg When z < -z , we reject H 0.

Example: C-section data Final step: state conclusion Calculate z = (xbar-  0 )/(  /  n) xbar =  0 = 85 (according to H 0 ) For now, we will use for  n = 20 z = -2.65, which is < (z 0.05 ) Therefore, we reject H 0 and conclude: Data not consistent with SBP at least 85 mmHg

Example: C-section data Final step: state conclusion Calculate z = (xbar-  0 )/(  /  n) xbar =  0 = 85 (according to H 0 ) For now, we will use for  n = 20 z = -2.65, which is < (z 0.05 ) Therefore, we reject H 0 and conclude: Data not consistent with SBP at least 85 mmHg “Nominal p-value” Look up  value associated with z = -2.65: Get

Calculating Power of a Test Recall, power = P (rejecting H 0 | H A true). [reject H 0 at  level, when H A is true] Start with “reject H 0 at  level”: P (reject H 0 | H 0 true) =  = P (z < - z  | H 0 true) for our example = P [(xbar-  0 )/(  /  n) < -z  |  =  0 ). = P [xbar <  0 – z  (  /  n)].

Calculating Power Note in this case P [(xbar-85)/1.79 < ] = 0.05 P [xbar < 85 –(1.645)(1.79)] = 0.05 P (xbar < ) = 0.05 (Another way to write the rejection region of the hypothesis test)

Calculating Power Step 2: re-write probability for type II: Power = P (reject H 0 | H A true) P [xbar <  0 – z  (  /  n)]. = P [(xbar-  A )/(  /  n) < (  0 – z  (  /  n) -  A )/ (  /  n)] = P [z A < (  0 -  A )/ (  /  n) – z  ]

Calculating Power = P [z A < (  0 -  A )/ (  /  n) – z  ] = P [z A < (85-80)/1.79 – 1.645] = P (z A < 1.148) = 0.875

Calculating Sample Size = P [z A < (  0 -  A )/ (  /  n) – z  ] Choose power of 0.90  z A = (table) < (  0 -  A )/ (  /  n) – z  Solve for n. Usually  0 =0;  A,  specified;  2 estimated. For two-sided test, use z  /2

Calculating Sample Size = P [z A < (  0 -  A )/ (  /  n) – z  ] Choose power of 0.90  z A = (table) < (  0 -  A )/ (  /  n) – z  < (85-80)/ (8.006/  n) – Solve for n: 21.97, round up to 22

Review of Session #4  (type I) and  (type II) errors. Normal Distribution and the z- statistic The Central Limit Theorem Hypothesis testing using z and N(0,1) Calculating power Calculating sample size

Session #4 Homework Using the C-section data… (1)Determine whether or not the increase in SBP exceeds 20 mmHg. [Hint: form paired differences. Calculate  2 on diffs.] (2)What is the power of this test to detect an increase of 10 mmHg in SBP? (3)Extra Credit: Find sample size that provides  90% chance of detecting an increase in SBP of 5 mmHg or more.