Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing.

Slides:



Advertisements
Similar presentations
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Advertisements

Significance Testing Chapter 13 Victor Katch Kinesiology.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
1 Analysis of Variance This technique is designed to test the null hypothesis that three or more group means are equal.
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Topic 2: Statistical Concepts and Market Returns
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Inference about a Mean Part II
Chapter 2 Simple Comparative Experiments
Chapter 11: Inference for Distributions
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Getting Started with Hypothesis Testing The Single Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Overview Definition Hypothesis
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Statistical Analysis Statistical Analysis
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Interval Estimation and Hypothesis Testing
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
CHAPTER 18: Inference about a Population Mean
Statistical Analysis I have all this data. Now what does it mean?
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Chapter Eight: Using Statistics to Answer Questions.
PCB 3043L - General Ecology Data Analysis.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Lecture Nine - Twelve Tests of Significance.
Two-Sample Hypothesis Testing
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Chapter 2 Simple Comparative Experiments
Central Limit Theorem, z-tests, & t-tests
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Chapter 9 Hypothesis Testing.
Descriptive and inferential statistics. Confidence interval
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Interval Estimation and Hypothesis Testing
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing

Statistics: The Science of Data Data comprises quantitative measurements of individuals Individuals are representative sample from a population Population is modeled by a probability density function representing the likelihood of measurement values Statistics is a collection of tools and techniques for organizing, analyzing, illustrating, and interpreting data

Basic Data Analysis Tools Data: Mean and median: what’s the middle – Sample mean,,is the average – Median is the middle data point (of the sorted list) Standard deviation, IQR, median absolute deviation: how much variability Histograms and box plots: what does the distribution look like?

Histograms and Box Plots Each bar is the number of data points between the ordinate values of the bar Should look like a piecewise constant approximation (like Riemann sums in calc) The box is bounded by the first and third quartiles, with the mid line being the median. The whiskers go out to q1-1.5*IQR and q3+1.5*IQR Outliers are plotted beyond the whiskers

Science and Statistics: An Abstract View Theory: we have a population of individuals or “experimental units” (EUs) – In bio applications, these are typically organisms – In medical applications, these are typically patients Inquiry: we propose hypotheses about the properties of these EUs. – How an organism respond to stress – How a patient responds to treatment – Does one treatment work better than another

Principles of Statistical Modeling Modeling Concept 1: We can characterize the EUs with a vector of attributes that can be observed Modeling Concept 2: EUs selected randomly from the population produce attributes according to a probability distribution Modeling Concept 3: The population’s probability distribution is known except for a parameter vector that must be estimated from observations Modeling Concept 4: “Truth” is defined by this unknown parameter vector.

Elements of a Hypothesis Test Sample of data Two competing hypotheses: the null and its alternative A statistic, which is a function of the data with a known sampling distribution A rejection criterion against which we assess the statistic’s value to decide whether or not we can reject the null.

The Math of Statistics, 1 The parametrically modeled probability distribution The parameter represents truth about the population Question: what can we say about after we’ve seen some x ’s?

The Math of Statistics, 2 The probability density models EUs by weighting the possible measurement values Area under curve tells us probabilities

The Math of Statistics, 3 The sample is a collection Ideally the histogram of these would look like the probability density (if we knew )

Population vs. Sample Population is fixed – Very large – Impractical to investigate all members Population has one distribution Population has parameters – Fixed, but usually not known Samples are random – Large enough to be representative – Small enough to be studied Each sample has a histogram Sample has statistics – Known, but repeated samples will have different values Meta: we can think of a population of possible statistic values!!!!!

The biggest idea in statistics In most circumstances, a larger sample produces an average that more accurately represents a population mean. If has average If the population has mean  and std dev  Then the population of averages has mean  and std dev And the sample average tends to be normally distributed as n grows

Hypothesis Testing For the Mean Population is characterized by a central value  and a spread  of values around that. – Should be symmetric – Tails should taper relatively quickly – The actual values of  and  are not known The question is the following: Is the unknown  equal to a specified value  0 ? – H 0 :  =  0 – H A :  ≠  0

Mistakes That Can Be Made, 1 Choosing H A when H 0 is true – Type I error – The greek letter  is used denote the likelihood – In applications, this is usually a false positive or false detection. – Common approach is to select a value of  we’re willing to tolerate  =0.05 is the most common choice – Concept: Over many many repetitions when H 0 is true,  percent of the time, we’d declare H 0 to be false

Mistakes That Can Be Made, 2 Choosing H 0 when H 0 is false – Type II error – The greek letter  is used denote the likelihood – In applications, this is usually a false negative or missed detection. – Common approach is to hope  is small – 1 -  is called the power of the test Represents the likelihood of detecting a real effect!!! This is the probability of selecting H A when H A is true – Note that H A being true is complicated: as long as  ≠  0 the alternative H A is true! Even if by !!!!

Some Concepts and Lingo Generally H 0 is something you expect not to be true. – For example, you expect a non-zero mean In science, models can only be demonstrated to be false. We reject an actually true H 0 fairly infrequently (depends on the  we choose) When H 0 is not rejected by the test, we say that we “fail to reject H 0,” not that we accept H 0. – The Type II error probability is difficult to assess

How to Test Collect a sample Form the t-statistic If H 0 is true, T has a known probability density – Student’s T distribution with n-1 degrees of freedom Choose critical value, t , of T distribution – Such that would occur with probability . H 0 :  =  0 H A :  ≠  0

The P Value Instead of the critical value and the T statistic, we often use  directly with the p value statistic – Plug the T statistic into its (null) distribution and find the associated probability. T value and its minus P-value is the shaded area added together

Doing this in Excel Data in a column or row Compute the sample mean with the average function Compute the sample standard deviation with the stdev function Compute the t statistic Compute the p-value by plugging the t statistic into the integral with tdist(T,n- 1,2) – That last 2 is for two-tailed integral Alternatively, use ttest to compute. – Ttest is designed for two-sample comparison, so you have to trick it by creating a sample with all  0 ’s

More On Student’s T Null true: Centered at 0 Slightly false null: Centered near 0 Extremely false null: Centered far from 0

Type I and Type II  is the black shaded: Depends Only on Null  is the red shaded: Depends on how far the red curve is shifted Some alternatives are easier to detect

The Alternative Hypothesis If H 0 is true, T has Student’s T distribution with n-1 degrees of freedom If H A is true, then has the T distribution! H 0 :  =  0 H A :  ≠  0

The Alternative Hypothesis

We fail to reject the null when What this tell us: – If we have s and n fixed, an effect of size d leads to a power of 1 –  – If we have s and n fixed, a power of 1 –  requires an effect size no smaller than d. – If we want a power of 1 –  and an effect size of d, then we need n samples to achieve our goal.

Effect Size, Sample Size, and Power To detect an alternative of with power 1- , we need With n samples, an effect size of d can be detected with power from

Multi-Group Similarity Testing Population comprises a fixed set of groups: 1,2, …, p – Usually thought of as “statistically identical” individuals within the groups – Each group receives a different “treatment” – Process leads to groups that may have different means  1  p, – Groups have the same variance   – We sample from each group, size n 1,…n p The question is the following: Is at least one treatment different? – H 0 :   =  2 =…  p – H A : At least one of the  i ’s is different

A Digression Given two numbers, how do we compare them? – Subtract to compute the difference – Divide to compute the ratio Statistical use of subtraction relies on T-statistics Two numbers are equal if difference is 0 Statistical use of division relies on F-statistics Two numbers are equal if ratio is 1

Probability Density Functions The normal distribution(mu,sigma): bell shaped, with – mu+/- sigma containing 68% – mu+/- 2sigma containing 95.4% – mu+/- 3sigma containing 99.7% Chi squared (m) – This distribution is what you get when you square m normal(0,1)’s and add them up – – The quantity below is chi squared (n-1)

Probability Density Functions The T-distribution comes from dividing a normal(0,1) by the square root of a chi- squared The F-distribution comes from a ratio of chi- squareds

: ANOVA Collect a sample Test the hypothesis: Assumption: common variance  2 H 0 :   =  2 =…  p H A : At least one of the  i ’s is different

How To Test All treatments have the same mean under H0

Pseudo-ANOVA Collect a sample Test the hypothesis: Assumption: common variance  2 H 0 :   =  2 =…  p =0 H A : At least one of the  i ’s is different from 0

How To Test All treatments have the same mean under H0