Hypothesis Testing - the scientists' moral imperative moral imperative moral imperative To tell whether our data supports or rejects our ideas, we use.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Chapter 16 Inferential Statistics
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Hypothesis Testing Steps in Hypothesis Testing:
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Hypothesis Testing IV Chi Square.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
Chapter 10: Hypothesis Testing
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Making Inferences for Associations Between Categorical Variables: Chi Square Chapter 12 Reading Assignment pp ; 485.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
T-Tests Lecture: Nov. 6, 2002.
Inferences About Process Quality
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
AM Recitation 2/10/11.
Hypothesis Testing:.
Overview Definition Hypothesis
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Individual values of X Frequency How many individuals   Distribution of a population.
Chi-square Test of Independence Hypotheses Neha Jain Lecturer School of Biotechnology Devi Ahilya University, Indore.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Copyright © 2010 Pearson Education, Inc. Slide
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Chapter 10 The t Test for Two Independent Samples
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
© Copyright McGraw-Hill 2004
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
Leftover Slides from Week Five. Steps in Hypothesis Testing Specify the research hypothesis and corresponding null hypothesis Compute the value of a test.
The Analysis of Variance ANOVA
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Hypothesis Testing.  Hypothesis is a claim or statement about a property of a population.  Hypothesis Testing is to test the claim or statement  Example.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
15 Inferential Statistics.
Inferential Statistics 3: The Chi Square Test
Geog4B The Chi Square Test.
Assumptions For testing a claim about the mean of a single population
Chapter 12 Tests with Qualitative Data
Chapter 9 Hypothesis Testing.
Discrete Event Simulation - 4
Inference on Categorical Data
Hypothesis Tests for a Standard Deviation
Skills 5. Skills 5 Standard deviation What is it used for? This statistical test is used for measuring the degree of dispersion. It is another way.
Presentation transcript:

Hypothesis Testing - the scientists' moral imperative moral imperative moral imperative To tell whether our data supports or rejects our ideas, we use statistical hypothesis testing. The problem is that we often get data that seem to support our ideas. The literature is full of papers that accept a pet idea uncritically. Statistical testing keeps scientists honest. If you read a paper that suggests some alternative hypothesis should be accepted, but there is no statistical test, don't believe it. Immanuel Kant

“If you haven't measured it you don't know what you are talking about.” -William Thompson, Lord Kelvin If a topic is part of science, ideas have consequences that can be checked. We can count and measure aspects of nature to check our ideas. If our measurements are contrary to the predictions of our model, our hypothesis is FALSE, and we reject it "It does not make any difference how beautiful your guess is. It does not make any difference how smart you are, who made the guess, or what his name is-- if it disagrees with experiment it is wrong. That is all there is to it." -Richard Feynman (The Character of Physical Law)The Character of Physical Law

The Mean The mean is one of the commonly used statistics in science. It is often the "Expected Value" i.e. the value we expect to get. The mean is found by totalling the values for all observations (∑x) and dividing by the total number of observations (n). The formula for finding the mean is: Mean = ∑x n

Measures of the spread of data are the standard deviation,  and variance   Standard Deviation For s.d. calculate the difference of each observation and the mean, square it, add all these up, divide by the sample size, and take the square root. Refers to actual population Sample variance

Sample Standard Deviation "It is rarely possible to obtain observations from every item … in a population" Fowler et al p36 The estimate of  is s Where N-1 is called the degrees of freedom, and is one less than the number of observations

Knowing the Distribution In coin toss experiments, we know a formula for calculating the probability of any number of k heads in n trials, the Binomial Distribution.coin toss experiments Fortunately, we don’t have to know the distribution for every situation in nature. We are saved by the Central Limit Theorem

“… the means of a large number of samples drawn randomly from the same population are normally distributed ….". Fowler et al p 91 So it “pays us” to use means, not raw data Central Limit Theorem Sample this, formula unknown Means distributed like this, formula known

So if we make many observations and use averages as our data, we can draw valid conclusions because we know their distribution Many test statistics are available for this. In a few moments we will learn Chi-Square (X 2 ) Normal Distribution

Expected Value The expected value is the average result we expect. It is the product of the probability times the number of observations E = p x n A really useful case is the where the probability of all cases is equal. For example, in fair coin tosses, the probability of Heads =1/2. If we flip a coin 24 times, we EXPECT ½ x 24 = 12 Heads Equal probability cases are usually the basis of the "Null Hypothesis"

Hypotheses A hypothesis is a statement of the researcher’s idea or guess. To test a hypothesis the first thing we do is write down a statement – called the null hypothesis. The null hypothesis is often the opposite of the researcher’s guess.

For Example Some null hypotheses may be: –“there is no difference in lava viscosity between Hawaiian and Cascades volcanoes”. –“there is no relation between a volcanic islands’ height and its age.” –“there is no connection between the time since subaerial exposure of sediment and the height of the hills it forms”

The Hypotheses Null Hypothesis H 0 : ‘There is no difference between the average number of streams per square kilometer and the bedrock type.' Alternative Hypothesis H A or H 1 : ‘There is a difference between the average number of streams per square kilometer and the bedrock type.'

Significance (1) Before carrying out any test we have to decide on a significance level which lets us determine at what point to reject the null hypothesis and accept the alternative hypothesis.

Significance (2) Significance is based on the probability of a particular result. Statisticians have calculated the probability of all possible ‘chance’ events occurring.

Significance (3) Many trials, and the use of a higher significance level (P=.01 not P=.05), make this less likely If the probability of a particular result is less than 1 in 20 (P=0.05), we say the result is significant, ie: the result is not just a chance event. If the probability of a particular result is less than 1 in 100 (P=0.01), we say the result is highly significant; again, the result is not just a chance event.

Avoiding Decision Errors We always run the risk that we will observe a rare event, and we will draw the wrong conclusion. Usually we want to avoid a Type I error, where we reject H 0 even though it is true. Many trials, and the use of a higher significance level (P=.01 not P=.05), make this less likely In a Type II error, we accept the null hypothesis even though it is false. We will see an example of a type II error later.

Test Statistics We often want to see if two things are different from each other. In science we calculate what we would expect if there was no difference between them (the usual null hypothesis). We can then compare this to what we actually observe.

Test Statistics To check the null hypothesis we calculate a figure known as a test statistic, which is based on data from our samples. Different types of problems require different test statistics. Values for comparison to our data have all been put into statistical tables. All we need to do is to calculate our value and compare it with the value in the table to get our answer.

Using a Test Statistic If the test statistic shows you “observed an unlikely result”, you reject the null hypothesis and accept the alternative hypothesis

x 2 Significance Tables The significance levels available on a x 2 table are usually 0.05, 0.01, and.001, which means there is, respectively, only a 1 in 20 (0.05), a 1 in 100 (0.01), or a 1 in 1000 (0.001), probability of the event occurring by chance if that x 2 is obtained. The values in the tables are called critical values.

Chi-Square (X 2 ) Critical Values In Use Chi-Square (X 2)For Chi-Square (X 2) : If the value of the test statistic you have calculated is greater than the value in the table (the critical value) you decided to use, you can reject the null hypothesis and accept the alternative hypothesis. H 0 : There is no relation between the number of peaks along a ridge and the time since exposure

dfP = 0.05P = 0.01P = Chi-Square (X 2) Critical Values Significance table of X 2 values. This is the critical value table we will use in the examples below.

Important The chi square test can only be used on observations that have the following characteristics: The data must be in the form of frequencies The frequency data must have a precise numerical value and must be organised into categories or groups. The total number of observations must be greater than 20. The expected frequency in any one cell of the table must be greater than 5. * It is prudent to use means, not raw data, to insure a normal distribution * See the exception next slide **There are statistics designed to test this assumption Objects being counted are independent**

An Exception "The discrepancy is not large, however, when X 2 is computed from contingency tables with a fairly large number of cells (more than 4, at a minimum) and only a few theoretical frequencies are less than 5." Source: Spence, J. et al.(1968) Elementary Statistics The expected frequency in any one cell of the table must be greater than 5.

Other Statistics If any of the assumptions for X 2 are false, we cannot use X 2 However, there are test statistics for most situations, and they are all similar in their use. Once you know X 2 you can look up the correct statistic and apply it

The x 2 formula   means take the sum

Step 1. Write down the NULL HYPOTHESIS (H 0 ) and ALTERNATIVE HYPOTHESES (H a ) and set the LEVEL OF SIGNIFICANCE.Step 1. Write down the NULL HYPOTHESIS (H 0 ) and ALTERNATIVE HYPOTHESES (H a ) and set the LEVEL OF SIGNIFICANCE. H 0 ' A basaltic sand pile will not spread further than a quartz sand pile in the same time'H 0 ' A basaltic sand pile will not spread further than a quartz sand pile in the same time' H a ' A basaltic sand pile will spread further than a quartz sand pile in the same time 'H a ' A basaltic sand pile will spread further than a quartz sand pile in the same time ' We will set the level of significance at 0.05.We will set the level of significance at 0.05.

Step 2: Construct a table with the information you have observed. Use averages as data Method: 220 Hawaiian sand and 250 New Jersey sand piles of 50 cc each are left out in the weather for 1 week After 1 week, the distance of the furthest grain beyond the initial perimeter is measured. Every 5 piles are averaged. Furthest grain (mm) Row Total Quartz Basaltic Column Total Note that although there are 3 cells in the table that are not greater than 5, these are observed frequencies. It is only the expected frequencies that have to be greater than 5.

Work out the expected frequency. Expected frequency = row total x column total Grand total Furthest grain (mm) Row Total Quartz7.07 Basaltic Column Total Eg: expected frequency for oaks in PL1 = (50 x 13) / 92 = 7.07 You do the rest

Furthest grain (mm) Row Total Quartz Basaltic Column Total The Expected Frequencies

For each of the cells calculate: Furthest grain (mm) Row Total Quartz0.53 Basaltic Column Total Eg: Basaltic in 1-5 mm is (9 – 7.07) 2 / 7.07 = 0.53 (O – E) 2 E You do the rest

Furthest grain (mm) Quartz Basaltic Add up all of the above numbers to obtain the value for chi square: x 2 = (O – E) 2 E So: x 2 = (O – E) 2 E These are the But 

Look up the X 2 value on the table in the slide above. This will tell you whether to accept the null hypothesis or reject it. The number of degrees of freedom to use is: the number of rows in the table minus 1, multiplied by the number of columns minus 1. This is (2-1) x (5-1) = 1 x 4 = 4 degrees of freedom. We find that our answer of is greater than the critical value of 9.49 (for 4 degrees of freedom and a significance level of 0.05) and so we reject the null hypothesis. Now:

‘The distribution of grains spreading from sand piles made of basaltic minerals versus quartz is significantly different.’ Now you have to look for physical factors to explain your findings If you ask me, you should check the density. Basaltic Hawaiian basaltic sands are mostly pyroxenes The density of pyroxene is 3.24 g/cm3. The density of quartz is The pyroxene has greater potential energy when hilled to the same height as quartz sand We conclude: