Economics 5550 Statistical Tools. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's consider health.

Slides:



Advertisements
Similar presentations
Partly based on material by Sherry O’Sullivan
Advertisements

Chapter 15 ANOVA.
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Chapter 10.  Real life problems are usually different than just estimation of population statistics.  We try on the basis of experimental evidence Whether.
Statistical Tools for Health Economics Hypothesis Testing Difference of Means Regression Analysis Multiple Regression Analysis Statistical Inference in.
Dual Tragedies in the B-ham Paper. Module 2 Simple Descriptive Statistics and Univariate Displays of Data A Tale of Three Cities George Howard, DrPH.
Empirical Analysis Doing and interpreting empirical work.
1 Trust and divorce Separated or Divorced trust | No Yes | Total Low | | 247 | |
Statistics Are Fun! Analysis of Variance
Topic 2: Statistical Concepts and Market Returns
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Data and Analysis Let’s Recall Some Numbers Early on, we wanted to look at the difference between men and women … w.r.t. cholesterol.
Hypothesis Testing. G/RG/R Null Hypothesis: The means of the populations from which the samples were drawn are the same. The samples come from the same.
Mitchell Hoffman UC Berkeley. Statistics: Making inferences about populations (infinitely large) using finitely large data. Crucial for Addressing Causal.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Statistical hypothesis testing – Inferential statistics I.
Sampling error Error that occurs in data due to the errors inherent in sampling from a population –Population: the group of interest (e.g., all students.
Introduction to Hypothesis Testing
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Hypothesis Testing – Introduction
HYPOTHESIS TESTING Dr. Aidah Abu Elsoud Alkaissi
Hypothesis Testing, Part I. Learning Objectives By the end of this lecture, you should be able to: – Describe what is meant by null hypothesis and alternative.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
User Study Evaluation Human-Computer Interaction.
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Economics 5550/6550 Statistical Tools Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
1 …continued… Part III. Performing the Research 3 Initial Research 4Research Approaches 5 Hypotheses 6 Data Collection 7 Data Analysis.
Non-parametric tests (chi-square test) Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
AP Statistics Chapter 24 Comparing Means.
Chapter 10: Introduction to Statistical Inference.
1 Where we are going : a graphic: Hypothesis Testing. 1 2 Paired 2 or more Means Variances Proportions Categories Slopes Ho: / CI Samples Ho: / CI Ho:
Describing Distributions with Numbers Chapter 2. What we will do We are continuing our exploration of data. In the last chapter we graphically depicted.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
PCB 3043L - General Ecology Data Analysis.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
ANOVA, Regression and Multiple Regression March
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Intro to Inference & The Central Limit Theorem. Learning Objectives By the end of this lecture, you should be able to: – Describe what is meant by the.
Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
MATH Section 4.4.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
SECTION 1 TEST OF A SINGLE PROPORTION
HYPOTHESES TESTING. Concept of Hypotheses A hypotheses is a proposition which the researcher wants to verify. It may be mentioned that while a hypotheses.
Outline Sampling Measurement Descriptive Statistics:
Descriptive and Inferential Statistics
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
PCB 3043L - General Ecology Data Analysis.
Inferential statistics,
What goes in a results section?
Module 8 Statistical Reasoning in Everyday Life
Hypotheses Hypothesis Testing
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Variance and Hypothesis Tests
Presentation transcript:

Economics 5550 Statistical Tools

Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's consider health care costs. Most simply: How fast are they rising? Are ours higher than Canada's; if so how much? Do HMO's reduce costs; if so, how much? Are patients in managed care settings sicker, healthier; if so how much?

Most of the time, such assertions are made with certainty. Yet, they are almost certainly not certain. These involve testing hypotheses. Many of you have had some passing experience with experimental laboratory work. Most often the work is a demonstration, rather than an experiment (i.e. mix up these 2 chemicals and you'll blow up the lab). Let's see what we have in mind with empirical work. Statistics and Hypotheses

Why? Economists look at statements that while plausible, merit validation. Suppose for example that you are looking at a public health intervention that would seek to reduce cholesterol in the population. One useful question for targeting the intervention is who has a higher cholesterol level, men or women? Why is this important? It could turn out that one group is at risk, and another group isn't. You just want to know.

How do we formulate hypotheses? The style is to use the "null" hypothesis; then contrast it to the "alternative" hypothesis. Let's look at a sample of Wayne State college sophomores. We'll discuss this in more detail as we go along. We customarily refer to the null hypothesis as H 0, and the alternative hypothesis as H 1. So, H 0 : c m = c w. H 1 : c m  c w.

H 0 : c m = c w. H 1 : c m  c w. So, if c m seems to be about the same as c w, we'll accept the null hypothesis. If it seems to be different, we'll reject the null hypothesis. This type of hypothesis is called a "simple hypothesis." Simple Hypotheses

Composite Hypotheses Often we want to consider more than simply equality. Suppose we are examining the health expenditures of rich people, E r, as opposed to poor people, E p. We could certainly compare equalities. We might also wish to test the received wisdom that the rich spend more, or: H 0 : E r = E p H 1 : E r > E p

Composite Hypotheses H 0 : E r = E p H 1 : E r > E p As opposed to a "simple hypothesis" above, this is a "composite hypothesis." Note that this is not the same as above. If somehow, we discovered that E p > E r, it would not validate this hypothesis. So what? Well, let's go back to the health care system, for example. What costs more? What is more effective? Ultimately, statistical analysis is how the linkages between smoking and cancer were discovered.

So, how do we test hypotheses? Suppose that we start with WSU sophomores. Is this a good sample? Why yes? Why not? Let's find a man and test his cholesterol. We discover that c m = 150. We find a woman and test her cholesterol. We discover that c w = 180. It looks like women's rates are higher. Is this a good test? Why or why not? What if we had picked a different man? or a different woman?

Measure of central tendency There is an enormous body of statistical literature that suggests that if we take the average, or arithmetic mean, of a group of people, we will have a better measure of "central tendency." The intuition is straightforward. While any single observation may be suspect, the premise is that a measure taken from a larger number of observations may be less suspect. What other measures are sometimes used? Median, mode. Why?

From EXCEL CH_3.XLS

Hypothesis testing Again, in the book's example (sample of 30 each), which is based on real life numbers, we find that:

Hypothesis testing Well, they look different. Are they different? A couple of answers. You can plot the data. The men's numbers kind of look less than the women's, but some of the women less than some of the men. It may not give a lot of additional information.

Testing Differences Let's see if we can do things a bit more rigorously, looking at the "spread" of the data. So:

Testing Differences We'd like to determine whether the difference d could have been generated randomly; that is, what is the possibility that in truth c w = c m, and somehow we just managed to find 2 samples that generated these separate means? We want to compare the difference to the spread. If the difference is large (relative to the spread), it would seem to be unlikely that it was caused at random. If it is small, it might have been caused at random.

Testing Differences A useful way, and by far the most frequent way, that we calculate the "spread" is to subtract each value from the mean, square the term, add the squared terms, and divide by the number of observations.

Testing Differences If the spread is large, then this measure is large. If the spread is small, then this measure will be small. We often take the square root of this difference to get the standard error s w. We can do the same for men, as well. Our EXCEL worksheet helps us out.

Testing Differences It turns out that we can calculate a standard error of the difference which equals: s d = (s w 2 + s m 2 ) 1/2 s d = ( ) 1/2 = ( ) 1/2 = Let's compare the spread, or standard error, of the difference to the difference. The difference of 10.2 is about 1.65 times as big as the standard error. Clearly, if the difference was 0, we'd accept the fact that c w = c m.

Testing Differences The larger the difference relative to the spread, the more likely that c w  c m. If d/s d = 1.645, we could be about 90% certain that men's cholesterol is not equal to women's. As it turns out, that’s just about the certainty that we get here.

Key points for hypothesis testing: State hypotheses clearly Choose suitable sample Calculate appropriate measures of central tendency and dispersion Draw appropriate inferences