Presentation is loading. Please wait.

Presentation is loading. Please wait.

Economics 5550/6550 Statistical Tools 2014. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's.

Similar presentations


Presentation on theme: "Economics 5550/6550 Statistical Tools 2014. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's."— Presentation transcript:

1 Economics 5550/6550 Statistical Tools 2014

2 Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's consider health care costs. Most simply: How fast are they rising? Are ours higher than Canada's, UK, or others; if so how much? Does managed care reduce costs; if so, how much? Are patients in managed care settings sicker, healthier; if so how much?

3 Most of the time, such assertions are made with certainty. Yet, they are almost certainly not certain. These involve testing hypotheses. Many of you have had some passing experience with experimental laboratory work. Most often the work is a demonstration, rather than an experiment (i.e. mix up these 2 chemicals and you'll blow up the lab). Let's see what we have in mind with empirical work. Statistics and Hypotheses

4 Toggle to Country Data Pct GDP Spent Over Time (EXCEL SS)Pct GDP Spent Over Time Look at the Data

5 What does this look like to you?

6 How do we formulate hypotheses? The style is to use the "null" hypothesis; then contrast it to the "alternative" hypothesis. Null means “nothing is happening.” Let's look at several countries. Are the health expenditure shares equal? H 0 : s US = s UK. H 1 : s US  s UK.

7 H 0 : s US = s UK. H 1 : s US  s UK. So, if s US seems to be about the same as s UK, we'll accept the null hypothesis. If it seems to be different, we'll reject the null hypothesis. This type of hypothesis is called a "simple hypothesis." Simple Hypotheses

8 Composite Hypotheses Often we want to consider more than simply equality. Suppose we are examining the health expenditures of rich people, E r, as opposed to poor people, E p. We could certainly compare equalities. We might also wish to test the received wisdom that the rich spend more, or: H 0 : E r = E p H 1 : E r > E p

9 Composite Hypotheses H 0 : E r = E p H 1 : E r > E p As opposed to a "simple hypothesis" above, this called a "composite hypothesis." Note that this is not the same as above. If somehow, we discovered that E p > E r, it would not validate this hypothesis. So what? Well, let's go back to the health care system, for example. What costs more? What is more effective? Ultimately, statistical analysis is how the linkages between smoking and cancer were discovered. Tobacco industry fought this for years!

10 So, how do we test hypotheses? Let’s look at the US v. the UK Is the share different? It was in 1960. That’s only one data point. Let’s look at some others. It looks like most of the time US > UK.

11 Measure of central tendency There is an enormous body of statistical literature that suggests that if we take the average, or arithmetic mean, of a group of people, we will have a better measure of "central tendency." Think GPA. The intuition is straightforward. While any single observation may be suspect, the premise is that a measure taken from a larger number of observations may be less suspect. What other measures are sometimes used? Median, mode. Why?

12 Hypothesis testing Here (Spreadsheet):Spreadsheet

13 Hypothesis testing Well, they look different. Hard to imagine that they would be identical. Are they different? d = 10.6 – 6.0 = 4.6. Is that little or big, compared to 0?

14 Testing Differences Let's see if we can do things a bit more rigorously, looking at the "spread" of the data. So:

15 Testing Differences We'd like to determine whether the difference d could have been generated randomly; that is, what is the possibility that in truth s US = s UK, and somehow we just managed to find 2 samples that generated these separate means? We want to compare the difference to the spread. If the difference is large (relative to the spread), it would seem to be unlikely that it was caused at random. If it is small, it might have been caused at random.

16 Testing Differences A useful way, and by far the most frequent way, that we calculate the "spread" is to subtract each value from the mean, square the term, add the squared terms, and divide by the number of observations.

17 Testing Differences If the spread is large, then this measure is large. If the spread is small, then this measure will be small. We often take the square root of this difference to get the standard error for the US. We can do the same for the UK, as well. Our EXCEL worksheet helps us out.EXCEL

18 Testing Differences It turns out that we can calculate a standard error of the difference which equals: sd d = (sd US 2 + sd UK 2 ) 1/2 sd d = (.2038 2 +.5207 2 ) 1/2 = 0.5592. Let's compare the spread, or standard error, of the difference to the difference. The difference of 4.6 is about 8.3 times as big as the standard error. Clearly, if the difference was 0, we'd accept the fact that s US = s UK.

19 t statistic The ratio of the difference to the standard error is called a Student’s t statistic. You used to have to look these up with tables, but now we can actually use a spreadsheet. Let’s take a look.

20 What I learned on this summer on vacation “Student” was William Sealy Gossett, a statistician for Guiness. Gosset applied his statistical knowledge – both in the brewery and on the farm – to the selection of the best yielding varieties of barley.

21 Another researcher at Guinness had previously published a paper containing trade secrets of the Guinness brewery. To prevent further disclosure of confidential information, Guinness prohibited its employees from publishing any papers regardless of the contained information. However, after pleading with the brewery and explaining that his mathematical and philosophical conclusions were of no possible practical use to competing brewers, he was allowed to publish them, but under a pseudonym ("Student"), to avoid difficulties with the rest of the staff. Thus his most noteworthy achievement is now called Student's, rather than Gosset's, t-distribution.

22 Key points for hypothesis testing: State hypotheses clearly Choose suitable sample Calculate appropriate measures of central tendency and dispersion Draw appropriate inferences


Download ppt "Economics 5550/6550 Statistical Tools 2014. Statistics and Hypotheses If you are following the health care system, you see a lot of discussion. Let's."

Similar presentations


Ads by Google