Empirical Analysis Doing and interpreting empirical work.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing Steps in Hypothesis Testing:
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Decision Errors and Power
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Data and Doing Empirical Work. Data Often its own section of may be part of the analysis section. If there are a lot of variables or other things, put.
Statistics Micro Mini Threats to Your Experiment!
Using Statistics in Research Psych 231: Research Methods in Psychology.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Inferential Statistics
AM Recitation 2/10/11.
Overview Definition Hypothesis
Statistical Techniques I
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Chapter 8 Introduction to Hypothesis Testing
User Study Evaluation Human-Computer Interaction.
Chapter 20 Testing hypotheses about proportions
CHAPTER 17: Tests of Significance: The Basics
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Chapter 20 Testing Hypothesis about proportions
Chapter 21: More About Test & Intervals
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Foundations of Sociological Inquiry Statistical Analysis.
CHAPTER 9 Testing a Claim
Statistics for Decision Making Basic Inference QM Fall 2003 Instructor: John Seydel, Ph.D.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Statistical Techniques
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
One Sample Inf-1 In statistical testing, we use deductive reasoning to specify what should happen if the conjecture or null hypothesis is true. A study.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Unit 5: Hypothesis Testing
Understanding Results
CHAPTER 9 Testing a Claim
Central Limit Theorem, z-tests, & t-tests
CHAPTER 9 Testing a Claim
Chapter 9: Hypothesis Tests Based on a Single Sample
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Significance Tests: The Basics
CHAPTER 9 Testing a Claim
Simple Linear Regression and Correlation
Inferential Statistics
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

Empirical Analysis Doing and interpreting empirical work

Rules on Data You should never use data that you don’t know – Know the source and how it was collected – Understand the coding – Know how variables are measured – Know the time frame, locations and other relevant information You should be able to describe the data – Summary statistics – Be very clear about definitions Understand its limitations and special features Know why and how it addresses your research question

Rules for good empirical analysis Never use a technique you don’t understand Plan your approach – You are looking for something like “a” causes “b” – Find a technique that leads to that conclusion; don’t just automatically run a regression – Think about how your results could provide the answer to your hypotheses. In fact, plan your approach from the hypotheses Know whose behavior you are modeling Understand your causation. Be careful of spurious correlation, and of bi-directional causation. Defend the exogeneity of RHS variables.

Specifics on Empirical Models Describe what economic mechanism caused the dispersion in your right hand variables. Remember, natural experiments are rare. Do you really have one? Understand what economic mechanism constitutes the error term. What variation in the dependent variable is not covered by your predetermined variables. – Understand why your error term is uncorrelated with RHS variables, or how you will fix it – There should be economic as well as statistical reasons. If you use instrumental variables – Understand the difference between an instrument and a control. Should it be an additional variable not an instrument? – If it is an IV, why it is a good instrument and uncorrelated with the error

Think about the specification in terms of your hypotheses – Do you need a nonlinear model? – Remember that high R2 can be bad (left shoes = b0 + b1 right shoes) – What out for estimating identities

Significance There is a distinct difference between economic significance and statistical significance. You need to understand the difference. – Statistical significance is a measure of the strength of the signal relative to background noise. – Economic significance is a measure of the importance of the finding to supporting or disproving your hypotheses. Use statistical significance to test your null hypothesis. Use economic significance to test your research question.

Significance - Rules to Remember Statistical significance does not indicate causality. It simply measures how precisely the degree of correlation between the variables can be measured. Statistical significance is influenced by sample size. With a sufficiently large sample, almost any estimate can be found to be statistically significant at some level. Hence, the significance level to reject a null hypothesis should vary inversely with your sample size. This is why the 5% level is not absolute. Statistical significance is not the same as economic significance. Size matters!

Statistical Significance Statistical (Fisherian) significance has little to do with economic significance. What is statistical significance? What does it measure? Which of the following is the correct interpretation? – Given the data, the p-value tells us the probability that the null hypothesis is true. P(H0|D) – Given that the null hypothesis is true, the p-value tells us the probability of getting these data. P(D|H0) Does P(H0|D)= P(D|H0)? No

Example About 2% of a population has a disease. A test for the disease is accurate 95% of the time when the disease is present (called sensitivity) and will indicate the absence of the disease 97% of the time when the disease is absent (called specificity). Let H0 be that the disease is absent in the patient. Hence we have P(D|H0)<0.05. But when a test is done on a patient and we get an indication that the disease is present, we need to adjust for the fact that most people don’t have the disease, and there are false positives.

Example (continued). For every 1000 tests A positive test is more likely a false positive than a true positive. Recent suggestion to do away with PSA tests in men reflects this sort of result. Think about the example in Cohen about his colleague. Does anyone recall what that example is? ResultNormalDiseasedTotal Test shows patient is normal 9491 (5%)950 Test shows patient has the disease 30 (3%)2050 TOTAL

Pearson v. Fisher Buchanan-Wollaston claimed in any hypothesis test, there is a large region in the distribution of the criterion for which neither the hypothesis nor its reverse can be assumed true. He focused his comments on the  2 as a goodness of fit test. Pearson responded that the test is useful only to choose between two distributions, not to validate which distribution (hypothesis) is correct. Fisher responded that Buchanan was correct; rejection of the null is not equivalent to acceptance of the alternative. Indeed, it is the source of type II errors.

Pearson v. Fisher (continued) Pearson responded again, arguing that the conclusions and significance value used should be chosen with regard to the problem at hand. The test is a measure of the adequacy of some model to explain the observed data (Power of the test). Anything beyond that is inferential, not statistical. Both agree that the failure to reject the null hypothesis does not mean it is true. The only hypothesis proved true by a statistical test is the negation of a hypothesis that the observed sample has zero possibility. Think about the colleague example in the Smith paper.

Pearson v. Fisher (continued) Fisher tells us that the value of significance tests is only in that they tell us what to ignore. This is why null results (that the parameter is not significantly different from zero) are useful. Pearson tells us that scientists do not seek the truth, only ways to summarize the data; that is what significance testing is doing. What are we doing with econometrics? (discuss from class) Conclusion; statistical inference and economic inference are different things.

McCloskey and McCloskey and Ziliak What is their point? – Fisherian significance is meaningless in almost all cases of economic application, yet economists often report statistical significance as important. – Size matters! A well-measured but trivial economic effect may be neglected even if statistically significant. But an economically large but poorly measured effect should not be rejected because the signal is noisy.

Blank’s study in AER (1991) about acceptance rates and blind refereeing.. She found that the acceptance rates of papers written by women were lower than those written by men when refereeing was not blind. But the difference was statistically insignificant (at p<0.05) – If the difference was statistically significant with the sample, we know it would be so with a larger sample too. – But with it not statistically significant at an acceptable level, it tells us nothing about truth.

Example 2 Suppose you measure the equation Investment = a + b1*interest rate + b2*tax rates + e If tax rates and b are large, then tax rates are economically important. But if there is little variation in tax rates, you would find b2 not different from 0 at conventional levels of statistical significance.

So how big is big? There is no real answer, but if you do empirical analysis, keep the following rules in mind. Statistical significance is about sample size. The level of statistical significance “used” should fit your data, problem and importance of the variable to your hypotheses and conclusion, and to the context of your research question. Your assumptions matter, but you can’t just assume something to be true. For example, you can’t just assume a variable is exogenous. You need to argue for it, and sometimes test it.