Published by Louis Elsey. Modified over 2 years ago.

1
Parameter Estimation, Dummies, & Model Fit
We know mechanically how to “run a regression”, but how are the parameters actually estimated?
How can we handle “categorical” explanatory (independent) variables?
What is a measure of “goodness of fit” of a statistical model to data?

2
Example: Alien Species
Exotic species cause economic and ecological damage.
Not all countries are equally invaded.
We want to understand the characteristics of a country that make it more likely to be “invaded”.

3
Understanding Invasive Species
Steps to improving our understanding:
1. Generate a set of hypotheses (so they can be “accepted” or “rejected”).
2. Develop a statistical model. Interpret the hypotheses in the context of the statistical model.
3. Collect data. Estimate the parameters of the model.
4. Test the hypotheses.

4
2 Hypotheses (in words)
We’ll measure “invasiveness” as the ratio of alien to native species (article by Dalmazzone).
1. Population density plays a role in a country’s invasiveness.
2. Island nations are more invaded than mainland nations.

5
Population Density

6
Island vs. Mainland

7
Variables
Dependent: ratio of the number of alien species to the number of native species in each country.
Independent: Island? (yes/no); Population density; GDP per capita; Agricultural activity.

8
Computer Minimizes $\sum_i e_i^2$
Remember, OLS finds the coefficients that minimize the sum of squared residuals.
Graphical representation
Why is this appropriate? It can be shown that, under the standard (Gauss-Markov) assumptions, this criterion yields the most precise (minimum-variance) estimates among linear unbiased estimators.
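The least-squares criterion can be checked numerically. The sketch below uses synthetic data (made up for illustration, not the invasive-species data set) and NumPy's `np.linalg.lstsq`; the helper name `ssr` is hypothetical. It fits a one-regressor model and confirms that a few nearby coefficient vectors give a larger sum of squared residuals than the OLS solution.

```python
import numpy as np

# Synthetic data (hypothetical, NOT the invasive-species data set).
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 1.5 + 0.8 * x + rng.normal(0, 1, size=50)

# Design matrix: a column of ones for the intercept, then x.
X = np.column_stack([np.ones_like(x), x])

def ssr(beta):
    """Sum of squared residuals for a candidate coefficient vector."""
    resid = y - X @ beta
    return float(resid @ resid)

# OLS estimate: the coefficient vector that minimizes ssr.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

# Perturbing the coefficients in any direction increases the SSR.
for shift in ([0.1, 0.0], [0.0, -0.05], [-0.3, 0.02]):
    assert ssr(beta_hat + np.array(shift)) > ssr(beta_hat)

print("OLS coefficients:", beta_hat)
print("minimum SSR:", ssr(beta_hat))
```

At the minimum the residuals are orthogonal to every column of the design matrix, which is the condition the computer actually solves.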

9
Dummy Variable
Generally: Male/Female; Pre-regulation/Post-regulation; etc.
Use a “Dummy Variable”: value = 1 if the country is an island, 0 otherwise.
More generally, if there are n categories, use n-1 dummies. E.g. to distinguish between 6 continents, use 5 dummies.
Problem: each added dummy costs a “degree of freedom”.
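The n-1 rule can be sketched in a few lines of plain Python. The function `make_dummies` and the continent labels are hypothetical, chosen only to illustrate the encoding: one category is the baseline and gets no column, and every other category gets its own 0/1 column.

```python
def make_dummies(values, baseline):
    """Encode a categorical variable with n categories as n-1 0/1 columns.

    The baseline category gets no column; it is represented by all zeros.
    """
    categories = sorted(set(values))
    if baseline not in categories:
        raise ValueError("baseline must be one of the observed categories")
    kept = [c for c in categories if c != baseline]
    return {c: [1 if v == c else 0 for v in values] for c in kept}

# Hypothetical continent labels for six countries (illustration only).
continents = ["Africa", "Asia", "Europe", "N. America", "Oceania", "S. America"]
dummies = make_dummies(continents, baseline="Africa")

print(len(dummies))       # 5 dummy columns for 6 categories
print(dummies["Asia"])    # [0, 1, 0, 0, 0, 0]
```

Dropping one category avoids perfect collinearity with the intercept; each remaining coefficient is then read as a difference from the baseline.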

10
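A Simple Model
A simple linear model looks like this:
$A_i = \beta_1 + \beta_2\,\mathrm{Island}_i + \beta_3\,\mathrm{Pop.dens}_i + \beta_4\,\mathrm{GDP}_i + \beta_5\,\mathrm{Agr}_i + e_i$
The dummy changes the intercept: island nations have intercept $\beta_1 + \beta_2$, mainland nations $\beta_1$.
Interaction dummy variable? E.g. invasions of island nations are more strongly affected by agricultural activity; multiplying the island dummy by agricultural activity lets the slope differ as well.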
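The difference between a plain dummy and an interaction dummy can be illustrated with a small sketch. The function `predict` and every coefficient value in it are made up for illustration and are not estimates from the slides. The dummy alone shifts the intercept; the product of the dummy and agricultural activity shifts the slope.

```python
def predict(agr, island, b0=0.10, b_island=0.05, b_agr=0.02, b_inter=0.01):
    """Linear prediction with an island dummy and an island*agr interaction.

    All coefficient values are made up for illustration.
    """
    return b0 + b_island * island + b_agr * agr + b_inter * island * agr

# Intercept (agr = 0): the dummy shifts it by b_island.
mainland_intercept = predict(0, island=0)
island_intercept = predict(0, island=1)

# Slope with respect to agr: the interaction term adds b_inter for islands.
mainland_slope = predict(1, island=0) - predict(0, island=0)
island_slope = predict(1, island=1) - predict(0, island=1)

print(mainland_intercept, island_intercept)
print(mainland_slope, island_slope)
```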

11
Translating our Hypotheses
2 Hypotheses
Hypothesis 1: Population: focus on $\beta_3$ (coefficient on Pop.dens)
Hypothesis 2: Island: focus on $\beta_2$ (coefficient on Island)
“Hypothesis Testing”… forthcoming in course.
Parameter Estimates:
              Value  Std.Error  t value  Pr(>|t|)
(Intercept) -0.0184     0.0859  -0.2141    0.8326
Island       0.0623     0.0821   0.7588    0.4564
Pop.dens     0.0010     0.0002   6.2330    0.0000
GDP          0.0000     0.0000   3.3249    0.0032
Agr         -0.0014     0.0015  -0.9039    0.3763
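The "t value" column is the estimate divided by its standard error. As a rough check, the sketch below recomputes it for the two rows whose printed values carry enough digits (Intercept and Island). For Pop.dens and GDP the estimates and standard errors are rounded too coarsely in the printed table to reproduce the reported t values, so they are left out.

```python
# Estimates, standard errors and reported t values copied from the
# parameter estimates (rounded to four decimals in the printed output).
rows = {
    "(Intercept)": (-0.0184, 0.0859, -0.2141),
    "Island": (0.0623, 0.0821, 0.7588),
}

t_values = {}
for name, (estimate, std_error, reported_t) in rows.items():
    # t statistic for the null hypothesis that the coefficient is zero.
    t_values[name] = estimate / std_error
    print(f"{name}: computed t = {t_values[name]:.4f}, reported t = {reported_t}")
```

Small differences in the last digit come from the rounding of the printed estimate and standard error.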

12
“Goodness of Fit”: $R^2$
“Coefficient of Determination”
$R^2$ = squared correlation between Y and the OLS prediction of Y.
$R^2$ = share of total variation that is explained by the regression; it lies in [0,1].
OLS maximizes $R^2$.
Adding an independent variable cannot decrease $R^2$.
Adjusted $R^2$ penalizes for the number of variables.
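These properties can be checked on synthetic data (made up for illustration, not the invasive-species data set). The sketch below assumes the usual definitions $R^2 = 1 - SS_{res}/SS_{tot}$ and adjusted $R^2 = 1 - (1 - R^2)(n-1)/(n-k-1)$ with k regressors; the helper name `fit_r2` is hypothetical. It fits the model with and without an irrelevant regressor.

```python
import numpy as np

# Synthetic data (hypothetical, NOT the invasive-species data set).
rng = np.random.default_rng(1)
n = 40
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)          # unrelated to y by construction
y = 2.0 + 1.0 * x1 + rng.normal(size=n)

def fit_r2(columns):
    """Fit OLS with an intercept; return (R^2, adjusted R^2)."""
    X = np.column_stack([np.ones(n)] + columns)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    y_hat = X @ beta
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    r2 = 1.0 - ss_res / ss_tot
    k = X.shape[1] - 1                       # regressors, excluding intercept
    adj = 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)
    # With an intercept, R^2 equals the squared correlation of y and y_hat.
    assert np.isclose(r2, np.corrcoef(y, y_hat)[0, 1] ** 2)
    return r2, adj

r2_small, adj_small = fit_r2([x1])
r2_big, adj_big = fit_r2([x1, x2])

print(r2_small, adj_small)
print(r2_big, adj_big)   # R^2 is never below r2_small; adjusted R^2 can be
```

Because plain $R^2$ never falls when a variable is added, it cannot be used on its own to choose between models of different size; adjusted $R^2$ is one simple correction.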

13
Answers
Island nations are more heavily invaded (coefficient 0.0623), but the effect is not significant (p = 0.46).
Population density has a positive impact on invasions (coefficient 0.001), and it is significant (p < 0.0001).
$R^2$ = 0.80; about 80% of the variation in the dependent variable is explained by the model. Also, corr(A, Ahat) = 0.89.
