Presentation on theme: "REGRESI SEDERHANA BAGIAN 2"— Presentation transcript:
1 REGRESI SEDERHANA BAGIAN 2 Ekonometrika 1Al Muizzuddin F
2 REVIEW BAGIAN 1 Menentukan nilai koefisien bo dan b1 Asumsi dalam regresi OLS
3 A Measure of “Goodness of Fit” THE COEFFICIENT OF DETERMINATION (R2)we shall find out how “well” the sample regression line fits the dataFigure A :Venn diagram
4 In this figure the circle Y represents variation in the dependent variable Y and the circle X represents variation in the explanatory variable X.When there is no overlap, R2 is obviously zero, but when the overlap is complete, R2 is 1, since 100 percent of the variation in Y is explained by X. As we shall show shortly, R2 lies between 0 and 1.
5 Analysis of VarianceThe total variability in a regression analysis, SST, can be partitioned into a component explained by the regression, SSR, and a component due to unexplained error, SSE
6 With the components defined as, Total sum of squares :Error sum of squares :Regression sum of squares :
7 Measure of Coefficient of Determination, R2 The Coefficient of Determination for a regression equation is defined as
8 Some Examples EXAMPLE 1 : HYPOTHETICAL DATA ON WEEKLY FAMILY CONSUMPTION EXPENDITURE (Y) AND WEEKLY FAMILY INCOME (X)
10 The estimated regression line is The value of β2 = , which measures the slope of the line, shows that, within the sample range of X between $80 and $260 per week, as X increases, say, by $1, the estimated increase in the mean or average weekly consumption expenditure amounts to about 51 cents.Y = 24, ,5091 XiR2 = 0,9621
11 The value of β1 = , which is the intercept of the line, indicates the average level of weekly consumption expenditure when weekly income is zero.The value of R2 of means that about 96 percent of the variation in the weekly consumption expenditure is explained by income. Since R2 can at most be 1, the observed R2 suggests that the sample regression line fits the data very well.
12 EXAMPLE 2 :FOOD EXPENDITURE IN INDIAThe data relate to a sample of 55 rural households in India. The regressand in this example is expenditure on food and the regressor is total expenditure, a proxy for income, both figures in rupees. The data in this example are thus cross-sectional data.
13 FoodExpi = 94,20 + 0,43 TotalExpiIf total expenditure increases by 1 rupee, on average, expenditure on food goes up by about 44 paise (1 rupee = 100 paise).If total expenditure were zero, the average expenditure on food would be about 94 rupees.The R2 value of about 0.37 means that only 37 percent of the variation in food expenditure is explained by the total expenditure.
14 EXAMPLE 3 :THE RELATIONSHIP BETWEEN EARNINGSAND EDUCATIONThe data relating average hourly earnings and education, as measured by years of schooling. Using that data, if we regress average hourly earnings (Y) on education (X), we obtain the following results.Yi = -0, ,7241 Xi
15 As the regression results show, there is a positive association between education and earnings, an unsurprising finding.For every additional year of schooling, the average hourly earnings go up by about 72 cents an hour. The intercept term is positive but it may have no economic meaning.The R2 value suggests that about 89 percent of the variation in average hourly earnings is explained by education. For cross-sectional data, such a high R2 is rather unusual.
16 Basis for Inference About the Population Regression Slope Let 1 be a population regression slope and b1 its least squares estimate based on n pairs of sample observations. Then, if the standard regression assumptions hold and it can also be assumed that the errors i are normally distributed.
17 Excel Output for Retail Sales Model The regression equation is Y Retail Sales = X Income
18 Tests of the Population Regression Slope If the regression errors i are normally distributed and the standard least squares assumptions hold (or if the distribution of b1 is approximately normal), the following tests have significance value :To test either null hypothesisagainst the alternativethe decision rule is
19 2. To test either null hypothesis against the alternativethe decision rule is
20 3. To test the null hypothesis Against the two-sided alternativethe decision rule is
21 F test for Simple Regression Coefficient We can test the hypothesisagainst the alternativeBy using the F statisticThe decision rule isWe can also show that the F statistic isFor any simple regression analysis.
22 ConclusionCoefficient intercept and slopeR-squaredF-testt-tes