THE NATURE OF REGRESSION ANALYSIS Al Muizzuddin F.

Slides:



Advertisements
Similar presentations
MTH 161: Introduction To Statistics
Advertisements

Correlation and regression Dr. Ghada Abo-Zaid
Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
Simple Linear Regression and Correlation
CHAPTER 1: THE NATURE OF REGRESSION ANALYSIS
Lecture 8 Relationships between Scale variables: Regression Analysis
Linear Regression.
Describing the Relation Between Two Variables
Note 14 of 5E Statistics with Economics and Business Applications Chapter 12 Multiple Regression Analysis A brief exposition.
Statistics for Business and Economics
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Statistical Relationship Between Quantitative Variables
Linear Regression and Correlation Analysis
Managerial Economics By Mr. Tahir Islam. Demand Estimation by regression analysis  What is regression The term regression was first introduced by Francis.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Ch 2 and 9.1 Relationships Between 2 Variables
Econometrics I Summer 2011/2012 Course Guarantor: prof. Ing. Zlata Sojková, CSc., Lecturer: Ing. Martina Hanová, PhD.

Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Regression Analysis British Biometrician Sir Francis Galton was the one who used the term Regression in the later part of 19 century.
CHAPTER 2: TWO VARIABLE REGRESSION ANALYSIS: SOME BASIC IDEAS
405: ECONOMETRICS Chapter # 1: THE NATURE OF REGRESSION ANALYSIS By: Domodar N. Gujarati Prof. M. El-Sakka Dept of Economics: Kuwait University.
Correlation and Regression
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
Statistics for Business and Economics Chapter 10 Simple Linear Regression.
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Lecture 22 Dustin Lueker.  The sample mean of the difference scores is an estimator for the difference between the population means  We can now use.
Bivariate Regression Analysis The most useful means of discerning causality and significance of variables.
TWO-VARIABLEREGRESSION ANALYSIS: SOME BASIC IDEAS In this chapter:
Wednesday, October 12 Correlation and Linear Regression.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
FINANCIAL ECONOMETRIC Financial econometrics is the econometrics of financial markets Econometrics is a mixture of economics, mathematics and statistics.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Basic Ideas of Linear Regression: The Two- Variable Model chapter.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Linear Statistical.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Independent Dependent Scatterplot Least Squares
EED 401: ECONOMETRICS COURSE OUTLINE
Lecture 5 Discussion for Today  Probability sampling  Non probability sampling  Questionnaire.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1/25 Introduction to Econometrics. 2/25 Econometrics Econometrics – „economic measurement“ „May be defined as the quantitative analysis of actual economic.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
The Nature of Econometrics Tools of Using Econometrics.
Econometrics ECM712s P. Nangula Tel:
Chapter 4 Basic Estimation Techniques
BASIC ECONOMETRICS.
Regression Analysis Chapters 1-2
Elementary Statistics
Chapter 1: THE NATURE OF REGRESSION ANALYSIS
Introduction to Econometrics
Econometrics Analysis
Linear Regression Summer School IFPRI
Regression Part II.
CORRELATION & REGRESSION compiled by Dr Kunal Pathak
REGRESSION ANALYSIS 11/28/2019.
Presentation transcript:

THE NATURE OF REGRESSION ANALYSIS Al Muizzuddin F

HISTORICAL ORIGIN OF THE TERM REGRESSION  The term regression was introduced by Francis Galton.  In a famous paper, Galton found that, although there was a tendency for tall parents to have tall children and for short parents to have short children, the average height of children born of parents of a given height tended to move or “regress” toward the average height in the population as a whole.

THE MODERN INTERPRETATION OF REGRESSION  Regression analysis is concerned with the study of the dependence of one variable, the dependent variable, on one or more other variables, the explanatory variables, with a view to estimating and/or predicting the (population) mean or average value of the former in terms of the known or fixed (in repeated sampling) values of the latter.

Examples 1.Reconsider Galton’s law of universal regression. Galton was interested in finding out why there was a stability in the distribution of heights in a population. But in the modern view our concern is not with this explanation but rather with finding out how the average height of sons changes, given the fathers’ height. 2.Corresponding to any given age, we have a range, or distribution, of heights.Obviously, not all boys of a given age are likely to have identical heights. But height on the average increases with age (of course, up to a certain age), which can be seen clearly if we draw a line (the regression line)

Next

Examples (2) 3.Turning to economic examples, an economist may be interested in studying the dependence of personal consumption expenditure on aftertax or disposable real personal income. Such an analysis may be helpful in estimating the marginal propensity to consume (MPC), that is, average change in consumption expenditure for, say, a dollar’s worth of change in real income 4.A labor economist may want to study the rate of change of money wages in relation to the unemployment rate.

STATISTICAL VERSUS DETERMINISTIC RELATIONSHIPS  In statistical relationships among variables we essentially deal with random or stochastic variables, that is, variables that have probability distributions.  In functional or deterministic dependency, on the other hand, we also deal with variables, but these variables are not random or stochastic.

Next  The dependence of crop yield on temperature, rainfall, sunshine, and fertilizer, for example, is statistical in nature in the sense that the explanatory variables, although certainly important, will not enable the agronomist to predict crop yield exactly because of errors involved in measuring these variables as well as a host of other factors (variables) that collectively affect the yield but may be difficult to identify individually. Thus, there is bound to be some “intrinsic” or random variability in the dependent-variable crop yield that cannot be fully explained no matter how many explanatory variables we consider.

Next  In deterministic phenomena, on the other hand, we deal with relationships of the type, say, exhibited by Newton’s law of gravity, which states: Every particle in the universe attracts every other particle with a force directly proportional to the product of their masses and inversely proportional to the square of the distance between them. Symbolically, F = k(m1m2/r 2), where F = force, m1 and m2 are the masses of the two particles, r = distance, and k = constant of proportionality.

REGRESSION VERSUS CAUSATION  Although regression analysis deals with the dependence of one variable on other variables, it does not necessarily imply causation.  In the crop-yield example cited previously, there is no statistical reason to assume that rainfall does not depend on crop yield. The fact that we treat crop yield as dependent on rainfall (among other things) is due to nonstatistical considerations: Common sense suggests that the relationship cannot be reversed, for we cannot control rainfall by varying crop yield.

REGRESSION VERSUS CORRELATION  Closely related to but conceptually very much different from regression analysis is correlation analysis, where the primary objective is to measure the strength or degree of linear association between two variables.  The correlation coefficient, which we shall study in detail in Chapter 3, measures this strength of (linear) association. For example, we may be interested in finding the correlation (coefficient) between smoking and lung cancer, between scores on statistics and mathematics examinations, between high school grades and college grades, and so on.

TERMINOLOGY AND NOTATION

Types of regression  If we are studying the dependence of a variable on only a single explanatory variable, such as that of consumption expenditure on real income, such a study is known as simple, or two-variable, regression analysis.  if we are studying the dependence of one variable on more than one explanatory variable, as in the crop-yield, rainfall, temperature, sunshine, and fertilizer examples, it is known as multiple regression analysis.

Random  The term random is a synonym for the term stochastic. As noted earlier, a random or stochastic variable is a variable that can take on any set of values, positive or negative, with a given probability.

THE NATURE AND SOURCES OF DATA FOR ECONOMIC ANALYSIS  Three types of data may be available for empirical analysis: time series, cross-section, and pooled (i.e., combination of time series and crosssection) data. See page 27 Gujarati.  Note on the Measurement Scales of Variables. See page 30 Gujarati.