Regression Regression: Mathematical method for determining the best equation that reproduces a data set Linear Regression: Regression method applied with.

Slides:



Advertisements
Similar presentations
Kin 304 Regression Linear Regression Least Sum of Squares
Advertisements

Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
EGR 105 Foundations of Engineering I
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Copyright © 2012 Pearson Education, Inc. 2.3 Another Look at Linear Graphs ■ Graphing Horizontal Lines and Vertical Lines ■ Graphing Using Intercepts ■
The Best-Fit Line Linear Regression. PGCC CHM 103 Sinex How do you determine the best-fit line through data points? x-variable y-variable Fortunately.
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
Regression and Correlation
Least Square Regression
Chapter 12a Simple Linear Regression
EGR 105 Foundations of Engineering I Fall 2007 – week 7 Excel part 3 - regression.
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Linear Regression Larson/Farber 4th ed. 1 Section 9.2.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Regression in EXCEL r2 SSE b0 b1 SST.
Introduction to Regression Analysis, Chapter 13,
Chapter 2 – Simple Linear Regression - How. Here is a perfect scenario of what we want reality to look like for simple linear regression. Our two variables.
Linear Regression Analysis
Linear Regression.
Introduction to Linear Regression and Correlation Analysis
Relationship of two variables
12b. Regression Analysis, Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Biostatistics Unit 9 – Regression and Correlation.
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Thomson Learning 2004 Jon Curwin and Roger Slater, QUANTITATIVE.
12a. Regression Analysis, Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Statistical Methods Statistical Methods Descriptive Inferential
Elementary Statistics Correlation and Regression.
Regression. Population Covariance and Correlation.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.
Regression Regression relationship = trend + scatter
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Section 9.2 Linear Regression. Section 9.2 Objectives Find the equation of a regression line Predict y-values using a regression equation.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Revision: Pivot Table 1. Histogram 2. Trends 3. Linear 4. Exponential
Chapter 6 (cont.) Difference Estimation. Recall the Regression Estimation Procedure 2.
7-3 Line of Best Fit Objectives
9.2 Linear Regression Key Concepts: –Residuals –Least Squares Criterion –Regression Line –Using a Regression Equation to Make Predictions.
Linear Regression 1 Section 9.2. Section 9.2 Objectives 2 Find the equation of a regression line Predict y-values using a regression equation.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Section 1-3: Graphing Data
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Correlation and Regression Chapter 9. § 9.2 Linear Regression.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
PreCalculus 1-7 Linear Models. Our goal is to create a scatter plot to look for a mathematical correlation to this data.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
The simple linear regression model and parameter estimation
Department of Mathematics
Regression and Correlation of Data Summary
Linear Regression Special Topics.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Regression and Correlation
Kin 304 Regression Linear Regression Least Sum of Squares
BPK 304W Regression Linear Regression Least Sum of Squares
Finding the Best Fit Line
Least-Squares Regression
Correlation and Regression
Multivariate Analysis Regression
Algebra Review The equation of a straight line y = mx + b
Lesson 2.2 Linear Regression.
Presentation transcript:

Regression Regression: Mathematical method for determining the best equation that reproduces a data set Linear Regression: Regression method applied with a linear model (straight line) Uses Prediction of new X,Y values Understanding data behavior Verification of hypotheses/physical laws

Regression The Linear Model Y = mX + b Y = Dependent variable X = Independent variable m = slope = DY/DX b = y-intercept (point where line crosses y-axis at x=0) X1=1, Y1=2.4 X2=20, Y2=10 DX DY

Regression Fitting the data: finding the equation for the straight line that does the best job of reproducing the data.

Regression Residual: Difference between measured and calculated Y-values

Regression Analysis Use the least square method to “best fit” a straight line through the data points. A straight line is described by its slope and “y”-intercept in a x-y plot. Need to determine the numerical values of the slope and the “y”-intercept from the data. This is equivalent to adding a trendline to your scatter plot in EXCEL.

Regression Analysis The least square method consists of defining a difference, called the residual, between the regression line and a data point along a measured “x” value. Then add up the squared residuals for all data points. Adjusting the slope and the “y”-intercept of the regression line so that the sum of squared residuals, called regression error, has the smallest value.

Regression Analysis The covariance appears in the calculation of the correlation coefficient between the measurements of two variables. Let us denote the two variables as “x” and “y”. Their measurements are the “x” data set and the “y” data set.

Regression Analysis The slope of the regression line is given by the ratio of the covariance between the “x” and “y” data sets and of the variance of the “x” data set. You then use the equation of the line to determine the y-intercept. You MUST use the mean of x and the mean of y for this equation since your data points are likely not on the regression line.

Regression Analysis Once we determined the slope and the “y” intercept of the regression line, we have a mathematical relation that ties the “x” variable to the “y” variable. We can use this relation to predict values of “y” given a “x” value that are not on the data sets.

Regression Analysis Interpolation – the process by which we use the regression line to predict a value of the “y” variable for a value of the “x” variable that is not one of the data points but is within the range of the data set. The “x” and “y” points will lie on the regression line.

Regression Analysis Extrapolation – the process by which we use the regression line to predict a value of the “y” variable for a value of the “x” variable that is outside of the range of the data set. The “x” and “y” points also lie on the regression line but outside of the range of the data set.

Tricks of the Trade A curve can be partitioned into sections and “best” fitted a different curve in each section. Use scaling as a mean to increase the accuracy of the “fitted” curve.

Multivariate Analysis Regression Prediction: Once the best fit line has been determined, the equation can be used to predict new values of Y for any given X and vice versa. (Interpolation/Extrapolation) y = 772.03x + 10810 If a states % of the population with a college degree is 20%, then they can expect an average income level of y = 772.03(20) + 10810 = $26,250 If a states average income level is $30,000, then what % of its population has a college degree? x = (30,000 – 10810)/772.03 = 24.9%

Multivariate Analysis Excel Functions and Tools SLOPE() - Returns the slope when passed X, Y data.. INTERCEPT() - Returns the intercept when passed X, Y data.. LINEST() - Returns the slope and intercepts when passed X, Y data.. TREND() - Returns predicted values in a linear trend when passed X, Y data.. Trendline (from the Chart menu) Returns the trendline, equation, and correlation coefficient for a set of X,Y data.