Correlation and Regression Basics

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and Linear Regression.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression Analysis
Linear Regression and Correlation
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation and Regression Analysis
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Regression and Correlation
Correlation & Regression
Correlation and Regression
Linear Regression.
Relationship of two variables
Correlation and Linear Regression
Anthony Greene1 Correlation The Association Between Variables.
Chapter 6 & 7 Linear Regression & Correlation
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
Correlation and Linear Regression INCM Correlation  Correlation coefficients assess strength of linear relationship between two quantitative variables.
Statistical Methods Statistical Methods Descriptive Inferential
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
10B11PD311 Economics REGRESSION ANALYSIS. 10B11PD311 Economics Regression Techniques and Demand Estimation Some important questions before a firm are.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Correlation and Linear Regression
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Statistics 200 Lecture #6 Thursday, September 8, 2016
Regression Analysis AGEC 784.
Elementary Statistics
Correlation & Regression
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
REGRESSION (R2).
Data Analysis Module: Correlation and Regression
Correlation and Simple Linear Regression
A little VOCAB.
Correlation and Regression
STAT 4030 – Programming in R STATISTICS MODULE: Multiple Regression
Chapter 4 Correlation.
Regression Analysis.
Correlation and Regression Basics
STAT 4030 – Programming in R STATISTICS MODULE: Confidence Intervals
Understanding Research Results: Description and Correlation
HMI 7530– Programming in R STATISTICS MODULE: Multiple Regression
2. Find the equation of line of regression
Bivariate Testing (Chi Square)
HMI 7530– Programming in R STATISTICS MODULE: Confidence Intervals
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Prepared by Lee Revere and John Large
Bivariate Testing (Chi Square)
Lecture Notes The Relation between Two Variables Q Q
The Weather Turbulence
Introduction to Probability and Statistics Thirteenth Edition
STEM Fair Graphs.
Correlation and Regression
Section 1.4 Curve Fitting with Linear Models
Simple Linear Regression
Product moment correlation
Regression and Categorical Predictors
Warsaw Summer School 2017, OSU Study Abroad Program
Chapter Thirteen McGraw-Hill/Irwin
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Presentation transcript:

Correlation and Regression Basics HMI 7530– Programming in R STATISTICS MODULE: Correlation and Regression Basics Jennifer Lewis Priestley, Ph.D. Kennesaw State University 1

STATISTICS MODULE Basic Descriptive Statistics and Confidence Intervals Basic Visualizations Histograms Pie Charts Bar Charts Scatterplots Ttests One Sample Paired Independent Two Sample Proportion Testing ANOVA Chi Square and Odds Regression Basics 2 2 2

STATISTICS MODULE: Correlation Dependent Variable Independent (predictor) Variable Statistical Test Comments Quantitative Categorical T-TEST (one, two or paired sample) Determines if categorical variable (factor) affects dependent variable; typically used for experimental or planned change studies Chi-Square Tests if variables are statistically independent (i.e. are they related or not?) Correlation/ Regression Analysis Tests if two or more quantitative variables are related (binary is a special case). 3

STATISTICS MODULE: Correlation Correlation coefficients assess strength of linear relationship between two quantitative variables. The correlation measure ranges from -1 to +1. A negative correlation means that X and Y are inversely related. A positive correlation means that X and Y are directly related. Zero correlation means that X and Y are not linearly related. A correlation of +1 indicates X and Y are directly related and that all the points fall on the same straight line. A correlation of -1 indicates X and Y are inversely related and that all the points fall on the same straight line Plot Scatter Diagram of Each Predictor variable and Dependent Variable Look of Departures from Linearity Look for extreme data points (Outliers) Examine Partial Correlation Can’t determine causality, but isolate confounding variables 4

STATISTICS MODULE: Correlation For those who are interested: 5

STATISTICS MODULE: Correlation Age (yr) Prices Advertised ($) 1 12995 10950 2 10495 3 10995 4 6995 7990 5 8700 6 5990 4995 9 3200 2250 3995 11 2900 2995 13 1750 Consider the Used Car Data – Are these two variables related? The first step is to plot the data in a scatterplot. We can assess the Pearson Correlation Coefficient (r). 6

STATISTICS MODULE: Correlation The correlation coefficient is -0.945. Is this logical? Why or Why not? The line drawn through this scatterplot is called the “best fit” line – because it is the linear function that minimizes the distances between the output of the linear function and the observed points. 7

STATISTICS MODULE: Linear Regression From the previous slide, the “regression line” has been imposed onto the relationship between Price and Age of car. The equation of this line takes the general form of y=mx+b, where: Y is the dependent variable (Price) M is the slope of the line X is the independent variable (Age) B is the Y-intercept. When we discussion regression models, we transform this equation to be: Y = bo + b1x1 + …bnxn Where bo is the y-intercept and b1 is the slope of the line. The “slope” is also the effect of a one unit change of x on y. 8

STATISTICS MODULE: Linear Regression mod1 <- lm(Price~Year, data = cars) Results: 9

STATISTICS MODULE: Linear Regression From the previous slide, the model equation is presented in the form of the equation of a line: y=-924x + 12320. From this, we would say: For every 1 year of a car’s age, there is a $924 decrease in the price of the car. Every car “starts” at $12,320. If a car is 2 years old, the expected price is $10,472. That R2 value of .8937 is interpreted as “89.37% of the change in price of the cars can be explained by this linear model, where age is the only predictor”. 10