Part II Exploring Relationships Between Variables.

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.2 C Coefficient of Determination
Advertisements

Linear Regression.  The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu:  The model won’t be perfect, regardless.
 Objective: To look for relationships between two quantitative variables.
Correlation Data collected from students in Statistics classes included their heights (in inches) and weights (in pounds): Here we see a positive association.
Chapter 8 Linear regression
Chapter 8 Linear regression
Linear Regression Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Copyright © 2010 Pearson Education, Inc. Chapter 8 Linear Regression.
Residuals Revisited.   The linear model we are using assumes that the relationship between the two variables is a perfect straight line.  The residuals.
Chapter 8 Linear Regression.
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Copyright © 2009 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Statistics Residuals and Regression. Warm-up In northern cities roads are salted to keep ice from freezing on the roadways between 0 and -9.5 ° C. Suppose.
CHAPTER 8: LINEAR REGRESSION
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 7 Linear Regression.
Scatter Diagrams and Linear Correlation
Chapter 8: Linear Regression
Correlation & Regression
Linear Regression.
Introduction to Linear Regression and Correlation Analysis
Relationship of two variables
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Scatterplots, Association,
Scatterplots, Associations, and Correlation
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Slide 7-1 Copyright © 2004 Pearson Education, Inc.
1 Chapter 7 Scatterplots, Association, and Correlation.
Section 2.2 Correlation A numerical measure to supplement the graph. Will give us an indication of “how closely” the data points fit a particular line.
Chapter 10 Correlation and Regression
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 8 Linear Regression *The Linear Model *Residuals *Best Fit Line *Correlation and the Line *Predicated Values *Regression.
Chapter 8 Linear Regression. Slide 8- 2 Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression (3)
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
1 Association  Variables –Response – an outcome variable whose values exhibit variability. –Explanatory – a variable that we use to try to explain the.
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Chapter 8 Linear Regression. Objectives & Learning Goals Understand Linear Regression (linear modeling): Create and interpret a linear regression model.
CHAPTER 8 Linear Regression. Residuals Slide  The model won’t be perfect, regardless of the line we draw.  Some points will be above the line.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
Chapter 7 Scatterplots, Association, and Correlation.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Linear Regression Chapter 8. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Module 11 Scatterplots, Association, and Correlation.
Copyright © 2010 Pearson Education, Inc. Chapter 8 Linear Regression.
Statistics 7 Scatterplots, Association, and Correlation.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 7, Slide 1 Chapter 7 Linear Regression.
Statistics 8 Linear Regression. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Training Activity 4 (part 2)
Scatterplots, Association, and Correlation
Chapter 7: Scatterplots, Association, and Correlation
Chapter 8 Linear Regression.
Chapter 7 Linear Regression.
Chapter 8 Part 2 Linear Regression
Unit 4 Vocabulary.
M248: Analyzing data Block D UNIT D2 Regression.
Correlation and Regression
Scatterplots Scatterplots may be the most common and most effective display for data. In a scatterplot, you can see patterns, trends, relationships, and.
Algebra Review The equation of a straight line y = mx + b
Correlation r Explained
Honors Statistics Review Chapters 7 & 8
Presentation transcript:

Part II Exploring Relationships Between Variables

Chapter 7 Scatterplots, Association, and Correlation Scatterplot Example : Timeplot Easiest way to observe the relationship between two quantitative variables.

Looking at Scatterplots Direction Positive Direction Negative Direction Amount of Scatter

Scatterplot Details Cartesian Plane Roles for Variables x axis : Explanatory (Independent variable) y axis :Response (Dependent variable) o. (x,y) Ordered Pair x y

Correlation Positive Association How Strong (0 to 1)

Standardizing x and y Z x Z y >0

Correlation Coefficient Measures the strength of the linear association between two quantitative variables

Correlation Conditions Quantitative Variables Straight enough condition No Outliers

Correlation Properties Sign : Direction of association Always between -1 and 1 r = 1 maximum linear Association r = 0 No linear association Correlation treats x and y symmetrically Correlation has no units Correlation is not affected by changes in center or scale. Correlation measures the strength of the linear association between two variables Variables can have a strong association but still have small correlation Correlation is sensitive to outliers

What Can Go Wrong? Check the conditions Don’t confuse Correlation with Causation Lurking Variables A hidden variable that stands behind a relationship and determines it by simultaneously affecting both variables.

Exercises Page

Chapter 8 Linear Regression The Linear Model is just an equation of a straight line through the data

Equation of the Line y = mx + b Line will pass through Standardizing y = mx Line will pass through (0,0) x y b

Predicted Values Each predicted “y” tends to be closer to its mean (in standard deviations) than its corresponding “x” was. This property of the linear model is called regression to the mean and the line is called the regression line. If you are 1SD taller then the predicted value for your weight will be (r)*1SD heavier

Back to the original units Slope Predicted Value y-intercept

Residuals Difference between the observed value and its associated predicted value Residual = Data – Model Residuals help us to see whether the model makes sense

Burger King Menu Total Fat (g) vs. Protein (g) Example Page 194 r = 0.83 Mean x = 17.2g s x = 14g Mean y = 23.5g s y =16.4g How much fat for an item of 31g of protein?

Regression Formula (Prediction) Slope? y-intercept? Prediction? Fat?

Correlation and residuals If r = 1.0 all the residuals would be zero and have no variation If r = 0 the model would simply predict the mean of y. The residuals from that prediction would just be the fat observed values minus their mean. Same variation as the data

The Squared Correlation r^2 gives the fraction of the data’s variance accounted for by the model and 1-r^2 is the fraction of the original variance left in the residuals. R^2=0 None of the variance in the data is in the model; all of it is still in the residuals. How big should it be?

Least Squares Regression Line Minimizes the sum of the squared residuals. Assumptions and Conditions Linearity Straight enough condition Plot Residuals Regression Output on the computer Page 214