Notes Bivariate Data Chapters 7 - 9. Bivariate Data Explores relationships between two quantitative variables.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 B Residuals
Residuals.
Chapter 6: Exploring Data: Relationships Lesson Plan
Scatter Diagrams and Linear Correlation
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Looking at Data-Relationships 2.1 –Scatter plots.
CHAPTER 3 Describing Relationships
C HAPTER 2 S CATTER PLOTS, C ORRELATION, L INEAR R EGRESSION, I NFERENCES FOR R EGRESSION By: Tasha Carr, Lyndsay Gentile, Darya Rosikhina, Stacey Zarko.
Haroon Alam, Mitchell Sanders, Chuck McAllister- Ashley, and Arjun Patel.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Descriptive Methods in Regression and Correlation
Relationship of two variables
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Chapter 6: Exploring Data: Relationships Chi-Kwong Li Displaying Relationships: Scatterplots Regression Lines Correlation Least-Squares Regression Interpreting.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Lines Correlation Least-Squares.
Residuals Target Goal: I can construct and interpret residual plots to assess if a linear model is appropriate. 3.2c Hw: pg 192: 48, 50, 54, 56, 58 -
Ch 3 – Examining Relationships YMS – 3.1
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
AP Statistics Chapter 8 & 9 Day 3
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Lesson Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Verbal SAT vs Math SAT V: mean=596.3 st.dev=99.5 M: mean=612.2 st.dev=96.1 r = Write the equation of the LSRL Interpret the slope of this line Interpret.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change in the response.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
Chapter 3: Describing Relationships
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Part II Exploring Relationships Between Variables.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
CHAPTER 3 Describing Relationships
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Chapter 6: Exploring Data: Relationships Lesson Plan
Cautions about Correlation and Regression
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 7 Part 1 Scatterplots, Association, and Correlation
AP Stats: 3.3 Least-Squares Regression Line
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Unit 4 Vocabulary.
Least Squares Regression
Review of Chapter 3 Examining Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
Chapter 3 Vocabulary Linear Regression.
Chapters Important Concepts and Terms
Honors Statistics Review Chapters 7 & 8
Review of Chapter 3 Examining Relationships
Presentation transcript:

Notes Bivariate Data Chapters 7 - 9

Bivariate Data Explores relationships between two quantitative variables.

The explanatory variable attempts to explain the observed outcomes. (In algebra this is your independent variable – “x”)

The response variable measures an outcome of a study. (In algebra this is your dependent variable – “y”)

○ When we gather data, we usually have in mind which variables are which. ○ Beware! – this explanatory/response relationship suggests a cause and effect relationship that may not exist in all data sets. Use common sense!!

○ A Lurking Variable is a variable that has an important effect on the relationship among the variables in a study but is not included among the variables being studied. ○ Lurking variables can suggest a relationship when there isn’t one or can hide a relationship that exists.

Displaying the Variables ○ We always graph our data right? ○ You use a scatterplot to graph the relationship between 2 quantitative variables. Each point represents an individual.

○ Remember that not all bivariate relationships are linear!!! We will talk about non- linear in the next unit.

Interpret a Scatterplot ○ Here is what we look for: ○ 1) direction (positive, negative) D ○ 2) form (linear, or not linear) S ○ 3) strength (correlation, r) S ○ 4) deviations from the pattern (outliers) U SUDS!!

Remember on outlier is an individual observation that falls outside the overall pattern of the graph. ○ There is no outlier test for bivariate data. It’s a judgment call

○ Categorical variables can be added to scatterplots by changing the symbols in the plot. (See P. 199 for examples) ○ Visual inspection is often not a good judge of how strong a linear relationship is. Changing the plotting scales or the amount of white space around a cloud of points can be deceptive. So….

A measure for strength... ○

Facts about Correlation: ○ 1) positive r – positive association (positive slope) negative r – negative association (negative slope) ○ 2) r must fall between –1 and 1 inclusive. ○ 3) r values close to –1 or 1 indicate that the points lie close to a straight line. ○ 4) r values close to 0 indicate a weak linear relationship. ○ 5) r values of –1 or 1 indicate a perfect linear relationship. ○ 6) correlation only measures the strength in linear relationships (not curves). ○ 7) correlation can be strongly affected by extreme values (outliers).

Least-Squares Regression Line ○ The least-squares regression line (LSRL) is a mathematical model for the data. ○ This line is also known as the line of best fit or the regression line.

Formal definition… ○ The least-squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible.

The form… ○

Some new formulas… ○

Why do we do regression? ○ The purpose of regression is to determine a model that we can use for making predictions.

Communication is always the goal!!! ○ When we write the equation for a LSRL we do not use x & y, we use the variable names themselves… ○ For example: ○ Predicted score = (hours studied)

Another measure of strength… ○ The coefficient of determination, r 2, is the fraction of the variation in the value of y that is explained by the linear model. ○ When we explain r 2 then we say… ___% of the variability in ___(y) can be explained by this linear model.

Deviations for single points ○ A residual is the vertical difference between an actual point and the LSRL at one specific value of x. That is, Residual = observed y – predicted y or Residual = y – ○ The mean of the residuals is always zero.

A new plot… ○ A residual plot plots the residuals on the vertical axis against the explanatory variables on the horizontal axis. ○ Such a plot magnifies the residuals and makes patterns easier to see.

Why do I need a residual plot? ○ Remember that all data is not linear in shape!!! The residual plot clearly shows if linear is appropriate. ○ A residual plot show good linear fit when the points are randomly scattered about y = 0 with no obvious patterns.

To create a residual plot on the calculator: ○ 1)You must have done a linear regression with the data you wish to use. ○ 2) From the Stat-Plot, Plot # menu choose scatterplot and leave the x list with the x values. ○ 3) Change the y-list to “RESID” chosen from the list menu. ○ 4) Zoom – 9

○ In scatterplots we can have points that are outliers or influential points or both. ○ An observation can be an outlier in the x direction, the y direction, or in both directions. ○ An observation is influential if removing it or adding it) would markedly change the position of the regression line.

○ Extrapolation is the use of a regression model for prediction outside the domain of values of the explanatory variable x. ○ Such predictions cannot be trusted.

Association vs. Causation ○ A strong association between two variables is NOT enough to draw conclusions about cause & effect.

Association vs Causation ○ Strong association between two variables x and y can reflect: ○ A) Causation – Change in x causes change in y ○ B) Common response – Both x and y are Responding to some other unobserved factor ○ C) Confounding – the effect on y of the explanatory variable x is hopelessly mixed up with the effects on y of other variables.

Association vs Causation ○ Cause and Effect can only be determined from a well designed experiment.

○ Data with no apparent linear relationship can also be examined in two ways to see if a relationship still exists: ○ 1) Check to see if breaking the data down into subsets or groups makes a difference. ○ 2) If the data is curved in some way and not linear, a relationship still exists. We will explore that in the next chapter.