Bivariate Data Pick up a formula sheet, Notes for Bivariate Data – Day 1, and a calculator.

Slides:



Advertisements
Similar presentations
Scatterplots and Correlation
Advertisements

3.1b Correlation Target Goal: I can determine the strength of a distribution using the correlation. D2 h.w: p 160 – 14 – 18, 21, 26.
Linear Regression (C7-9 BVD). * Explanatory variable goes on x-axis * Response variable goes on y-axis * Don’t forget labels and scale * Statplot 1 st.
Chapter 3 Bivariate Data
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Scatter Diagrams and Linear Correlation
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
What would you look like if your name was Tai Shan? Parents names: Mother: Mei Xiang Father: Tian Tian.
Chapter 4 Describing the Relation Between Two Variables
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
FACTOR THE FOLLOWING: Opener. 2-5 Scatter Plots and Lines of Regression 1. Bivariate Data – data with two variables 2. Scatter Plot – graph of bivariate.
Haroon Alam, Mitchell Sanders, Chuck McAllister- Ashley, and Arjun Patel.
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Linear Regression Analysis
Describing Relationships: Scatter Plots and Correlation ● The world is an indivisible whole (butterfly effect and chaos theory; quantum entanglement, etc.)
Descriptive Methods in Regression and Correlation
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
Biostatistics Unit 9 – Regression and Correlation.
Researchers, such as anthropologists, are often interested in how two measurements are related. The statistical study of the relationship between variables.
Chapter 3: Examining relationships between Data
Describing Relationships: Scatterplots and Correlation.
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
GENERATING Scatterplots and Correlations USING Ti84.
1.6 Linear Regression & the Correlation Coefficient.
Lesson Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Correlation The apparent relation between two variables.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Scatter Plots, Correlation and Linear Regression.
S CATTERPLOTS Correlation, Least-Squares Regression, Residuals Get That Program ANSCOMBE CRICKETS GESSEL.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Residuals and Residual Plots Section Starter A study showed that the correlation between GPA and hours of study per week was r =.6 –Which.
2.5 Using Linear Models P Scatter Plot: graph that relates 2 sets of data by plotting the ordered pairs. Correlation: strength of the relationship.
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Statistics: Analyzing 2 Quantitative Variables MIDDLE SCHOOL LEVEL  Session #2  Presented by: Dr. Del Ferster.
Ch 14 – Inference for Regression YMS Inference about the Model.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Exploring Relationships Between Variables. The explanatory variable attempts to “explain” the response variable. You would use the explanatory variable.
Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change in the response.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-3 Linear Regression, Residuals, and Variation.
Wednesday: Need a graphing calculator today. Need a graphing calculator today.
Correlation & Linear Regression Using a TI-Nspire.
Two-Variable Data Analysis
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Scatterplots Association and Correlation
Sections Review.
Objectives Fit scatter plot data using linear models with and without technology. Use linear models to make predictions.
LSRL Least Squares Regression Line
Scatterplots A way of displaying numeric data
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Do Now Create a scatterplot following these directions
Least Squares Regression
Do Now Describe the following relationships. Do Now Describe the following relationships.
Objectives Vocabulary
Chapter 3 Vocabulary Linear Regression.
Exploring Relationships Between Variables
Presentation transcript:

Bivariate Data Pick up a formula sheet, Notes for Bivariate Data – Day 1, and a calculator.

Archaeopteryx is an extinct beast having feathers like a bird but teeth and a long bony tail like a reptile. Only six fossil specimens are known. Because these specimens differ greatly in size, some scientists think they are different species rather than individuals from the same species. If the specimens belong to the same species and differ in size because some are younger than others, there should be a positive linear relationship between the bones from all individuals. An outlier from this relationship would suggest a different species. Here are data on the lengths in centimeters of the femur (a leg bone) and the humerus (a bone in the upper arm) for the five specimens that preserve both bones. femur humerus Load data into list 1 and list 2 and make a scatterplot.

This is not enough. What do we need?

femur length in cm humerus length in cm A “cheater” way to put scale on a scatterplot is to trace two points and label each axis with those two values.

femur length in cm humerus length in cm explanatory variable? response variable? femur length in cm humerus length in cm But does it really matter here?No. But often it does.

Find the correlation coefficient and explain what it means.

correlation coefficient Did you get it?

If you did not get the correlation coefficient, you must turn your diagnostics on. Push 2 nd then 0. Scroll down to diagnostics on. Push “enter” twice and little calculator guy will say “done”.

Find the correlation coefficient and explain what it means. correlation coefficient

r =.994 The correlation coefficient is ALWAYS between -1 and 1. It does not change when the units or scale is transformed. Let’s check out the formula sheet.

r =.994 What does it mean? The correlation coefficient describes the strength of the linear relationship. The closer it is to 1 or -1 the more the points line up. These points line up pretty well with a negative slope. The correlation coefficient would be close to -1. graph on the bottom of your notes

r =.994 What does it mean? The correlation coefficient describes the strength of the linear relationship. The closer it is to 1 or -1 the more the points line up. These points line up pretty well with a positive slope. The correlation coefficient would be close to 0.8 or 0.9.

r =.994 What does it mean? The correlation coefficient describes the strength of the linear relationship. The closer it is to 1 or -1 the more the points line up. These points don’t line up at all. The correlation coefficient would be nearly 0.

r =.994 What does it mean? The correlation coefficient describes the strength of the linear relationship. The closer it is to 1 or -1 the more the points line up. These points line up sort of well with a negative slope. The correlation coefficient might be – 0.6 or – 0.7.

r =.994 What does it mean? The correlation coefficient describes the strength of the linear relationship. The closer it is to 1 or -1 the more the points line up. These points don’t line up at all. The correlation coefficient would be fairly close to 0.

r =.994 Here’s what you write: This suggests a strong, positive, linear relationship between femur length and humerus length.

So what’s the rest of this stuff?

slope y-intercept coefficient of determination equation: ŷ = 1.197x – This is hugely important! It means the predicted y.

equation: ŷ = 1.197x – where x = femur length and y = humerus length slope = ; For every 1 cm increase in femur length, the humerus length increases by cm, on average. y-intercept ; When the femur length is 0 cm, the humerus length is about cm. Of course, this is ridiculous and is an example of extrapolation.

Residuals Since our line misses many of the points, a residual is a measure of the “miss.” residual = y – ŷ (actual – predicted) a residual is the vertical distance from the point to the line

What is the residual for the point (56, 63)? residual = y – ŷ ŷ = 1.197x – ŷ = 1.197(56) – = residual = y – ŷ = 63 – = Find the residual for the point (74, 84)

A residual plot is a graph of all the residuals. To get resid, push 2 nd stat resid This only works if the little guy knows the equation of the line.

femur length in cm residuals This is a horrible residual plot. We’d like the points to be equally scattered above and below the line. Residual Plot

That’s it for dinosaurs today.

Limitations of Correlation and Regression Correlation is linear only. One influential point or incorrectly entered data point can greatly change the regression line. Correlation is not robust (resistant to outliers). Correlations based on averages are usually too high when applied to individuals. Extrapolation can yield silly results. Predictions for y should be made using the range of values in the data. Correlation does not imply a cause-and-effect relationship.

Examples of a perfect linear fit A job pays $10 per hour. The relationship between hours worked and pay. hours worked pay

Examples of a perfect linear fit The association between hours worked and time spent pursuing hedonistic pleasures. hours worked hedonistic pleasure time Here we could switch x and y. Sometimes we are simply curious about an association.

Time for more data?