UNIT 2 BIVARIATE DATA. BIVARIATE DATA – THIS TOPIC INVOLVES…. y-axis DEPENDENT VARIABLE x-axis INDEPENDENT VARIABLE.

Slides:



Advertisements
Similar presentations
4.7 The coefficient of determination r2
Advertisements

Lesson 10: Linear Regression and Correlation
Eight backpackers were asked their age (in years) and the number of days they backpacked on their last backpacking trip. Is there a linear relationship.
EOC Review Line of Best Fit
Objectives Fit scatter plot data using linear models with and without technology. Use linear models to make predictions.
2-7 Curve Fitting with Linear Models Warm Up Lesson Presentation
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
LSRL Least Squares Regression Line
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Business Statistics - QBM117 Least squares regression.
Correlation & Regression Math 137 Fresno State Burger.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Lecture 3: Bivariate Data & Linear Regression 1.Introduction 2.Bivariate Data 3.Linear Analysis of Data a)Freehand Linear Fit b)Least Squares Fit c)Interpolation/Extrapolation.
Correlation Scatter Plots Correlation Coefficients Significance Test.
Correlation and regression 1: Correlation Coefficient
VCE Further Maths Least Square Regression using the calculator.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Researchers, such as anthropologists, are often interested in how two measurements are related. The statistical study of the relationship between variables.
New Seats – Block 1. New Seats – Block 2 Warm-up with Scatterplot Notes 1) 2) 3) 4) 5)
Statistical Analysis Topic – Math skills requirements.
2-7 Curve Fitting with Linear Models LESSON PLAN Warm Up (Slide #2)
Sec 1.5 Scatter Plots and Least Squares Lines Come in & plot your height (x-axis) and shoe size (y-axis) on the graph. Add your coordinate point to the.
Variation and Prediction Intervals
Regression Regression relationship = trend + scatter
2-7 Curve Fitting with Linear Models Warm Up Lesson Presentation
Correlation The apparent relation between two variables.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Scatter Plots, Correlation and Linear Regression.
We would expect the ENTER score to depend on the average number of hours of study per week. So we take the average hours of study as the independent.
1.5 Scatter Plots and Least-Squares Lines Objectives : Create a scatter plot and draw an informal inference about any correlation between the inference.
Linear Regression Day 1 – (pg )
UNIT 2 BIVARIATE DATA. BIVARIATE DATA – THIS TOPIC INVOLVES…. y-axis DEPENDENT VARIABLE x-axis INDEPENDENT VARIABLE.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
Wednesday Today you need: Whiteboard, Marker, Eraser Calculator 1 page handout.
Section 1.6 Fitting Linear Functions to Data. Consider the set of points {(3,1), (4,3), (6,6), (8,12)} Plot these points on a graph –This is called a.
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
ContentDetail  Two variable statistics involves discovering if two variables are related or linked to each other in some way. e.g. - Does IQ determine.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese and.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
4.5 – Analyzing Lines of Best Fit Today’s learning goal is that students will be able to Use residuals to determine how well lines of fit model data. Distinguish.
Chapter 4. Correlation and Regression Correlation is a technique that measures the strength of the relationship between two continuous variables. For.
Wednesday: Need a graphing calculator today. Need a graphing calculator today.
Fitting Lines to Data Points: Modeling Linear Functions Chapter 2 Lesson 2.
LINEAR GRAPHS AND FUNCTIONS UNIT ONE GENERAL MATHS.
Correlation & Linear Regression Using a TI-Nspire.
Two-Variable Data Analysis
CHAPTER 3 Describing Relationships
Lesson 4.5 Topic/ Objective: To use residuals to determine how well lines of fit model data. To use linear regression to find lines of best fit. To distinguish.
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
Objectives Fit scatter plot data using linear models with and without technology. Use linear models to make predictions.
Linear graphs and models
Chapter 5 LSRL.
Chapter 4 Correlation.
Chapter 3.2 LSRL.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Least Squares Regression Line LSRL Chapter 7-continued
4.5 Analyzing Lines of Fit.
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Objectives Vocabulary
LEARNING GOALS FOR LESSON 2.7
Presentation transcript:

UNIT 2 BIVARIATE DATA

BIVARIATE DATA – THIS TOPIC INVOLVES…. y-axis DEPENDENT VARIABLE x-axis INDEPENDENT VARIABLE

INDEPENDENT VS DEPENDENT VARIABLES The value of the DEPENDENT variable depends on the INDEPENDENT variable. eg. Identify the INDEPENDENT and DEPENDENT variable in each case: a. The time spent filling up a swimming pool with water compared to the size of the pool. b. The hours per week spent doing laundry compared with the number of children living in the house. c. The last time you went to the hairdresser and the length of your hair. d. A child’s height compared to their age.

INDEPENDENT VS DEPENDENT VARIABLES If you need more help with this, view the following video tutorial dependent-variables/ dependent-variables/

SCATTERPLOTS If you need more help with this, view the following video tutorial

SCATTERPLOTS If you need more help with this, view the following video tutorial

SCATTERPLOTS Study (hrs) Result (%)

Study Time(hrs) Result (%)

The following scatterplot is produced. Comparing this scatterplot to the graphs on our worksheet given in last lesson, we can say that there is a Strong Positive relationship between the number of hours studying and the score achieved on the test. Study Time(hrs) Result (%) Study Time Result

NOW TRY WORKBOOK SECTION ONE

PEARSON’S PRODUCT CORRELATION COEFFICIENT What is it? What is it? o Correlation between sets of data is a measure of how well they are related. The most common measure of correlation in stats is the Pearson Correlation. o It shows the linear relationship between two sets of data. o The correlation coefficient can be represented as r o Calculated using the equation: o Luckily, our calculator can solve this for us!

PEARSON’S PRODUCT CORRELATION COEFFICIENT What does it look like? What values can we expect? What does it look like? What values can we expect? o The value will be between -1 and 1. o 1 tells us there is a perfect positive relationship o -1 tells us there is a perfect negative relationship o Relationships for r values in between are given in the following table

PEARSON’S PRODUCT CORRELATION COEFFICIENT If you need more help with this, view the following video tutorial coefficient/ coefficient/

PEARSON’S PRODUCT CORRELATION COEFFICIENT Finding the Correlation Coefficient ‘r’ using the calculator Finding the Correlation Coefficient ‘r’ using the calculator eg. Consider the problem given earlier – Results on a Maths test were compared to the time spent studying the day before the test. The following data was obtained from a sample of 10 students. This data gave the scatterplot: Study (hrs) Result (%) Study Time Result Using this data, we can use the calculator to find r. First, using the Statistics function on your calculator, place the data into List 1 and List 2 Using what we know about ‘r’, view the scatterplot and predict – what approximate value will r have?

PEARSON’S PRODUCT CORRELATION COEFFICIENT Finding the Correlation Coefficient ‘r’ using the calculator Finding the Correlation Coefficient ‘r’ using the calculator Study (hrs) Result (%) Xlist: list with Independent Variables Ylist: list with Dependent Variables

PEARSON’S PRODUCT CORRELATION COEFFICIENT Finding the Correlation Coefficient ‘r’ using the calculator Finding the Correlation Coefficient ‘r’ using the calculator Study (hrs) Result (%) Click OK. The following information is revealed. Reading r off this gives the Correlation Coefficient r = This confirms that there is a strong positive relationship between the amount of study done and the score achieved on the maths test.

THE COEFFICIENT OF DETERMINATION

NOW TRY WORKBOOK SECTION TWO

THE COEFFICIENT OF DETERMINATION o Is a measure that allows us to determine how certain one can be in making predictions just by looking at a certain model/graph. o The coefficient of determination, r 2, tells us how much the variation in the dependent variable can be explained by the variation in the independent variable (note: not whether one causes the other). o We find r 2 simply by squaring our r value, ALSO our calculator also generates this value for us o Because r values have to be between -1 and +1, these r² values always fall between 0 and 1. o When answering questions about the Coefficient of Determination, the way we word our response is important – Variations to the dependent variable are not caused by variations of the independent variable, rather we can say, The Variations of the Dependent Variable can be explained by variations of the independent variable.

THE COEFFICIENT OF DETERMINATION If you need more help with this, view the following video tutorial

THE COEFFICIENT OF DETERMINATION Click STATISTICS. Insert Data into lists Choose CALC, Linear Reg Xlist: Choose list which has the INDEPENDENT variable Ylist: Choose list which has the DEPENDENT variable

THE COEFFICIENT OF DETERMINATION Study (hrs) Result (%) Again, looking at our prior example: Results on a Maths test were compared to the time spent studying the day before the test. This data was obtained from a sample of 10 students.

THE COEFFICIENT OF DETERMINATION Exercise (hrs) Heart Rate Lets try another example: The resting heart rate of 10 individuals were compared with the number of hours per week they spent exercising, to see if one influences the other. Choose CALC, Linear Reg Click STATISTICS. Insert Data into lists

THE COEFFICIENT OF DETERMINATION Exercise (hrs) Heart Rate Lets try another example: The resting heart rate of 10 individuals were compared with the number of hours per week they spent exercising, to see if one influences the other.

NOW TRY WORKBOOK SECTION THREE

LEAST SQUARES REGRESSION LINE o The least squares regression line is simply a line of best fit of your scatter-plotted data. o Our calculator can easily be used to plot this line and give us the equation that represents this line. What does it look like?? y=

LEAST SQUARES REGRESSION LINE Exercise (hrs) Heart Rate Lets find the linear regression line for our earlier example: The resting heart rate of 10 individuals were compared with the number of hours per week they spent exercising. Choose CALC, Linear Reg Click STATISTICS. Insert Data into lists

LEAST SQUARES REGRESSION LINE Exercise (hrs) Heart Rate

NOW TRY WORKBOOK SECTION FOUR

LEAST SQUARES REGRESSION EQUATIONS { WITHOUT THE LISTS OF DATA } There may be times when you are given some information about a set of bivariate data, but you have no access to the original table of data. You may be asked to use this information to find r, r 2, or the equation of the least squares regression line. Where a and b can be found using:

LEAST SQUARES REGRESSION EQUATIONS { WITHOUT THE LISTS OF DATA }

eg2. The following values were found when comparing the width of strawberries compared to the hours of daylight the plant was exposed to: Mean width of strawberries = 25mm Mean hours of daylight = 8 hours Standard Deviation of width of strawberries = 5mm Standard Deviation of hours of daylight = 3 hours Pearson’s Correlation Coefficient = 0.92 Find the equation of the least squares (linear) regression line (follow the given steps….) 1) Decide which is the IV (x-data) and which is the DV (y-data) 2) Define your variables based on which is your IV and DV 3) Sub values into your “b” equation to find “b” 4) Sub values into your “a” equation to find “a” 5) Sub your “a” and “b” values into the “y” equation to give your answer. LEAST SQUARES REGRESSION EQUATIONS { WITHOUT THE LISTS OF DATA }

USING THE CALCULATOR TO FIND VALUES {IF WE HAVE DATA} Enter your data into lists Choose calc – Two-Variable Xlist : Independent Data Ylist : Dependent Data

NOW TRY WORKBOOK SECTION FIVE

MAKING PREDICTIONS {INTERPOLATION & EXTRAPOLATION}

More on Interpolation vs Extrapolation…. Interpolation – Is an estimation of a value within the given data set. Extrapolation – Is an estimation of a value which is outside of the given set of data. Predicting a value which is an Interpolation is much more reliable than predicting an Extrapolation. Example 2: The height of plants were measured and compared to amount of water they received weekly. A) Is predicting the height of a plant which receives 600ml of water reliable? 600ml – This value is within our data set, so is an INTERPOLATION – predictions generally reliable. B) Is predicting the height of a plant which receives 2000ml of water reliable? 2000ml – This value is outside of our data set, so is an EXTRAPOLATION – predictions not as reliable. Water (ml) Height (cm)

MAKING PREDICTIONS {INTERPOLATION & EXTRAPOLATION} Example 3: A study of the weekly supermarket shopping cost of various household income groups produced the following results shown in the table below. a)Determine the least squares regression line b)Predict the weekly spend for an income of $720 c)Predict the weekly income if the supermarket spend is $146 d)Predict the weekly spend for an income of $1500 e)How reliable are your predictions in b and d? Weekly Income ($) Weekly Spend ($) b – Interpolation (lies within our given data set) RELIABLE d – Extrapolation (lies outside of the data set) NOT OVERLY RELIABLE, INDICATION ONLY

NOW TRY WORKBOOK SECTION SIX