1 Further Maths Chapter 4 Displaying and describing relationships between two variables.

Slides:



Advertisements
Similar presentations
SCATTERPLOT AND PERSONS PRODUCT- MOMENT CORRELATION COEFFICIENT. EXERCISE 2E AND 2F.
Advertisements

Describing Quantitative Variables
Chapter 3 Examining Relationships
Chapter 4 The Relation between Two Variables
IB Math Studies – Topic 6 Statistics.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
CORRELATION COEFFICIENTS What Does a Correlation Coefficient Indicate? What is a Scatterplot? Correlation Coefficients What Could a Low r mean? What is.
Describing Relationships: Scatterplots and Correlation
Correlation and Regression Analysis
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
Correlation and regression 1: Correlation Coefficient
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Association between 2 variables We've described the distribution of 1 variable in Chapter 1 - but what if 2 variables are measured on the same individual?
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Bivariate Data When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
VCE Further Maths Chapter Two-Bivariate Data \\Servernas\Year 12\Staff Year 12\LI Further Maths.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
The Correlational Research Strategy
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
DESCRIPTIVE STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
1.1 example these are prices for Internet service packages find the mean, median and mode determine what type of data this is create a suitable frequency.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 7 Scatterplots, Association, and Correlation.
Aim: How do we analyze data with a two-way table?
Association between 2 variables We've described the distribution of 1 variable - but what if 2 variables are measured on the same individual? Examples?
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
4.2 Correlation The Correlation Coefficient r Properties of r 1.
1 Association  Variables –Response – an outcome variable whose values exhibit variability. –Explanatory – a variable that we use to try to explain the.
We would expect the ENTER score to depend on the average number of hours of study per week. So we take the average hours of study as the independent.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 12: Correlation and Linear Regression 1.
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Statistics 7 Scatterplots, Association, and Correlation.
Scatterplots & Correlations Chapter 4. What we are going to cover Explanatory (Independent) and Response (Dependent) variables Displaying relationships.
Two-Variable Data Analysis
Chapter 2 Bivariate Data Scatterplots.   A scatterplot, which gives a visual display of the relationship between two variables.   In analysing the.
Unit 1 Review. 1.1: representing data Types of data: 1. Quantitative – can be represented by a number Discrete Data Data where a fraction/decimal is not.
Chapter 12: Correlation and Linear Regression 1.
Calculating the correlation coefficient
CHAPTER 3 Describing Relationships
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Summarising and presenting data - Bivariate analysis
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Summarizing Bivariate Data
Association between 2 variables
Warsaw Summer School 2017, OSU Study Abroad Program
Association between 2 variables
Presentation transcript:

1 Further Maths Chapter 4 Displaying and describing relationships between two variables

2 Bivariate Data Often we are interested to see if a relationship exists between two variables, i.e. – Hours studied and study score, – Gender and resting pulse rate – Hair colour and Favourite Sport

3 Dependent and Independent Variable The first thing that needs to be considered is if there is some sort of dependency relationship between the two i.e. which of the two variables is likely to depend on the other? Does a person’s study score depend on number of hours studied or does the number of hours a person studies depend on their study score? Complete the other examples

4 Dependent and Independent Variable In the first case, the study score is the dependent variable, hours studied is the independent variable. In the second case, the resting pulse rate is the dependent variable, gender is the independent variable In the third case you would not expect there to be any relationship

5 Dependent and Independent Variable The variable that can be controlled is referred to as the independent variable. The variable that is measured in response is referred to as the dependent variable.

6 3 possible cases There are three possible situations that may occur when considering two variables at the one time Both variables are categorical One variable is categorical and the other is numerical Both variables are numerical

7 Categorical and Categorical One hundred people were randomly selected and surveyed as to whether they were in favour of lowering the speed limit in suburban streets to 40 km/h.

8 Categorical and Categorical When the data was tabulated it was found that from the males interviewed, 25 were in favour and the rest against. Of the females, 20 were in favour and the rest against.

9 Categorical and Categorical The group consisted of 65 males and 35 females. Each person voted for or against the proposal. The independent variable is The dependent variable is

10 Categorical and Categorical A two way frequency table is an appropriate way to display this data The independent variable should be put in the columns. The dependent variable should be put in the rows.

11 Categorical and Categorical Gender MaleFemale OpiOpi In favour niOnniOn Not in favour Total

12 Categorical and Categorical Gender MaleFemale OpiOpi In favour 2520 nionnion Not in favour 4015 Total6535

13 Categorical and Categorical Does this table appear to indicate that men are more in favour of lowering the speed limit than women? Discuss

14 Categorical and Categorical Two way frequency table (appropriately percentaged) Since the independent variable is in the columns, we need to calculate column percentages.

15 Categorical and Categorical Gender MaleFemale OpiOpi In favour 25 / 65 * 100 = 38.5% 20 / 35 * 100 = 57.1% niOnniOn Not in favour 40 / 65 * 100 = 61.5% 15 / 35 * 100 = 42.9% Total100%

16 Categorical and Categorical This can then be displayed using a percentaged segmented barchart.

17

18 Categorical and Categorical Report 57.1% of females are in favour of lowering the speed limit compared to 38.5 % of men. Women are clearly more in favour of lowering the speed limit.

19 Exercises Exercise 4A Pages 88 – 89 all Exercise 4B Page 91 all

20 Numerical and Categorical An investigation was carried out to see if there was a relationship between gender and resting pulse rate. Data was collected of the resting pulse rates of 23 boys and 23 girls. Dependent variable is Independent variable is Numerical variable is Categorical variable is Pulse rate Gender Pulse rate Gender

21 Numerical and Categorical Males Females

22 Back to Back Stem and Leaf Plot A back to back stem and leaf plot can be used to display the relationship between a numerical variable and a two valued categorical variable. malesfemales

23 Parallel Boxplots Parallel box plots can be used to display the relationship between a numerical variable and a two or more level categorical variable. Calculator Display Exercise 4C Page 93 Questions 1-3

24 Numerical and Numerical The following data was collected from 10 students Average hours of study for Further Mathematics per week Study Score

25 Numerical and Numerical The independent variable is The dependent variable is The appropriate way to display this data is by use of a scatterplot. The independent variable should always be found on the horizontal axis The dependent variable should always be found on the vertical axis.

26

27 Calculator Calculator Display Exercise 4D Page 96 all

28 Interpreting a Scatterplot When describing a scatterplot the following four features should be discussed Direction Form Strength Outliers

29 Direction The direction of a scatterplot can either be positive or negative

30 Form The form of a scatterplot can either be linear or non-linear

31 Strength The strength of a scatterplot can be either perfect, strong, moderate or weak.

32 Strength

33 Outliers Outliers are any points that are separated from the general body of points

34 Discuss

35 Discuss

36 Discuss

37 Correlation coefficient (r) r is a measure of the strength of the linear relationship between two numerical variables. The value of r is between –1 (perfect negative linear relationship) and 1 (perfect positive linear relationship). It is not appropriate to calculate a correlation coefficient if there are outliers in the data.

38 Estimating r Board Work Exercise 4E Page all

39 Calculating Pearson’s r r =  ( x – x ) ( y – y ) ( n –1 ) s x s y The key assumptions when using Pearson’s r is that the data is linear and that there are no outliers. Calculator demonstration Exercise 4F Pages Question 1,3

40 Coefficient of Determination (r 2 ) The coefficient of determination is the square of Pearson’s correlation coefficient. It is used to explain the degree to which one variable can be predicted from another variable

41 Coefficient of determination The coefficient of determination gives the percentage variation (r 2 * 100) in the dependent variable that is explained by the variation in the independent variable.

42 Example Average hours of study for Further Mathematics per week Study Score

43 Example Calculate r and interpret Calculate r 2 Interpret r 2

44 Example of the variation in the study score can be explained by the variation in the number of hours studied per week. The other is due to other factors.

45 Warning Pearsons r can be positive or negative depending on the direction of the scatterplot. The coefficient of determination will always be positive and is normally expressed as a percentage.

46 Warning If r 2 is equal to 0.36, then what will r be?

47 Exercise Complete Exercise 4G Pages all

48 Correlation and Causation For a number of rural towns the number of nightclubs was recorded, as were the number of churches. The following data resulted Graph this relationship as a scatterplot Number of churches Number of nightclubs

49 Correlation and Causation

50 Correlation and Causation Interpret this graph. As the number of churches increases, so does the number of nightclubs. Therefore an increase in the number of churches will lead to or cause an increase in the number of nightclubs. As people become more religious they will tend to visit nightclubs more. Nightclubs will be full of super religious people.

51 Correlation and Causation An increase in one variable will not always cause an increase in the other. In this situation there is a third variable that is hidden I.e. Population Therefore we never use the word cause when describing a relationship. We say As the number of churches increases, the number of nightclubs tends to increase. Exercise 4H Page 108 – 109 all

52 Chapter 4 Exercise 4I Page 109