Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.

Slides:



Advertisements
Similar presentations
MTH 161: Introduction To Statistics
Advertisements

Correlation Chapter 9.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
CORRELATON & REGRESSION
Describing the Relation Between Two Variables
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Chapter 10 Relationships between variables
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 5 Continuous Random Variables and Probability Distributions
SIMPLE LINEAR REGRESSION
Describing Relationships: Scatterplots and Correlation
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression 1. Bivariate data When measurements on two characteristics are to be studied simultaneously because of their interdependence,
Scatter Diagrams and Correlation
Lecture 28 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.
Lecture 3-2 Summarizing Relationships among variables ©
Linear Regression.
SIMPLE LINEAR REGRESSION
Correlation Scatter Plots Correlation Coefficients Significance Test.
Correlation and regression 1: Correlation Coefficient
Covariance and correlation
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Is there a relationship between the lengths of body parts ?
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
Lecture 3 A Brief Review of Some Important Statistical Concepts.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 1 – Slide 1 of 30 Chapter 4 Section 1 Scatter Diagrams and Correlation.
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
Correlation Association between 2 variables 1 2 Suppose we wished to graph the relationship between foot length Height
C.2000 Del Siegle for Created by Del Siegle For EPSY 5601 You will need to repeatedly click your mouse or space bar to progress through the information.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Association between 2 variables
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Basic Statistics Correlation Var Relationships Associations.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Scatter Diagrams and Correlation Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
LINEAR EQUATIONS & THEIR GRAPHS CHAPTER 6. INTRODUCTION We will explore in more detail rates of change and look at how the slope of a line relates to.
CORRELATION ANALYSIS.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Correlation Assumptions: You can plot a scatter graph You know what positive, negative and no correlation look like on a scatter graph.
Dr Hidayathulla Shaikh Correlation and Regression.
Correlation and regression by M.Shayan Asad
CORRELATION. Correlation  If two variables vary in such a way that movement in one is accompanied by the movement in other, the variables are said to.
Linear Regression and Correlation Analysis
Chapter 5 STATISTICS (PART 4).
Elementary Statistics
Ch11 Curve Fitting II.
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Warsaw Summer School 2017, OSU Study Abroad Program
CORRELATION & REGRESSION compiled by Dr Kunal Pathak
Presentation transcript:

Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics

Review of Previous Lecture In last lecture we discussed: Joint Distributions Moment Generating Functions Covariance Related Examples 2

Objectives of Current Lecture In the current lecture: Covariance: Some important Results Describing Bivariate Data Scatter Plot Concept of Correlation Properties of Correlation Related examples and Excel Demo 3

Covariance 4

Covariance NOTE 2: If X and Y are INDEPENDENT, then E(XY)=E(X) E(Y) Hence Cov(X,Y)=0 NOTE 3: Converse of above results DOESN’T Hold, i.e. if Cov(X,Y)=0 then it doesn’t mean X and Y are independent. e.g. Let X be Normal r.v with mean zero and Y=X 2 then obviously X and Y are NOT independent. Now Cov(X,Y)=Cov( X, X 2 )=E(X 3 )-E(X 2 )E(X) =E(X 3 )-E(X 2 )*(0)[since E(X)=0] =E(X 3 ) =0 [Since Normal is symmetric] Hence, Zero Covariance doesn’t imply Independence. 5

Covariance Do Excel Demo 6

Describing Bivariate Data Sometimes, our interest lies in finding the “relationship”, or “association”, between two variables. This can be done by the following methods: Scatter Plot Correlation Regression Analysis 7

Scatter Plot A first step in finding whether or not a relationship between two variables exists, is to plot each pair of independent-dependent observations {(Xi, Yi)}, i=1,2,..,n as a point on a graph paper. Such a diagram is called a Scatter Diagram or Scatter Plot. Usually, independent variable is taken along X-axis and dependent variable is taken along Y-axis. 8

Suppose we wished to graph the relationship between foot length Height Foot Length and height In order to create the graph, which is called a scatterplot or scattergram, we need the foot length and height for each of our subjects. of 20 subjects.

1. Find 12 inches on the x-axis. 2. Find 70 inches on the y-axis. 3. Locate the intersection of 12 and Place a dot at the intersection of 12 and 70. Height Foot Length Assume our first subject had a 12 inch foot and was 70 inches tall.

5. Find 8 inches on the x-axis. 6. Find 62 inches on the y-axis. 7. Locate the intersection of 8 and Place a dot at the intersection of 8 and Continue to plot points for each pair of scores. Assume that our second subject had an 8 inch foot and was 62 inches tall.

Notice how the scores cluster to form a pattern. The more closely they cluster to a line that is drawn through them, the stronger the linear relationship between the two variables is (in this case foot length and height).

Notice how the scores cluster to form a pattern. The more closely they cluster to a line that is drawn through them, the stronger the linear relationship between the two variables is (in this case foot length and height).

If the points on the scatterplot have an upward movement from left to right, we say the relationship between the variables is positive.

If the points on the scatterplot have an upward movement from left to right, we say the relationship between the variables is positive.

If the points on the scatterplot have a downward movement from left to right, we say the relationship between the variables is negative. If the points on the scatterplot have an upward movement from left to right, we say the relationship between the variables is positive.

A positive relationship means that high scores on one variable are associated with high scores on the other variable are associated with low scores on the other variable. It also indicates that low scores on one variable

A negative relationship means that high scores on one variable are associated with low scores on the other variable. are associated with high scores on the other variable. It also indicates that low scores on one variable

Scatter Plot of No relationship 19

Correlation Correlation measures the direction and strength of the linear relationship between two random variables. In other words, two variables are said to be correlated if they tend to vary in some direction simultaneously. If both variables tend to increase (or decrease) together, the correlation is said to be direct or positive. E.g. The length of an iron bar will increase as the temperature increases. If one variable tends to increase as the other variable decreases, the correlation is said to be inverse or negative. E.g. If time spent on watching TV increases, then Grades of students decrease. If a variable neither increases nor decreases in response to an increase or decrease in other variable then the correlation is said to be Zero. E.g. The correlation between the shoe price and time spent on exercise is zero. 20

Correlation Notations: For population data, it is denoted by the Greek letter (ρ) For sample data it is denoted by the roman letter r or r xy. Range: Correlation always lies between -1 and 1 inclusive. -1 means perfect negative linear association 0 means No linear association +1 means perfect positive linear association 21

Correlation Note: In correlation analysis, both the variables are random and hence treated symmetrically, i.e. there is NO distinction between dependent and independent variables. In regression analysis (to be discussed in forthcoming lectures), we are interested in determining the dependence of one variable (that is random) upon the other variable that is non-random or fixed and in addition, we are interested in predicting the average value of the dependent variable by using the known values of other variable (called independent variable). 22

Correlation There is no assumption of causality The fact that correlation exists between two variables does not imply any Cause and Effect relationship but it describes only the linear association. Correlation is a necessary, but not a sufficient condition for determining causality. 23

Correlation Example: Two unrelated variables such as ‘sale of bananas’ and ‘the death rate from cancer’ in a city, may produce a high positive correlation which may be due to a third unknown variable (called confounding variable, namely, the city population). The larger the city, the more consumption of bananas and the higher will be the death rate from cancer. Clearly, this is a false of merely incidental correlation which is the result of a third variable, the city size. Such a false correlation between two unconnected variables is called Spurious or non-sense correlation. Therefore one should be very careful in interpreting the correlation coefficient as a measure of relationship or interdependence between two variables. 24

Correlation: Computation 25

Correlation: Computation Computationally easier version is: OR Note: r is a pure number and hence is unit less. 26

Correlation: Computation Example: Consider a hypothetical data on two variables X and Y. Calculate product moment coefficient of correlation between X and Y. 27 XY

Correlation: Computation Solution: 28 XY(X-Xbar)(X-Xbar) 2 (Y-Ybar)(Y-Ybar) 2 (X-Xbar)* (Y-Ybar) Total=

Correlation: Computation Solution: 29 XY(X-Xbar)(X-Xbar) 2 (Y-Ybar)(Y-Ybar) 2 (X-Xbar)* (Y-Ybar) Total=

Correlation: Computation Alternative Method: 30 XY Total=1525

Correlation: Computation Alternative Method: replacing values and simplifying, we get, r= XYX2X2 Y2Y2 XY Total=

Properties Correlation only measures the strength of a linear relationship. There are other kinds of relationships besides linear. Correlation is symmetrical with respect to the variables X and Y, i.e. r xy =r yx Correlation coefficient ranges from -1 to +1. Correlation is not affected by change of origin and scale. i.e. correlation does not change if the you multiply, divide, add, or subtract a value to/from all the x-values or y-values. Assumes a linear association between two variables. 32

Review Let’s review the main concepts: Covariance: Some important Results Describing Bivariate Data Scatter Plot Concept of Correlation Properties of Correlation Related examples and Excel Demo 33

Next Lecture In next lecture, we will study: Common misconceptions about correlation Related Examples 34