Lecture 6 Correlation Microarray example Defining correlation : Procedure of computing correlation (1)standardize x, (2)standardize y, (3)average product.

Slides:



Advertisements
Similar presentations
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Advertisements

Correlation Minium, Clarke & Coladarci, Chapter 7.
Biostatistics Unit 5 Samples Needs to be completed. 12/24/13.
Chapter 4 Systems of Linear Equations; Matrices
Chapter 6 Trade-Off Between Risk & Return
Risk and Return Learning Module.
EC220 - Introduction to econometrics (review chapter)
Chapter 25 Risk Assessment. Introduction Risk assessment is the evaluation of distributions of outcomes, with a focus on the worse that might happen.
Exponents and Radicals
Stat 13 Final Review A. Probability tables to use. B. variance algebra, correlation, covariance, regression C. Probability and Conditional probability.
Chapter 5 The Mathematics of Diversification
Regression and correlation methods
The Simple Regression Model
Correlation and regression Dr. Ghada Abo-Zaid
Lectures prepared by: Elchanan Mossel Yelena Shvets Introduction to probability Stat 134 FAll 2005 Berkeley Follows Jim Pitman’s book: Probability Sections.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter.
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
SOME STATISTICAL CONCEPTS Chapter 3 Distributions of Data Probability Distribution –Expected Rate of Return –Variance of Returns –Standard Deviation –Covariance.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
Lecture 4 Rescaling, Sum and difference of random variables: simple algebra for mean and standard deviation (X+Y) 2 =X 2 + Y XY E (X+Y) 2 = EX 2.
Correlation 10/30. Relationships Between Continuous Variables Some studies measure multiple variables – Any paired-sample experiment – Training & testing.
Regression Analysis British Biometrician Sir Francis Galton was the one who used the term Regression in the later part of 19 century.
2.1a Polynomial Functions Linear Functions Linear Correlation/Modeling Lots of new/old info. to start our next chapter!!!
MAT 254 – Probability and Statistics Sections 1,2 & Spring.
Measuring Returns Converting Dollar Returns to Percentage Returns
© 2012 Cengage Learning. All Rights Reserved. May not scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part. Chapter.
1 MBF 2263 Portfolio Management & Security Analysis Lecture 2 Risk and Return.
Multiplying and Dividing Real Numbers Objective: To multiply and divide real numbers.
Portfolio Management-Learning Objective
Lecture 15: Expectation for Multivariate Distributions Probability Theory and Applications Fall 2008 Those who ignore Statistics are condemned to reinvent.
Calculating Expected Return
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
Investment Analysis and Portfolio management Lecture: 24 Course Code: MBF702.
Some Background Assumptions Markowitz Portfolio Theory
Investment Analysis and Portfolio Management Chapter 7.
 Lecture #9.  The course assumes little prior applied knowledge in the area of finance.  References  Kristina (2010) ‘Investment Analysis and Portfolio.
The Correlation Coefficient. Social Security Numbers.
Review of Probability Concepts ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes SECOND.
Chapter 08 Risk and Rate of Return
Basic Statistics Correlation Var Relationships Associations.
1 Risk Learning Module. 2 Measures of Risk Risk reflects the chance that the actual return on an investment may be different than the expected return.
PROBABILITY CONCEPTS Key concepts are described Probability rules are introduced Expected values, standard deviation, covariance and correlation for individual.
Investment Analysis and Portfolio Management First Canadian Edition By Reilly, Brown, Hedges, Chang 6.
Mean and Standard Deviation of Discrete Random Variables.
CHAPTER SEVEN Risk, Return, and Portfolio Theory J.D. Han.
6.3 THE CENTRAL LIMIT THEOREM. DISTRIBUTION OF SAMPLE MEANS  A sampling distribution of sample means is a distribution using the means computed from.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Correlation. Correlation is a measure of the strength of the relation between two or more variables. Any correlation coefficient has two parts – Valence:
Pearson Correlation Coefficient 77B Recommender Systems.
Chapter 5 Joint Probability Distributions The adventure continues as we consider two or more random variables all at the same time. Chapter 5B Discrete.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Lecture 3 1 Recap Random variables Continuous random variable Sample space has infinitely many elements The density function f(x) is a continuous function.
Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
Types of risk Market risk
Portfolio Risk Lecture 14.
PRODUCT MOMENTS OF BIVARIATE RANDOM VARIABLES
5AC007 Business Finance Lecture Week 9
Types of risk Market risk
Chapter 3: Getting the Hang of Statistics
Chapter 3 Statistical Concepts.
Chapter 3: Getting the Hang of Statistics
Financial Econometrics Fin. 505
Presentation transcript:

Lecture 6 Correlation Microarray example Defining correlation : Procedure of computing correlation (1)standardize x, (2)standardize y, (3)average product of standardized x and standardized y properties.: between -1 and 1 Three special cases : perfect positive relationship= 1, perfect negative relationship= -1 and no correlation =0 Back to the stock example. Stock example: stock prices are likely to be correlated. Need a measure of strength of correlation.

Definition of correlation coefficient If X and Y both have mean 0 and variance 1, then correlation coefficient  = E (XY) Correlation remains the same under any scale changes For the general case, standardize each variable first. Cov (X, Y) = E (X-mean of X)(Y-mean of Y) If you forgot to divide by SD, then you obtained a quantity called Covariance, which is still useful (see next page) Without subtracting the mean, you got E(XY), a garbage ! A remedy : cov(X,Y) = E(XY) - E(X) E(Y)

Correlation coefficient= cov(X,Y)/SD(X)SD(Y), where cov(X,Y)= E [(X-mean) (Y- mean)] Use the independence example (from lecture 4) to construct positive correlation by cutting of the points on the edge Do a step by step calculation of corr. Coeff. Do a plotting showing 4 quadrants by drawing vertical and horizontal lines passing the means.

(+, +) X, Y both higher than mean (-, -) X,Y both lower than mean (-, +) X lower than mean Y higher than mean (+, -) X higher than mean, Y lower than mean Product=positive Product=positive Product=negative Product=negative

Conceptual : Step by step for Corr Coeff. Stdzd = standardized (remove mean, divided by SD) xystdzd xstdzd yproduct 24-5/SD(X)- 1.5/SD(Y ) 7.5/SD(X)SD(Y) 43-3/SD(X) -2.5/SD(Y)7.5/SD(X)SD(Y) 66-1/SD(X) 0.5/SD(Y)-0.5/SD(X)SD(Y) 851 /SD(X) -0.5/SD(Y)-0.5/SD(X)SD(Y) 1083/SD(X) 2.5/SD(Y)7.5/SD(X)SD(Y) 1275/SD(X) 1.5/SD(Y)7.5/SD(X)SD(Y) E X=7E Y=5.5SD( X) = sqrt(35/3)=3.4 X-EX Y-EY Consistency : if use n-1 in doing SD, then use n-1 for averaging product Use population version, so divided by n SD(Y)= 1.7 Corr =(29/6)/3.4 times 1.7=29/35=0.828

Practice: Step by step for Covariance,variance, and correlation coefficients. E X=7E Y=5.5 SD( X) =3.4 sqrt(35/3)=3.4 Consistency : if use n-1 in doing SD, then use n-1 for averaging product Use population version, so divided by n SD(Y)=1.7Corr=0.828 =cov/sd(x)sd(y) xyX-EXY-EYprod uct (X-EX) 2 (Y-EY) Cov =29/6

Positive correlations Corr = 0.9 Corr =.8 Corr =.5 On line illustration with Xlispstat, using (bi-normal r n)

Algebra for Variance, covariance Var(X+Y)= Var X + Var Y + 2 cov (X,Y) Var(X) = Cov (X, X) Var (X+a)= Var (X) Cov (X+a, Y+ b)= Cov(X,Y) Cov (aX, bY)=ab Cov(X,Y) Var(aX) =a 2 Var (X) Cov( X+Y, Z)= cov(X,Z) + cov (Y,Z) Cov (X+Y, V+W)= cov(X,V) + cov (X, W) + cov (Y, W) + cov(Y,W) TRICK : pretend all means are zero; (X+Y)(V+W)=XV+XW+YW+YW

Stock prices are correlated Effect on variance of option 1 and option Recall the problem

Example Stock A and Stock B Current price : both the same, $10 per share Predicted performance a week later: similar Both following a normal distribution with Mean $10.0 and SD $1.0 You have twenty dollars to invest Option 1 : buy 2 shares of A portfolio mean=?, SD=? Option 2 : buy one share of A and one share of B Which one is better? Why? Assume that there is a correlation of.8 between the prices of stock A and stock B a week later

Better? In what sense? What is the probability that portfolio value will be higher than 22 ? What is the probability that portfolio value will be lower than 18? What is the probability that portfolio value will be between18 and 22? ( How about if correlation equals 1 ?)

For option 2, the key is to find variance Let X be the future price of stock A Let Y be the future price of stock B Let T = X + Y portfolio value E T = E X + E Y (same as done before) Var T = Var X + Var Y + 2 cov (X, Y) Cov (X, Y) = correlation times SD(X) SD(Y) =.8 times 1 times 1 = 0.8 Var X = (SD (X) ) 2 =1 2 =1; Var Y = 1 Var T = times.8 = 3.6 (compared to 1+1=2 when assuming independence) SD (T) = squared root of 3.6=1.9 is still less than SD for option 1

Index Index is usually constructed as a weighted average of several variables Stock index Course grade =.2 midterm+.45 Final +.15 HW +.2 LAB Find SD of course grade Independence; dependence