Multivariate Data and Matrix Algebra Review BMTRY 726 Spring 2012.

Slides:



Advertisements
Similar presentations
Applied Informatics Štefan BEREŽNÝ
Advertisements

Lecture 3: A brief background to multivariate statistics
Matrices A matrix is a rectangular array of quantities (numbers, expressions or function), arranged in m rows and n columns x 3y.
3_3 An Useful Overview of Matrix Algebra
Symmetric Matrices and Quadratic Forms
Ch 7.3: Systems of Linear Equations, Linear Independence, Eigenvalues
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Chapter 2 Matrices Definition of a matrix.
Ch 7.2: Review of Matrices For theoretical and computation reasons, we review results of matrix theory in this section and the next. A matrix A is an m.
1 Neural Nets Applications Vectors and Matrices. 2/27 Outline 1. Definition of Vectors 2. Operations on Vectors 3. Linear Dependence of Vectors 4. Definition.
Boot Camp in Linear Algebra Joel Barajas Karla L Caballero University of California Silicon Valley Center October 8th, 2008.
資訊科學數學11 : Linear Equation and Matrices
The Multivariate Normal Distribution, Part 2 BMTRY 726 1/14/2014.
Linear regression models in matrix terms. The regression function in matrix terms.
INDR 262 INTRODUCTION TO OPTIMIZATION METHODS LINEAR ALGEBRA INDR 262 Metin Türkay 1.
Matrices CS485/685 Computer Vision Dr. George Bebis.
Modern Navigation Thomas Herring
Matrix Approach to Simple Linear Regression KNNL – Chapter 5.
Arithmetic Operations on Matrices. 1. Definition of Matrix 2. Column, Row and Square Matrix 3. Addition and Subtraction of Matrices 4. Multiplying Row.
Stats & Linear Models.
5.1 Orthogonality.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 5QF Introduction to Vector and Matrix Operations Needed for the.
Today Wrap up of probability Vectors, Matrices. Calculus
Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka Virginia de Sa (UCSD) Cogsci 108F Linear.
A vector can be interpreted as a file of data A matrix is a collection of vectors and can be interpreted as a data base The red matrix contain three column.
: Appendix A: Mathematical Foundations 1 Montri Karnjanadecha ac.th/~montri Principles of.
Boyce/DiPrima 9th ed, Ch 7.3: Systems of Linear Equations, Linear Independence, Eigenvalues Elementary Differential Equations and Boundary Value Problems,
Compiled By Raj G. Tiwari
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
1 February 24 Matrices 3.2 Matrices; Row reduction Standard form of a set of linear equations: Chapter 3 Linear Algebra Matrix of coefficients: Augmented.
8.1 Vector spaces A set of vector is said to form a linear vector space V Chapter 8 Matrices and vector spaces.
Some matrix stuff.
Digital Image Processing, 3rd ed. © 1992–2008 R. C. Gonzalez & R. E. Woods Gonzalez & Woods Matrices and Vectors Objective.
Statistics and Linear Algebra (the real thing). Vector A vector is a rectangular arrangement of number in several rows and one column. A vector is denoted.
Matrices. Definitions  A matrix is an m x n array of scalars, arranged conceptually as m rows and n columns.  m is referred to as the row dimension.
Linear algebra: matrix Eigen-value Problems
Matrix Algebra and Regression a matrix is a rectangular array of elements m=#rows, n=#columns  m x n a single value is called a ‘scalar’ a single row.
Multivariate Statistics Matrix Algebra I W. M. van der Veld University of Amsterdam.
Matrix Differential Calculus By Dr. Md. Nurul Haque Mollah, Professor, Dept. of Statistics, University of Rajshahi, Bangladesh Dr. M. N. H. MOLLAH.
Linear algebra: matrix Eigen-value Problems Eng. Hassan S. Migdadi Part 1.
A Review of Some Fundamental Mathematical and Statistical Concepts UnB Mestrado em Ciências Contábeis Prof. Otávio Medeiros, MSc, PhD.
Introduction to Matrices and Matrix Approach to Simple Linear Regression.
1 Matrix Algebra and Random Vectors Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Meeting 18 Matrix Operations. Matrix If A is an m x n matrix - that is, a matrix with m rows and n columns – then the scalar entry in the i th row and.
Special Topic: Matrix Algebra and the ANOVA Matrix properties Types of matrices Matrix operations Matrix algebra in Excel Regression using matrices ANOVA.
Chapter 2 … part1 Matrices Linear Algebra S 1. Ch2_2 2.1 Addition, Scalar Multiplication, and Multiplication of Matrices Definition A matrix is a rectangular.
Review of Matrix Operations Vector: a sequence of elements (the order is important) e.g., x = (2, 1) denotes a vector length = sqrt(2*2+1*1) orientation.
Stats & Summary. The Woodbury Theorem where the inverses.
Matrices and Matrix Operations. Matrices An m×n matrix A is a rectangular array of mn real numbers arranged in m horizontal rows and n vertical columns.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
Instructor: Mircea Nicolescu Lecture 8 CS 485 / 685 Computer Vision.
STROUD Worked examples and exercises are in the text Programme 5: Matrices MATRICES PROGRAMME 5.
Unsupervised Learning II Feature Extraction
Boot Camp in Linear Algebra TIM 209 Prof. Ram Akella.
Linear Algebra Engineering Mathematics-I. Linear Systems in Two Unknowns Engineering Mathematics-I.
1 Objective To provide background material in support of topics in Digital Image Processing that are based on matrices and/or vectors. Review Matrices.
Matrix Algebra Definitions Operations Matrix algebra is a means of making calculations upon arrays of numbers (or data). Most data sets are matrix-type.
Lecture XXVI.  The material for this lecture is found in James R. Schott Matrix Analysis for Statistics (New York: John Wiley & Sons, Inc. 1997).  A.
Introduction to Vectors and Matrices
Review of Linear Algebra
CS479/679 Pattern Recognition Dr. George Bebis
Review of Matrix Operations
Matrices and vector spaces
Matrices and Vectors Review Objective
Multivariate Data and Matrix Algebra Review
CS485/685 Computer Vision Dr. George Bebis
Matrix Algebra and Random Vectors
Objective To provide background material in support of topics in Digital Image Processing that are based on matrices and/or vectors.
Introduction to Vectors and Matrices
Subject :- Applied Mathematics
Presentation transcript:

Multivariate Data and Matrix Algebra Review BMTRY 726 Spring 2012

What is ‘Multivariate’ Data? Data in which each sampling unit contributes to more than one outcome. For example…. Sampling Unit Cancer patientsSerum concentrations on a panel of protein markers are collected in chemotherapy patients Smoking cessation participants Collect background information and smoking behavior at multiple visits Post-operative patient outcome Multiple measures of how a patient is doing post- operatively: patient self-reported pain, opioid consumption, ICU/Hospital length of stay DiabeticsEach subject assigned to different glucose control option (medication, diet, diet and medication). Fasting blood glucose is monitored at 0, 3, 6, 9, 12, and 15 months.

Goals of Multivariate Analysis Data reduction and structural simplification – Say we collect 16 biological markers to examine patient response to chemotherapy. – Ideally we might like to summarize patient response as some simple combination of the markers. – How can variation in p=16 markers be summarized?

Goals of Multivariate Analysis Sorting and grouping data – Participants are enrolled in a smoking cessation program for several years – Information about the background of each subject and smoking behavior at multiple visits – Some patients quit while others do not – Can we use the background and smoking behavior information to classify those that quit and those that do not in order to screen future participants?

Goals of Multivariate Analysis Investigating dependence among variables – Subjects take a standardized test with different categories of questions Sentence completion Number sequences Orientation of patterns Arithmetic (etc.) – Can correlation among scores be attributed to variation in one or more unobserved factors? Intelligence Mathematical ability

Goals of Multivariate Analysis Prediction based on relationship between variables – We conduct a microarray experiment to compare tumor and healthy tissue – We want to develop a reliable classification tool based on the gene expression information from our experiment

Goals of Multivariate Analysis Hypothesis testing – Participants in a diabetes study are placed into one of three treatment groups – Fasting blood glucose is evaluated at 0, 3, 6, 9, 12, and 15 months – We want to test the hypothesis that treatment groups are different.

Multivariate Data Properties What property/ies of multivariate data make commonly used statistical approached inappropriate?

Notation & Data Organization Consider an example where we have 15 tumor markers collected on 30 tissue samples The 15 markers are variables and our samples represent the subjects in the data. These data can most easily be expressed as an 15 by 30 array

Notation & Data Organization More generally, let j = 1,2,…,p represent a set of variables collected in a study And let i = 1,2,…,n represent the samples

Random Vectors Each experimental unit has multiple outcome measures thus we can arrange the i th subject’s j = 1,2,…, p outcomes as a vector. is a random variable as are it’s individual elements p denotes the number of outcomes for subject i i = 1,2,…,n is the number subjects

Descriptive Statistics We can calculate familiar descriptive statistics for this array – Mean – Variance – Covariance (Correlation)

Arranged as Arrays Means Covariance

Distance Many multivariate statistics are based on the idea of distance For example, if we are comparing two groups we might look at the difference in their means Euclidean distance

Distance But why is Euclidean distance inappropriate in statistics? This leads us to the idea of statistical distance Consider a case where we have two measures

Distance Consider a case where we have two measures

Distance Consider a case where we have two measures

Distance Our expression of statistical distance can be generalized to p variables to any fixed set of points

Basic Matrix Operations Can I add A 2x3 and B 3x3 ? What is the product of matrix A and scalar c ? When can I multiply the two matrices A and B ?

Matrix Transposes The transpose of an n x m matrix A, denoted as A ’, is an m x n matrix whose ij th element is the ji th element of A Properties of a transpose:

Types of Matrices Square matrix: Idempotent: Symmetric: A square matrix is diagonal :

More Definitions An n x n matrix A is nonsingular if there exists an matrix B n x n such that B is the multiplicative inverse of A and can be written as A square matrix with no multiplicative inverse is said to be…. We can calculate the inverse of a matrix assuming one exists but it is tedious (let the computer do it).

Matrix Determinant The determinant of a square matrix A is a scalar given by What is the determinant of

Matrix Determinant The determinant of a square matrix A is a scalar given by What is the determinant of

Matrix Determinant What about the determinant of the 3x3 matrix?

Matrix Determinant Using this result what is the determinant of

Orthogonal an Orthonormal vectors A collection of m-dimensional vectors, x 1, x 2, …, x p are orthogonal if… The collection of vectors is said to be orthonormal if what 2 conditions are met?

Linear Dependence The p of m-dimensional vectors,, are linearly dependent if there is a set of constants, c 1,c 2, …,c p not all zero for which

Linear Dependence The p of m-dimensional vectors,, are linearly dependent if there is a set of constants, c 1,c 2, …,c p not all zero for which Conversely, if no such set of non-zero constants exists, the vectors are linearly independent.

Rank of a Matrix Row rank is the number of rows Column rank is the number of cols Find the column rank of How are row and column rank related? What does rank tell us about linear dependence of the vectors that make up the matrix?

Orthogonal Matrices A square matrix A n x n is said to be orthogonal if its columns form an orthonormal set. This can be easily be determined by showing that

Eigenvalues and Eigenvectors The eigenvalues of an A n x n matrix are the solutions to for a set of eigenvectors,. We typically normalize so that

Quadratic Forms Given a symmetric matrix A n x n and an n-dimensional vector x, we can write the quadratic form as. For example, find the quadratic form where

Trace Let A be an n x n matrix, the trace of A is given by Properties of the trace:

Positive Definite Matrices A symmetric matrix A is said to be positive definite if this implies

Positive Definite Matrices A real symmetric matrix is:

Back to Random Vectors Define Y as a random vector Then the population mean vector is:

Random Vectors Cont’d So Y i is a random variable whose mean and variance can be expressed by:

Covariance of Random Vectors We then define the covariance between the i th and j th trait in Y as Yielding the covariance matrix

Correlation Matrix of Y The correlation matrix for Y is

Properties of a Covariance Matrix is symmetric (i.e.  ij =  ji for all i,j ) is positive semi-definite for any vector of constants

Linear Combinations Consider linear combinations of the elements of Y If Y has mean  and covariance , then

Linear Combinations Cont’d If  is not positive definite then for at least one