Presentation on theme: "BY RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES Copula Regression."— Presentation transcript:
BY RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES Copula Regression
Outline of Talk OLS Regression Generalized Linear Models (GLM) Copula Regression Continuous case Discrete Case Examples
Notation Notation: Y – Dependent Variable Assumption Y is related to X’s in some functional form
OLS Regression Y is linearly related to X’s OLS Model
OLS Regression Estimated Model
OLS Multivariate Normal Distribution Assume Jointly follow a multivariate normal distribution Then the conditional distribution of Y | X follows normal distribution with mean and variance given by
OLS & MVN Y-hat = Estimated Conditional mean It is the MLE Estimated Conditional Variance is the error variance OLS and MLE result in same values Closed form solution exists
GLM Y belongs to an exponential family of distributions g is called the link function x's are not random Y|x belongs to the exponential family Conditional variance is no longer constant Parameters are estimated by MLE using numerical methods
GLM Generalization of GLM: Y can be any distribution (See Loss Models) Computing predicted values is difficult No convenient expression conditional variance
Copula Regression Y can have any distribution Each X i can have any distribution The joint distribution is described by a Copula Estimate Y by E(Y|X= x ) – conditional mean
Copula Ideal Copulas will have the following properties: ease of simulation closed form for conditional density different degrees of association available for different pairs of variables. Good Candidates are: Gaussian or MVN Copula t-Copula
MVN Copula CDF for MVN is Copula is Where G is the multivariate normal cdf with zero mean, unit variance, and correlation matrix R. Density of MVN Copula is Where v is a vector with ith element
Conditional Distribution in MVN Copula The conditional distribution of x n given x 1 ….x n-1 is Where
Copula Regression Continuous Case Parameters are estimated by MLE. If are continuous variables, then we use previous equation to find the conditional mean. one-dimensional numerical integration is needed to compute the mean.
Copula Regression Discrete Case When one of the covariates is discrete Problem: determining discrete probabilities from the Gaussian copula requires computing many multivariate normal distribution function values and thus computing the likelihood function is difficult Solution: Replace discrete distribution by a continuous distribution using a uniform kernel.
Copula Regression – Standard Errors How to compute standard errors of the estimates? As n -> ∞, MLE, converges to a normal distribution with mean and variance I( ) -1, where I( ) – Information Matrix.
How to compute Standard Errors Loss Models: “To obtain information matrix, it is necessary to take both derivatives and expected values, which is not always easy. A way to avoid this problem is to simply not take the expected value.” It is called “Observed Information.”
Examples All examples have three variables R Matrix : Error measured by Also compared to OLS 10.7 1 1
Example 1 Dependent – X3 - Gamma Though X2 is simulated from Pareto, parameter estimates do not converge, gamma model fit Error: VariablesX1-ParetoX2-ParetoX3-Gamma Parameters3, 1004, 3003, 100 MLE3.44, 161.111.04, 112.0033.77, 85.93 Copula59000.5 OLS637172.8
Ex 1 - Standard Errors Diagonal terms are standard deviations and off- diagonal terms are correlations
Example 1 - Cont Maximum likelihood Estimate of Correlation Matrix 10.7110.699 0.71110.713 0.6990.7131 R-hat =