Tip #1: Look at your data (1)

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Probabilistic Analysis of Hydrological Loads to Optimize the Design of Flood Control Systems B. Klein, M. Pahlow, Y. Hundecha, C. Gattke and A. Schumann.
3.3 Hypothesis Testing in Multiple Linear Regression
INTRODUCTION TO COPULAS
Inference for Linear Regression (C27 BVD). * If we believe two variables may have a linear relationship, we may find a linear regression line to model.
G Lecture 101 Examples of Binary Data Binary Data and Correlation Measurement Models and Binary Data Measurement Models and Ordinal Data Analyzing.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Copula Functions and Bivariate Distributions: Applications to Political Interdependence Alejandro Quiroz Flores, Wilf Department of Politics, NYU Motivation.
Pair-copula constructions of multiple dependence Workshop on ''Copulae: Theory and Practice'' Weierstrass Institute for Applied Analysis and.
Master thesis presentation Joanna Gatz TU Delft 29 of July 2007 Properties and Applications of the T copula.
Models for construction of multivariate dependence Workshop on Copulae and Multivariate Probability distributions in Finance – Theory, Applications,
Simulating Exchangeable Multivariate Archimedean Copulas and its Applications Authors: Florence Wu Emiliano A. Valdez Michael Sherris.
Risk Aggregation: Copula Approach
Lecture 23: Tues., Dec. 2 Today: Thursday:
Chapter Eighteen MEASURES OF ASSOCIATION
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Archimedean Copulas Theodore Charitos MSc. Student CROSS.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Measures of Association Deepak Khazanchi Chapter 18.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Copula functions Advanced Methods of Risk Management Umberto Cherubini.
P á l Rakonczai, L á szl ó Varga, Andr á s Zempl é ni Copula fitting to time-dependent data, with applications to wind speed maxima Eötvös Loránd University.
Dependence between mortality and morbidity: is underwriting scoring really different for Life and Health products? Andrey Kudryavtsev, St.Petersburg State.
Correlation and Regression
Correlations and Copulas 1. Measures of Dependence 2 The risk can be split into two parts: the individual risks and the dependence structure between them.
Risk Modeling of Multi-year, Multi-line Reinsurance Using Copulas
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 5 Modeling & Analyzing Inputs
Enterprise Risk Management in Insurance Groups July 11, Enterprise Risk Management in Insurance Groups: Measuring Risk Concentration and Default.
B. John Manistre FSA, FCIA, MAAA Risk Dependency Research: A Progress Report Enterprise Risk Management Symposium Washington DC July 30, 2003.
Covariance and correlation
Correlation.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Traffic Modeling.
A Beginner’s Guide to Bayesian Modelling Peter England, PhD EMB GIRO 2002.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Basic Statistics Correlation Var Relationships Associations.
Day 3: Sampling Distributions. CCSS.Math.Content.HSS-IC.A.1 Understand statistics as a process for making inferences about population parameters based.
Estimating the Predictive Distribution for Loss Reserve Models Glenn Meyers Casualty Loss Reserve Seminar September 12, 2006.
Relationship between two variables Two quantitative variables: correlation and regression methods Two qualitative variables: contingency table methods.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Lévy copulas: Basic ideas and a new estimation method J L van Velsen, EC Modelling, ABN Amro TopQuants, November 2013.
Probability and Measure September 2, Nonparametric Bayesian Fundamental Problem: Estimating Distribution from a collection of Data E. ( X a distribution-valued.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Academic Research Academic Research Dr Kishor Bhanushali M
Using Copulas.
1 Reserving Ranges and Acceptable Deviations CANE Fall 2005 Meeting Kevin Weathers FCAS, MAAA The Hartford This document is designed for discussion purposes.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Portfolio wide Catastrophe Modelling Practical Issues.
Pair-copula constructions of multiple dependence Vine Copula Workshop Delft Institute of Applied Mathematics, 19. November, 2007 Kjersti Aas.
Latvijas Aktuāru Asociācija AN INTRODUCTION TO COPULAS Gaida Pettere Professor, Dr. Math. Riga Technical University, Chairmen of Latvian Actuarial Association.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
1 Lecture Plan Modelling Profit Distribution from Wind Production (Excel Case: Danish Wind Production and Spot Prices) Reasons for copula.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Inference about the slope parameter and correlation
MATH-138 Elementary Statistics
Chapter 10 CORRELATION.
Elementary Statistics
Correlation and Regression
Non – Parametric Test Dr. Anshul Singh Thapa.
Ch11 Curve Fitting II.
Presentation transcript:

Fitting copulas: Some tips Andreas Tsanakas, Cass Business School Staple Inn, 16/11/06

Tip #1: Look at your data (1) Usually there are some data, even if they are not enough to run a formal Maximum Likelihood Estimation process Plot what you have and look at some heuristics They may help you decide with model choice and sensible parameter ranges Example Plot ranks Visually test for tail-dependence Visually test for skewness

Tip #1: Look at your data (2) Sample ranks are a data set on which a copula may be fitted So work with those There are dependence measures that relate only to these ranks Spearman’s rank correlation Kendall tau Blomqvist beta The estimates of these tend to be more stable than of the usual correlation Can use directly to parameterise some models Kendall’s tau works well with elliptical (Gaussian, t) and Archimedean (Gumbel, Clayton) copulas

Rank plots Dependence patterns do not get distorted by marginals ‘Real world’ ‘Copula world’

Example copulas: Gaussian (ρ=0.5) Has no tail dependence

Example copulas: t Does have tail dependence, same in both tails

Example copulas: Asymmetric t Has tail dependence and skewness

Tail correlations A local dependence metric that is sensitive to asymptotic tail dependence (based on Schmidt and Schmid, 2006) Left tail p=0 Right tail p=1 Compare 3 copulas - Gaussian - t - Asymmetric t Plot using 20,000 Samples Very unstable!

Skew-rank correlations Try to identify skewness from a small sample (of 20) Generalisation of rank correlation with a 3rd moment adjustment The sensitivity to the 3rd joint moments of the sample ranks increases with coefficient “a”. a=0 gives the usual rank correlation.

Quadrant correlations Can write (rank) correlation as sum of correlations in each quadrant (Smith, 2002) Gaussian copula

Quadrant correlations Can write (rank) correlation as sum of correlations in each quadrant (Smith, 2002) Gaussian copula

Quadrant correlations Can write (rank) correlation as sum of correlations in each quadrant (Smith, 2002) t copula

Quadrant correlations Can write (rank) correlation as sum of correlations in each quadrant (Smith, 2002) Asymmetric t copula

Quadrant correlations Plot the % breakdown of aggregate rank correlation to quadrants

Tip #2: Use judgement expertly There are 3 ways of using expert judgement in model calibration Method 1 Ask underwriters and other experts for sources of correlation Make up some numbers Method 2 Ask questions that experts can answer meaningfully Identify drivers of dependence Associate those with your copula models Method 3 Adopt a full formal Bayesian framework Not quite there yet!

Tip #3: Understand your models Copula models can often be expressed via factors (drivers) So not as ad-hoc as you may think Helps you chose an appropriate structure Gaussian copula: n (or less!) additive factors determine correlation structure t-copula: same as Gaussian with one additional multiplicative factor driving tail dependence, even for otherwise uncorrelated risks Archimedean copulas (e.g. Gumbel, Clayton): there is one factor, conditional upon which all risks are independent p-factor Archimedean copulas: as above, but can use more than one factor

Tip #4: Keep it simple Modern DFA software offers you a wealth possibilities But you may be tempted to overparameterise your model A small number of parameters (drivers) will give you a better chance of making meaningful (interpretable) choices You may even be able to do a bit of estimation

Tip #5: Use other models for reference Suppose you have a peril model which you believe in But it doesn’t cover all your risks Maybe the dependence structure that it implies between classes can be used as a proxy for risks that aren’t covered? Fit a copula to the peril model simulations Loads of (pseudo-)data! Alternatively suppose you have some particular pairs of risks for which you have more data This may help you get a feeling for sensible parameter ranges

Tip #6: Work backwards See what diversification credits different copula choices imply Do they make sense? The argument is of course circular, but it may help involve people in the thinking

Tip #7: Take care of your data You may not have enough data today If you collect and maintain your data you’ll have more tomorrow

Tip #8: Adopt a positive attitude It is no good complaining about fitting a copula ‘being impossible’ It is difficult but not impossible in principle Insisting on the difficulty does not make the problem of dependencies go away There are some things you can do Though they still may not work Shouldn’t you be used to this?

Literature I used for this talk On most things: McNeil, Frey and Embrechts (2005), Quantitative Risk Management, Princeton University Press. On quadrant correlations and normal mixtures Smith (2002), ‘Dependent Tails’, 2002 GIRO Convention, http://www.actuaries.org.uk/Display_Page.cgi?url=/giro2002/index.xml A technical paper from which I pinched an idea or two: Schmidt and Schmid (2006) ‘Nonparametric Inference on Multivariate Versions of Blomqvist's Beta and Related Measures of Tail Dependence’, http://www.uni-koeln.de/wiso-fak/wisostatsem/autoren/schmidt/publication.html

Sample literature on fitting copulas (yes it exists) Genest and Rivest (1993), ‘Statistical inference procedures for bivariate Archimedean copulas,’ J. Amer. Statist. Assoc. 88, 1034-1043. Joe (1997), Multivariate Models and Dependence Concepts, Chapman & Hall. McNeil, Frey and Embrechts (2005), Quantitative Risk Management, Princeton University Press. Denuit, Purcaru and Van Bellegem (2006), 'Bivariate archimedean copula modelling for censored data in nonlife insurance'. Journal of Actuarial Practice 13, 5-32. Chen, Fan and Tsyrennikov (2006), ‘Efficient estimation of semiparametric multivariate copula models. J. Am. Stat. Assoc. 101, 1228-1240.

Appendix - dependence measures Consider risks X and Y, with cdfs F and G. Let U=F(X), V=G(Y) Let X’=X-E[X], Y’=Y-E[Y], U’=U-E[U], V’=V-E[V] Assume sample of size n x={x1,…,xn } is the sample from random variable X etc u={u1,…,un } are the normalised sample ranks of X, i.e. numbers 1/n, 2/n,…,1, but ordered in the same way as the elements of x. Same for y, v.

Appendix - dependence measures Pearson correlation coefficient Spearman correlation coefficient Spearman correlation for the Gaussian copula where r is the Pearson correlation of the underlying normal distribution

Appendix - dependence measures Kendall correlation coefficient (population version) where is an independent copy of (X,Y) Kendall correlation coefficient (sample version) Kendall correlation coefficient elliptical copulas (incl. Gaussian, t) where r is the Pearson correlation of the underlying elliptical distribution

Appendix - dependence measures Kendall correlation coefficient of the Pareto (flipped Clayton) copula Copula function: Kendall’s τ: Kendall correlation coefficient of the Gumbel copula

Appendix - dependence measures “Tail-Blomqvist” correlation coefficient (Schmidt and Schmid, 2006) where 0<p<1 and C is the copula of (X, Y) For we have asymptotic upper tail-dependence For we have asymptotic lower tail-dependence For sample version use the empirical copula:

Appendix - dependence measures “Skew-rank correlation” Quadrant rank correlation (Smith, 2002) etc, where I{A} is the indicator function of set A