An Application of Bayesian Analysis in Forecasting Insurance Loss Payments Yanwei (Wayne) Zhang CAS annual meeting 2010 Washington DC Nov 9th, 2010.

Slides:

Advertisements

Similar presentations

Introduction to Monte Carlo Markov chain (MCMC) methods

Advertisements

Other MCMC features in MLwiN and the MLwiN->WinBUGS interface

Chapter 25 Risk Assessment. Introduction Risk assessment is the evaluation of distributions of outcomes, with a focus on the worse that might happen.

©Towers Perrin Emmanuel Bardis, FCAS, MAAA Cane Fall 2005 meeting Stochastic Reserving and Reserves Ranges Fall 2005 This document was designed for discussion.

Bayesian Estimation in MARK

2005 Hopkins Epi-Biostat Summer Institute1 Module 2: Bayesian Hierarchical Models Francesca Dominici Michael Griswold The Johns Hopkins University Bloomberg.

What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.

Bayesian statistics – MCMC techniques

Industrial Engineering College of Engineering Bayesian Kernel Methods for Binary Classification and Online Learning Problems Theodore Trafalis Workshop.

An Introduction to Stochastic Reserve Analysis Gerald Kirschner, FCAS, MAAA Deloitte Consulting Casualty Loss Reserve Seminar September 2004.

Results 2 (cont’d) c) Long term observational data on the duration of effective response Observational data on n=50 has EVSI = £867 d) Collect data on.

1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 7 Sampling.

Applied Bayesian Analysis for the Social Sciences Philip Pendergast Computing and Research Services Department of Sociology

SIMPLE LINEAR REGRESSION

1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.

1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.

Lecture II-2: Probability Review

Monté Carlo Simulation MGS 3100 – Chapter 9. Simulation Defined A computer-based model used to run experiments on a real system.  Typically done on a.

Department of Geography, Florida State University

Lecture 11 Implementation Issues – Part 2. Monte Carlo Simulation An alternative approach to valuing embedded options is simulation Underlying model “simulates”

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.

SIMPLE LINEAR REGRESSION

Inference for regression - Simple linear regression

Risk Modeling of Multi-year, Multi-line Reinsurance Using Copulas

Day 7 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.

Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,

Bayesian parameter estimation in cosmology with Population Monte Carlo By Darell Moodley (UKZN) Supervisor: Prof. K Moodley (UKZN) SKA Postgraduate conference,

Analysis and Visualization Approaches to Assess UDU Capability Presented at MBSW May 2015 Jeff Hofer, Adam Rauk 1.

Model Inference and Averaging

Introduction to MCMC and BUGS. Computational problems More parameters -> even more parameter combinations Exact computation and grid approximation become.

Bootstrapping Identify some of the forces behind the move to quantify reserve variability. Review current regulatory requirements regarding reserves and.

A Beginner’s Guide to Bayesian Modelling Peter England, PhD EMB GIRO 2002.

CHAPTER 14 MULTIPLE REGRESSION

Two Approaches to Calculating Correlated Reserve Indications Across Multiple Lines of Business Gerald Kirschner Classic Solutions Casualty Loss Reserve.

More on Stochastic Reserving in General Insurance GIRO Convention, Killarney, October 2004 Peter England and Richard Verrall.

Practical GLM Modeling of Deductibles

Chapter Outline 3.1THE PERVASIVENESS OF RISK Risks Faced by an Automobile Manufacturer Risks Faced by Students 3.2BASIC CONCEPTS FROM PROBABILITY AND STATISTICS.

The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.

 Copyright 2006 National Council on Compensation Insurance, Inc. All Rights Reserved. BAYESIAN ESTIMATION OF STATE SPACE RESERVING MODELS Casualty Loss.

Testing Models on Simulated Data Presented at the Casualty Loss Reserve Seminar September 19, 2008 Glenn Meyers, FCAS, PhD ISO Innovative Analytics.

Estimating the Predictive Distribution for Loss Reserve Models Glenn Meyers Casualty Loss Reserve Seminar September 12, 2006.

Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.

Lecture 5 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.

- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.

Reserve Variability – Session II: Who Is Doing What? Mark R. Shapland, FCAS, ASA, MAAA Casualty Actuarial Society Spring Meeting San Juan, Puerto Rico.

©2015 : OneBeacon Insurance Group LLC | 1 SUSAN WITCRAFT Building an Economic Capital Model

5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.

Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…

On Predictive Modeling for Claim Severity Paper in Spring 2005 CAS Forum Glenn Meyers ISO Innovative Analytics Predictive Modeling Seminar September 19,

Stochastic Loss Reserving Using Bayesian MCMC Models Glenn Meyers Presented to Association des Actuaires I.A.R.D. November 6, 2015 Talk based on CAS Monograph.

The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.

Bayesian Travel Time Reliability

Sampling and estimation Petter Mostad

Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies.

 2006 National Council on Compensation Insurance, Inc. Slide 1 of 17 A Claim Counts Model for Discerning the Rate of Inflation from Raw Claims Data Spring.

6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,

Stochastic Loss Reserving with the Collective Risk Model Glenn Meyers ISO Innovative Analytics Casualty Loss Reserving Seminar September 18, 2008.

Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.

A Stochastic Framework for Incremental Average Reserve Models Presented by Roger M. Hayne, PhD., FCAS, MAAA Casualty Loss Reserve Seminar September.

Session C7: Dynamic Risk Modeling Loss Simulation Model Working Party Basic Model Underlying Prototype Presented by Robert A. Bear Consulting Actuary and.

Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.

Chapter 2: Bayesian hierarchical models in geographical genetics Manda Sayler.

1 Getting started with WinBUGS Mei LU Graduate Research Assistant Dept. of Epidemiology, MD Anderson Cancer Center Some material was taken from James and.

Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.

Hierarchical Models. Conceptual: What are we talking about? – What makes a statistical model hierarchical? – How does that fit into population analysis?

Dave Clark American Re-Insurance 2003 Casualty Loss Reserve Seminar

MCMC Stopping and Variance Estimation: Idea here is to first use multiple Chains from different initial conditions to determine a burn-in period so the.

Chapter Outline 3.1 THE PERVASIVENESS OF RISK

Predictive distributions

Parametric Methods Berlin Chen, 2005 References:

Presentation transcript:

An Application of Bayesian Analysis in Forecasting Insurance Loss Payments Yanwei (Wayne) Zhang CAS annual meeting 2010 Washington DC Nov 9th, 2010

2 Highlights: Bayesian methodology and credibility theory Case study in loss reserving Appendix: Bayesian analysis in Excel

3 Bayesian methodology and credibility theory Part I

4 Bayesian methodology Fundamental question is With the posterior distribution of the parameters, the distribution of any quantities of interest can be obtained The key is the Bayes’ rule: Given data and a specified model, what is the distribution of the parameters? Posterior distribution is proportional to data distribution * prior distribution

5 Application to the actuarial field Most of you are Bayesian! – Bornhuetter-Ferguson type reserving to regulate data or account for information not in data with prior knowledge of the average loss ratio – Credibility. Bühlmann and Gisler (2005) said “Credibility theory belongs mathematically to the area of Bayesian statistics [and it] is motivated by questions arising in insurance practice.” So, when you are talking about these, you are thinking in a Bayesian world But……Few of you are doing Bayesian analysis! Now, there is an opportunity to be a real Bayesian!

6 More on credibility Credibility theory refers to We should not retain the word just for the actuarial credibility formulas Recall that the actuarial credibility theory started asking the following question: “any procedure that uses information (‘borrows strength’) from samples from different, but related, populations.” –- Klugman (1987) Given a group of policyholders with some common risk factor and past claims experience, what is the Bayes’ premium to be charged for each policyholder? Bayes’ Premium Bayesian Analysis Credibility to borrow information Bülmann formulas …… Hierarchical model

7 Credibility example Credibility formulas are only linear approximations to overcome computational difficulties: – No closed form except for some simple models and distributions – Hard to estimate the population parameters Now, there is no reason to linearly approximate Bayesian methods as advances in statistical computation in the past several decades have enabled more complex and realistic models to be constructed and estimated Consider the following example in Workers’ Comp (see Scollnik 2001): Question: What’s the expected count for year 5, given the observed claim history? Let’s do the Bayesian hierarchical analysis and then compare it with other estimates Year Group 1Group 2Group 3 Payroll# ClaimsPayroll# ClaimsPayroll# Claims

8 Visualization of the hierarchies Intuitively, we assume claim count to be a Poisson distribution Credibility view assumes that each group has a different claim rate per exposure µ k, but each µ k arises from the same distribution, say We call µ 0 and ¾ hyper-parameters If µ 0 is estimated using all the data, so will each µ k. Thus, the estimation of one group will borrow information from other groups, and will be pooled toward the overall mean Assign non-informative (flat) priors so that ¾ and µ 0 are estimated from the data, e.g. Data Group

9 Results of different estimations

10 Visualization of posterior distribution

11 Case study in loss reserving Part II Paper: A Bayesian Nonlinear Model for Forecasting Insurance Loss Payments. Yanwei (Wayne) Zhang, CNA Insurance Company Vanja Dukic, Applied Math, University of Colorado-Boulder James Guszcza, Deloitte Consulting LLP Paper is available at

12 Importance of loss reserving

13 Challenges Challenges in current loss reserving practices: – Most stochastic models need to be supplemented by a tail factor, but the corresponding uncertainty is hard to be accounted for – Inference at an arbitrary point is hard to obtain, e.g., 3 months or 9 months – Too many parameters! Parsimony is the basic principle of statistics – Treat accident year, development lag, or both independently – Focus on one triangle, lack a method to blend industry data – Usually rely on post-model selection using judgment: Input of point estimate is almost meaningless, but large leverage Extra uncertainty is not accounted for

14 Benefits of the Bayesian hierarchical model to be built Allows input of external information and expert opinion Blending of information across accident years and across companies Extrapolates development beyond the range of observed data Estimates at any time point can be made Uncertainty of extrapolation is directly included Full distribution is available, not just standard error Prediction of a new accident year can be achieved Minimizes the risk of underestimating the uncertainty in traditional models Estimation of company-level and accident-year-level variations

15 Steps in the Bayesian analysis Steps in a Bayesian analysis – Setting up the probability model Specify the full distribution of data and the priors Prior distribution could be either informative or non-informative, but need to result in a proper joint density – Computation and inference Usually need to use sampling method to simulate values from the posterior distribution – Model checking Residual plot Out-of-Sample validation Sensitivity analysis of prior distribution

16 Visualization of data Workers’ Comp Schedule P data ( ) from 10 large companies Use only 9 years’ data, put the 10 th year as hold-out validation set

17 Visualization of data continued

18 Probability model We use the Log-Normal distribution to reflect the skewness and ensure that cumulative losses are positive We use a nonlinear mean structure with the log-logistic growth curve: t ! /(t ! + µ ! ) We use an auto-correlated process along the development for forecasting We build a multi-level structure to allow the expected ultimate loss ratios to vary by accident year and company: – In one company, loss ratios from different years follow the same distribution with a mean of company-level loss ratio – Different company-level average loss ratios follow the same distribution with a mean of the industry-level loss ratio Growth curve is assumed to be the same within one company, but vary across companies, arising from the same industry average growth curve Assign non-informative priors to complete model specification Expected cumulative loss = premium * expected loss ratio * expected emergence

19 Visualization of the model Data AY Company

20 Computation and model estimation Such a specification does not result in a closed-form posterior distribution Must resort to sampling method to simulate the distribution We use Markov Chain Monte Carlo [MCMC] algorithms – The software WinBUGS implements the MCMC method – Always need to check the convergence of the MCMC algorithm Trajectory plot Density plot Autocorrelation plot

21 Checking convergence of the Markov chains

22 Fitted curves for the first accident year

23 Joint distribution of growth parameters

24 Estimation of loss ratios Industry average loss ratio is [0.644, 0.748] Variations across company is about twice as large as those across accident years

25 Loss reserve estimation results Autocorrelation is about [0.445, 0.511] Industrial average emergence percentage at 108 months is about 93.5% Bayesian reserves projected to ultimate are greater than the GLM estimates projected to 108 months, by a factor about 1.4. Company Estimate at ultimateEstimate at the end of the 9th year Bayesian GLM-ODP ReservePred Err50% IntervalReservePred ErrReservePred Err (230.80,292.54) (159.37,188.60) (206.70,224.83) (77.17,87.14) (40.33,49.21) (45.48,52.41) (33.03,35.90) (21.62,24.32) (27.11,34.42) (18.94,20.80)

26 Residual Plot

27 Residual plot by company

28 Out-of-Sample test We use only 9 years of data to train the model, and validate on the 10 th year – Note that this is the cash flow of the coming calendar year Policies written in the past Policies to be written in the coming year (need an estimated premium) For 4 companies, we also have observed data for the bottom right part The coverage rates of the 50% and 95% intervals in the two validation sets are The model performs fairly well overall, but long-term prediction is a little under expectation 50% Interval95% Interval Set 157%95% Set 240%81%

29 Sensitivity analysis Change the prior distribution of the industry-level loss ratio to more realistic distributions 6 scenarios: Gamma distribution with mean 0.5, 0.7 and 0.9, variance 0.1 and 0.2, respectively

30 Discussion of the model The model used in this analysis provides solutions to many existing challenges The model can be further improved: – Inflation can be readily included with an appropriate model – Prior information can be incorporated on the accident-year or company level – Build in more hierarchies: states, lines of business, etc… – Include triangles that have more loss history to stabilize extrapolation For future research: – Which curve to choose? (model risk) – Correlation among lines? (Copula)

31 Summary Introduced Bayesian hierarchical model as a full probability model that allows pooling of information and inputs of expert opinion Illustrated application of the Bayesian model in insurance with a case study of forecasting loss payments in loss reserving using data from multiple companies The application of Bayesian model in insurance is intuitive and promising. I hope more people will start exploiting it and applying it to their work. You may download this presentation, the paper and code from my website: ; Or contact me at:

32 Reference Bülmann and Gisler (2005). A Course in Credibility Theory and its Applications. Clark D. R. (2003). LDF Curve-Fitting and Stochastic Reserving: A Maximum Likelihood Approach. Available at Guszcza J. (2008). Hierarchical Growth Curve Models for Loss Reserving. Available at Klugman S (1987). Credibility for Classification Ratemaking via The Hierarchical Normal Linear Model.Available at Scollnik D. P. M. (2001). Actuarial Modeling with MCMC and BUGS, North American Actuarial Journal 5(2): Zhang et al. (2010). A Bayesian Nonlinear Model for Forecasting Insurance Loss Payments. Available at

33 Questions?

34 WinBUGS in Excel Appendix

35 WinBUGS BUGS ( B ayesian inference U sing G ibbs S ampling ) was developed by the MRC Biostatistics Unit, and it has a number of versions. WinBUGS is one of them. We can work directly in WinBUGS, but better to submit batch run from other software – R: package R2WinBUGS – SAS: macro %WINBUGSIO – Excel: add-in BugsXLA R is most handy when working with WinBUGS, but we will focus on Excel here The excel add-in BugsXLA is developed by Phil Woodward, and provides a great user interface to work with WinBUGS It allows the specification of typical Bayesian hierarchical models, but enhancement is needed to fit more complicated and customized models I will illustrate this using the simple Workers’ Comp Frequency model

36 BugsXLA Download and install WinBUGS at Download and install the Excel add-in BugsXLA at Put the data into long format

37 BugsXLA Click the “Bayesian analysis” button Specify input data Specify categorical variables In the new window, move the variable “Group” to the “FACTORS” column Can specify the levels and the ordering with “Edit Factor Levels”

38 BugsXLA Specify Poisson Distribution for the response variable “Claims” Want to use identity link, but the only option is “log” But for this simple example, we can just re-parameterize the model Put “Payroll” as offset Put “Group” as random effect We are done specifying the model. Now, click “MCMC Options” to customize simulations

39 BugsXLA Burn-in: number of simulations to discard from the beginning Samples: number of samples to draw Thin: sample every k th simulations Chains: number of chains Import Stats: summary statistics for the parameters and simulations Import Sample: the simulated outcomes for each parameter

40 BugsXLA After clicking “OK” in the “Bayesian analysis” dialog, a “Prior Distribution” dialog pops up Change the distribution here so that the group effect is Normally distributed, with a large variance, say, the standard deviation is uniform on (0,100) Click “Run WinBUGS” Then,

41 BugsXLA Simulation results are imported – Estimation summary – Model checks – Simulated outcomes Calculate the mean for each group Plot the result