On Predictive Modeling for Claim Severity
Paper in Spring 2005 CAS Forum
Glenn Meyers, ISO Innovative Analytics
Predictive Modeling Seminar, September 19, 2005

Problems with Experience Rating for Excess of Loss Reinsurance
Use submission claim severity data
  – Relevant, but
  – Not credible
  – Not developed
Use industry distributions
  – Credible, but
  – Not relevant (???)

General Problems with Fitting Claim Severity Distributions
Parameter uncertainty
  – Fitted parameters of the chosen model are estimates subject to sampling error.
Model uncertainty
  – We might choose the wrong model. There is no particular reason that the models we choose are appropriate.
Loss development
  – Complete claim settlement data is not always available.

Outline of Talk
Quantifying Parameter Uncertainty
  – Likelihood ratio test
Incorporating Model Uncertainty
  – Use Bayesian estimation with likelihood functions
  – Uncertainty in excess layer loss estimates
Bayesian estimation with prior models based on data reported to a statistical agent
  – Reflects insurer heterogeneity
  – Develops losses

The Likelihood Ratio Test

An Example: The Pareto Distribution
Simulate a random sample of size 1000 with α = 2.000, θ = 10,000

Hypothesis Testing Example
Significance level = 5%
χ² critical value = 5.991
H0: (θ, α) = (10000, 2)
H1: (θ, α) ≠ (10000, 2)
lnLR = 2(-10034.660 + 10035.623) = 1.207
Accept H0

Hypothesis Testing Example
Significance level = 5%
χ² critical value = 5.991
H0: (θ, α) = (10000, 1.7)
H1: (θ, α) ≠ (10000, 1.7)
lnLR = 2(-10034.660 + 10045.975) = 22.631
Reject H0
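The two tests above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the Pareto cdf is taken to be F(x) = 1 - (θ/(x + θ))^α, the seed and the golden-section search bounds are arbitrary choices, and the sample is freshly simulated, so the statistics it prints will not match the slide's figures. The critical value uses the fact that a χ² variable with 2 degrees of freedom has survival function exp(-x/2).

```python
import math
import random

def simulate_pareto(n, theta, alpha, rng):
    """Inverse-transform draws from F(x) = 1 - (theta / (x + theta))**alpha."""
    return [theta * ((1.0 - rng.random()) ** (-1.0 / alpha) - 1.0) for _ in range(n)]

def loglik(xs, theta, alpha):
    """Log-likelihood for density alpha * theta**alpha / (x + theta)**(alpha + 1)."""
    n = len(xs)
    return (n * math.log(alpha) + n * alpha * math.log(theta)
            - (alpha + 1.0) * sum(math.log(x + theta) for x in xs))

def alpha_given_theta(xs, theta):
    """Closed-form maximizer of the log-likelihood in alpha for a fixed theta."""
    return len(xs) / sum(math.log1p(x / theta) for x in xs)

def fit_mle(xs, lo=1e2, hi=1e7, iters=80):
    """Golden-section search over ln(theta) on the profile log-likelihood.
    The bounds lo and hi are assumptions chosen for this example."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log(lo), math.log(hi)

    def profile(t):
        theta = math.exp(t)
        return loglik(xs, theta, alpha_given_theta(xs, theta))

    for _ in range(iters):
        c, d = b - g * (b - a), a + g * (b - a)
        if profile(c) > profile(d):
            b = d
        else:
            a = c
    theta = math.exp((a + b) / 2.0)
    return theta, alpha_given_theta(xs, theta)

def lr_statistic(xs, theta0, alpha0, mle):
    """Twice the gap between the maximized and the hypothesized log-likelihood."""
    return 2.0 * (loglik(xs, mle[0], mle[1]) - loglik(xs, theta0, alpha0))

# 5% critical value for chi-square with 2 degrees of freedom (about 5.991).
CRITICAL_5PCT = -2.0 * math.log(0.05)

rng = random.Random(2005)  # arbitrary seed
sample = simulate_pareto(1000, 10000.0, 2.0, rng)
mle = fit_mle(sample)
for theta0, alpha0 in [(10000.0, 2.0), (10000.0, 1.7)]:
    stat = lr_statistic(sample, theta0, alpha0, mle)
    verdict = "reject H0" if stat > CRITICAL_5PCT else "do not reject H0"
    print(theta0, alpha0, round(stat, 3), verdict)
```

Profiling out α keeps the search one-dimensional, which is why no optimization library is needed here.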

Confidence Region
The X% confidence region corresponds to the hypothesis test at the (100 - X)% significance level.
It is the set of all parameters (θ, α) for which the corresponding H0 is not rejected.
For the 95% confidence region:
  – (10000, 2.0) is in.
  – (10000, 1.7) is out.

Confidence Region Outer Ring 95%, Inner Ring 50%

Grouped Data
Data grouped into four intervals
  – 562 under 5000
  – 181 between 5000 and 10000
  – 134 between 10000 and 20000
  – 123 over 20000
Same data as before, only less information is given.
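With grouped data the likelihood is built from cell probabilities rather than individual claim densities. The sketch below uses the four counts from the slide and assumes the same Pareto cdf, F(x) = 1 - (θ/(x + θ))^α. The parameter grid used to approximate the maximum is a hypothetical choice for illustration, not the author's procedure.

```python
import math

BOUNDS = [5000.0, 10000.0, 20000.0]  # interval boundaries from the slide
COUNTS = [562, 181, 134, 123]        # claim counts from the slide

def pareto_cdf(x, theta, alpha):
    return 1.0 - (theta / (x + theta)) ** alpha

def cell_probs(theta, alpha):
    """Probability of each interval: differences of the cdf at the boundaries."""
    cdf = [0.0] + [pareto_cdf(b, theta, alpha) for b in BOUNDS] + [1.0]
    return [hi - lo for lo, hi in zip(cdf, cdf[1:])]

def grouped_loglik(theta, alpha):
    """Multinomial log-likelihood, dropping the constant that is free of the parameters."""
    return sum(n * math.log(p) for n, p in zip(COUNTS, cell_probs(theta, alpha)))

def in_region(theta, alpha, max_loglik, level=0.95):
    """True if (theta, alpha) is not rejected by the likelihood ratio test.
    The cutoff is the chi-square quantile with 2 degrees of freedom."""
    crit = -2.0 * math.log(1.0 - level)
    return 2.0 * (max_loglik - grouped_loglik(theta, alpha)) <= crit

# Hypothetical grid: theta from 2,000 to 40,000, alpha from 0.50 to 6.00.
thetas = [2000.0 + 250.0 * i for i in range(153)]
alphas = [j / 20.0 for j in range(10, 121)]
best_ll, best_theta, best_alpha = max(
    (grouped_loglik(t, a), t, a) for t in thetas for a in alphas)

print("grid maximum at", best_theta, best_alpha)
print("(10000, 2.0) in 95% region:", in_region(10000.0, 2.0, best_ll))
print("(10000, 2.0) in 50% region:", in_region(10000.0, 2.0, best_ll, level=0.50))
```

Evaluating `in_region` over the whole grid traces out the region itself, one ring per confidence level.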

Confidence Region for Grouped Data Outer Ring 95%, Inner Ring 50%

Confidence Region for Ungrouped Data Outer Ring 95%, Inner Ring 50%

Estimation with Model Uncertainty
COTOR Challenge, November 2004
COTOR published 250 claims
  – Distributional form not revealed to participants
Participants were challenged to estimate the cost of a $5M x $5M layer.
Estimate confidence interval for pure premium

You want to fit a distribution to 250 claims
Knee-jerk first reaction: plot a histogram.

This will not do! Take logs and fit some standard distributions.

Still looks skewed. Take double logs and fit some standard distributions.

Still looks skewed. Take triple logs. Still some skewness. Lognormal and gamma fits look somewhat better.

Candidate #1 Quadruple lognormal

Candidate #2 Triple loggamma

Candidate #3 Triple lognormal

All three cdf’s are within the confidence interval for the quadruple lognormal.

Elements of Solution
Three candidate models
  – Quadruple lognormal
  – Triple loggamma
  – Triple lognormal
Parameter uncertainty within each model
Construct a series of models consisting of
  – One of the three models.
  – Parameters within a broad confidence interval for each model.
  – 7803 possible models

Steps in Solution
Calculate likelihood (given the data) for each model.
Use Bayes’ Theorem to calculate posterior probability for each model
  – Each model has equal prior probability.

Steps in Solution
Calculate layer pure premium for 5 x 5 layer for each model.
Expected pure premium is the posterior probability weighted average of the model layer pure premiums.
Second moment of pure premium is the posterior probability weighted average of the model layer pure premiums squared.

CDF of Layer Pure Premium
The probability that the layer pure premium is ≤ x equals the sum of the posterior probabilities of the models whose layer pure premium is ≤ x.
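The steps above (posterior probabilities under equal priors, posterior-weighted first and second moments, and the cdf of the layer pure premium) can be sketched as follows. The log-likelihoods and layer pure premiums at the bottom are made-up placeholders, not values from the paper, and the function names are illustrative.

```python
import math

def posterior_probs(logliks, priors=None):
    """Bayes' Theorem over a finite set of models; equal priors by default.
    Subtracting the largest log-likelihood avoids underflow in exp()."""
    n = len(logliks)
    if priors is None:
        priors = [1.0 / n] * n
    top = max(logliks)
    weights = [p * math.exp(ll - top) for ll, p in zip(logliks, priors)]
    total = sum(weights)
    return [w / total for w in weights]

def mean_and_sd(post, premiums):
    """Posterior-weighted mean and standard deviation of the layer pure premium."""
    mean = sum(p * x for p, x in zip(post, premiums))
    second = sum(p * x * x for p, x in zip(post, premiums))
    return mean, math.sqrt(max(second - mean * mean, 0.0))

def premium_cdf(post, premiums, x):
    """Sum of posterior probabilities of models whose layer pure premium is <= x."""
    return sum(p for p, prem in zip(post, premiums) if prem <= x)

# Placeholder inputs: one log-likelihood and one layer pure premium per model.
logliks = [-1500.2, -1499.1, -1501.7]
premiums = [12000.0, 18500.0, 9000.0]

post = posterior_probs(logliks)
mean, sd = mean_and_sd(post, premiums)
print("posterior:", [round(p, 4) for p in post])
print("mean:", round(mean, 1), "sd:", round(sd, 1))
print("Pr{premium <= 12000}:", round(premium_cdf(post, premiums, 12000.0), 4))
```

The same three functions apply unchanged when the list holds thousands of models, one per candidate family and parameter pair.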

Numerical Results

Histogram of Predictive Pure Premium

Example with Insurance Data
Continue with Bayesian Estimation
Liability insurance claim severity data
Prior distributions derived from models based on individual insurer data
Prior models reflect the maturity of claim data used in the estimation

Initial Insurer Models
Selected 20 insurers
  – Claim count in the thousands
Fit mixed exponential distribution to the data of each insurer
Initial fits had volatile tails
Truncation issues
  – Do small claims predict likelihood of large claims?

Initial Insurer Models

Low Truncation Point

High Truncation Point

Selections Made
Truncation point = $100,000
Family of cdf’s that has “correct” behavior
  – Admittedly the definition of “correct” is debatable, but
  – The choices are transparent!

Selected Insurer Models

Each model consists of
1. The claim severity distribution for all claims settled within 1 year
2. The claim severity distribution for all claims settled within 2 years
3. The claim severity distribution for all claims settled within 3 years
4. The ultimate claim severity distribution for all claims
5. The ultimate limited average severity curve

Three Sample Insurers
Small, Medium and Large
Each has three years of data
Calculate likelihood functions
  – Most recent year with #1 on prior slide
  – 2nd most recent year with #2 on prior slide
  – 3rd most recent year with #3 on prior slide
Use Bayes’ Theorem to calculate posterior probability of each model

Formulas for Posterior Probabilities
[Formulas: cell probabilities for model m; likelihood of model m from the number of claims in each cell; posterior probability using Bayes’ Theorem]
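The slide's formulas did not survive in this transcript. A plausible reconstruction from the labels alone is given here as a sketch. The notation is assumed rather than taken from the paper: model m has cdf F_m, the cell boundaries are x_0 < x_1 < ... < x_k, and n_i is the number of claims in cell i.

```latex
% Assumed notation: F_m is the cdf of model m, x_0 < x_1 < \dots < x_k are the
% cell boundaries, and n_i is the number of claims observed in cell i.
\begin{align*}
P_i^{(m)} &= F_m(x_i) - F_m(x_{i-1})
  && \text{cell probabilities for model } m \\
L(m) &= \prod_{i=1}^{k} \bigl(P_i^{(m)}\bigr)^{n_i}
  && \text{likelihood of model } m \\
\Pr\{m \mid \text{data}\} &= \frac{L(m)\,\Pr\{m\}}{\sum_{j} L(j)\,\Pr\{j\}}
  && \text{using Bayes' Theorem}
\end{align*}
```

With three years of data, one reading is that L(m) is the product of three such factors, each year using the model's severity distribution for the matching settlement lag.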

Results Taken from paper.

Formulas for Ultimate Layer Pure Premium
Use #5 on the model slide (3rd previous) to calculate the ultimate layer pure premium.
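Since the insurer models are mixed exponentials, the limited average severity (LAS) curve has a closed form, and a layer's average severity is a difference of two LAS values. The sketch below shows that calculation and the posterior-weighted average across models. The weights, means and posterior probabilities are hypothetical, the $100,000 truncation is ignored, and the result is a per-claim severity that would still need to be multiplied by an expected claim count to give a pure premium.

```python
import math

def limited_average_severity(limit, weights, means):
    """E[min(X, limit)] for a mixed exponential:
    sum of w_i * mu_i * (1 - exp(-limit / mu_i))."""
    return sum(w * mu * (1.0 - math.exp(-limit / mu))
               for w, mu in zip(weights, means))

def layer_average_severity(attachment, width, weights, means):
    """Average loss per claim in the layer: LAS(attachment + width) - LAS(attachment)."""
    return (limited_average_severity(attachment + width, weights, means)
            - limited_average_severity(attachment, weights, means))

def posterior_layer_severity(post, models, attachment, width):
    """Posterior-weighted average of each model's layer severity."""
    return sum(p * layer_average_severity(attachment, width, w, mu)
               for p, (w, mu) in zip(post, models))

# Hypothetical models: (mixing weights, exponential means) for each insurer model.
models = [
    ([0.70, 0.25, 0.05], [50_000.0, 500_000.0, 5_000_000.0]),
    ([0.80, 0.18, 0.02], [40_000.0, 400_000.0, 4_000_000.0]),
]
post = [0.6, 0.4]  # hypothetical posterior probabilities

print(round(posterior_layer_severity(post, models, 5_000_000.0, 5_000_000.0), 2))
```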

Results
All insurers were simulated from the same population.
Posterior standard deviation decreases with insurer size.

Possible Extensions
Obtain model for individual insurers
Obtain data for insurer of interest
Calculate likelihood, Pr{data|model}, for each insurer’s model.
Use Bayes’ Theorem to calculate posterior probability of each model
Calculate the statistic of choice using models and posterior probabilities
  – e.g. Loss reserves
