A Beginner’s Guide to Bayesian Modelling Peter England, PhD EMB GIRO 2002.


Outline
- An easy one-parameter problem
- A harder one-parameter problem
- Problems with multiple parameters
- Modelling in WinBUGS
- Stochastic Claims Reserving
- Parameter uncertainty in DFA

Bayesian Modelling: General Strategy
- Specify distribution for the data
- Specify prior distributions for the parameters
- Write down the joint distribution
- Collect terms in the parameters of interest
- Recognise the (conditional) posterior distribution?
  - Yes: Estimate the parameters, or sample directly
  - No: Sample using an appropriate scheme
- Forecasting: Recognise the predictive distribution?
  - Yes: Estimate the parameters
  - No: Simulate an observation from the data distribution, conditional on the simulated parameters

A One Parameter Problem
- Data sample: [3, 8, 5, 9, 5, 8, 4, 8, 7, 3]
- Distributed as a Poisson random variable?
- Use a Gamma prior for the mean of the Poisson
- Predicting a new observation? Negative Binomial predictive distribution

Poisson Example 1 – Estimation

Poisson Example 1 – Prediction

One Parameter Problem: Simple Case
- We can recognise the posterior distribution of the parameter
- We can recognise the predictive distribution
- No simulation required
- (We can use simulation if we want to)
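As a minimal sketch of the simple case, the Python below applies the standard Gamma-Poisson conjugate update to the data sample [3, 8, 5, 9, 5, 8, 4, 8, 7, 3] from the one-parameter problem. The prior parameters (shape 1, rate 0.1) and all function names are assumptions chosen for illustration; the values used on the original slides are not given in this transcript.

```python
DATA = [3, 8, 5, 9, 5, 8, 4, 8, 7, 3]

# Hypothetical Gamma prior for the Poisson mean (shape, rate); not from the slides.
PRIOR_SHAPE, PRIOR_RATE = 1.0, 0.1


def gamma_poisson_update(data, prior_shape, prior_rate):
    """Return (shape, rate) of the Gamma posterior for a Poisson mean."""
    return prior_shape + sum(data), prior_rate + len(data)


def predictive_moments(shape, rate):
    """Mean and variance of the Negative Binomial predictive distribution.

    The variance is the process variance (shape / rate) plus the
    estimation variance (shape / rate**2).
    """
    mean = shape / rate
    variance = shape / rate + shape / rate**2
    return mean, variance


post_shape, post_rate = gamma_poisson_update(DATA, PRIOR_SHAPE, PRIOR_RATE)
post_mean = post_shape / post_rate
pred_mean, pred_var = predictive_moments(post_shape, post_rate)

print(f"Posterior: Gamma(shape={post_shape}, rate={post_rate:.1f})")
print(f"Posterior mean of the Poisson mean: {post_mean:.3f}")
print(f"Predictive mean {pred_mean:.3f}, predictive variance {pred_var:.3f}")
```

Because both distributions are recognised, nothing here is simulated: the posterior and predictive moments come straight from the updated parameters.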

Variability of a forecast
- Includes estimation variance and process variance
- Analytic solution: estimate the two components
- Bayesian solution: simulate the parameters, then simulate the forecast conditional on the parameters
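The two-stage simulation can be sketched as follows. This is an illustration under stated assumptions, not the author's code: it takes a Gamma(61, 10.1) posterior for a Poisson mean (which is what a hypothetical Gamma(1, 0.1) prior gives with the ten observations of the one-parameter problem), and the helper names are invented for the example. The Poisson sampler uses Knuth's multiplication method because the Python standard library has no Poisson generator.

```python
import math
import random
import statistics


def sample_poisson(mean, rng):
    """Draw one Poisson variate by Knuth's multiplication method (small means only)."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_forecasts(post_shape, post_rate, n_sims, seed=0):
    """Simulate the parameter, then a forecast conditional on that parameter."""
    rng = random.Random(seed)
    means, forecasts = [], []
    for _ in range(n_sims):
        # random.gammavariate takes (shape, scale), so scale = 1 / rate.
        lam = rng.gammavariate(post_shape, 1.0 / post_rate)
        means.append(lam)
        forecasts.append(sample_poisson(lam, rng))
    return means, forecasts


# Assumed posterior parameters (see the lead-in); not taken from the slides.
means, forecasts = simulate_forecasts(61.0, 10.1, n_sims=20000)

estimation_var = statistics.pvariance(means)  # spread of the simulated parameter
process_var = statistics.fmean(means)         # Poisson variance equals its mean
total_var = statistics.pvariance(forecasts)   # spread of the simulated forecasts

print(f"estimation {estimation_var:.3f} + process {process_var:.3f} "
      f"vs total {total_var:.3f}")
```

The variance of the simulated forecasts should be close to the sum of the two components, which is the decomposition the analytic approach estimates piece by piece.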

Main Features of Bayesian Analysis
- Focus is on distributions (of parameters or forecasts), not just point estimates
- The mode of posterior or predictive distributions is analogous to “maximum likelihood” in classical statistics

One Parameter Problem: Harder Case
- Use a log link between the mean and the parameter, that is: log(mean) = theta
- Use a normal distribution for the prior
- What is the posterior distribution?
- How do we simulate from it?

Poisson Example 2 – Estimation

Poisson Example 2
- Step 1: Use adaptive rejection sampling (ARS) from the log density to sample the parameter
- Step 2: For prediction, sample from a Poisson distribution with mean exp(theta), with theta simulated at step 1
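The two steps can be sketched in Python. Note the substitution: the slides use adaptive rejection sampling, whereas this sketch uses a random-walk Metropolis sampler, which is simpler to write in a few lines and targets the same posterior. The normal prior (mean 0, standard deviation 10), the step size, and all names are assumptions for illustration only.

```python
import math
import random
import statistics

DATA = [3, 8, 5, 9, 5, 8, 4, 8, 7, 3]
PRIOR_MEAN, PRIOR_SD = 0.0, 10.0  # hypothetical vague normal prior on theta


def log_posterior(theta):
    """Unnormalised log posterior of theta, where log(Poisson mean) = theta."""
    log_likelihood = theta * sum(DATA) - len(DATA) * math.exp(theta)
    log_prior = -0.5 * ((theta - PRIOR_MEAN) / PRIOR_SD) ** 2
    return log_likelihood + log_prior


def sample_poisson(mean, rng):
    """Draw one Poisson variate by Knuth's multiplication method (small means only)."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def run_metropolis(n_iter=20000, burn_in=5000, step=0.3, seed=1):
    """Step 1: sample theta (Metropolis, not ARS). Step 2: predict given theta."""
    rng = random.Random(seed)
    theta = math.log(statistics.fmean(DATA))  # start near the data
    current = log_posterior(theta)
    thetas, predictions = [], []
    for i in range(n_iter):
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_posterior(proposal)
        # Accept with probability min(1, posterior ratio); 1 - random() avoids log(0).
        if math.log(1.0 - rng.random()) < candidate - current:
            theta, current = proposal, candidate
        if i >= burn_in:
            thetas.append(theta)
            predictions.append(sample_poisson(math.exp(theta), rng))
    return thetas, predictions


thetas, predictions = run_metropolis()
print(f"Posterior mean of theta: {statistics.fmean(thetas):.3f}")
print(f"Predictive mean: {statistics.fmean(predictions):.3f}")
```

With such a vague prior the posterior for theta should sit close to the log of the sample mean, and the predictive mean close to the sample mean itself.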

A Multi-Parameter Problem
- From Scollnik (NAAJ, 2001)
- 3 group workers' compensation policies
- Exposure measured using payroll as a proxy
- Number of claims available for each of the last 4 years
- Problem is to describe claim frequencies in the forecast year

Scollnik Example 1

Scollnik Example 1 Posterior Distributions

Scollnik Example 1
- Use Gibbs sampling
  - Iterate through each parameter in turn
  - Sample from the conditional posterior distribution, treating the other parameters as fixed
- Sampling is easy for some of the parameters; ARS is used for the rest
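A minimal Gibbs sampler in Python illustrates the iteration. Everything specific here is an assumption: the model is a simplified hierarchical Poisson-Gamma structure rather than Scollnik's exact specification (which this transcript does not reproduce), the payroll and claim figures are made up, and the Gamma shape ALPHA is held fixed so that every conditional posterior is a recognisable Gamma distribution and no ARS step is needed.

```python
import random
import statistics

# Made-up exposure (payroll) and claim counts: 3 groups x 4 years. Not Scollnik's data.
PAYROLL = [[2.8, 3.2, 2.7, 3.6], [2.6, 2.8, 2.4, 2.9], [1.5, 1.4, 1.6, 1.7]]
CLAIMS = [[9, 7, 6, 13], [6, 4, 2, 8], [6, 5, 8, 7]]

# Assumed model: claims ~ Poisson(payroll * theta_g), theta_g ~ Gamma(ALPHA, rate=beta),
# beta ~ Gamma(C, rate=D). ALPHA, C and D are hypothetical fixed values.
ALPHA = 2.0
C, D = 1.0, 1.0


def gibbs(n_iter=6000, burn_in=1000, seed=2):
    """Iterate through each parameter, sampling from its conditional posterior."""
    rng = random.Random(seed)
    n_groups = len(CLAIMS)
    beta = 1.0  # arbitrary starting value
    theta_draws = [[] for _ in range(n_groups)]
    for i in range(n_iter):
        # theta_g | beta, data ~ Gamma(ALPHA + total claims, rate = beta + total payroll)
        theta = [
            rng.gammavariate(ALPHA + sum(CLAIMS[g]), 1.0 / (beta + sum(PAYROLL[g])))
            for g in range(n_groups)
        ]
        # beta | theta ~ Gamma(C + n_groups * ALPHA, rate = D + sum of the thetas)
        beta = rng.gammavariate(C + n_groups * ALPHA, 1.0 / (D + sum(theta)))
        if i >= burn_in:
            for g in range(n_groups):
                theta_draws[g].append(theta[g])
    return theta_draws


theta_draws = gibbs()
posterior_means = [statistics.fmean(draws) for draws in theta_draws]
for g, mean in enumerate(posterior_means, start=1):
    print(f"Group {g}: posterior mean claim frequency per unit payroll {mean:.2f}")
```

Each pass treats the other parameters as fixed at their latest simulated values, which is the defining feature of Gibbs sampling; a parameter without a recognisable conditional would need a scheme such as ARS at that step.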

WinBUGS
- WinBUGS is an expert system for Bayesian analysis
- You specify
  - The distribution of the data
  - The prior distributions of the parameters
- WinBUGS works out the conditional posterior distributions
- WinBUGS decides how to sample the parameters
- WinBUGS uses Gibbs sampling for multiple parameter problems

Stochastic Claims Reserving
- Changes the focus from a “best estimate” of reserves to a predictive distribution of outstanding liabilities
- Most stochastic methods to date have only considered second-moment properties (variance) in addition to a “best estimate”
- Bayesian methods can be used to investigate a full predictive distribution, and incorporate judgement (through the choice of priors)
- For more information, see England and Verrall (BAJ, 2002)

The Bornhuetter-Ferguson Method
- Useful when the data are unstable
- First get an initial estimate of ultimate
- Estimate chain-ladder development factors
- Apply these to the initial estimate of ultimate to get an estimate of outstanding claims

Conceptual Framework

Estimates of outstanding claims
To estimate ultimate claims using the chain ladder technique, you would multiply the latest cumulative claims in each row by f, a product of development factors. Hence, an estimate of what the latest cumulative claims should be is obtained by dividing the estimate of ultimate by f. Subtracting this from the estimate of ultimate gives an estimate of outstanding claims: estimate of ultimate × (1 − 1/f).

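The Bornhuetter-Ferguson Method
Given an initial estimate of ultimate claims for accident year i, the estimate of outstanding claims for accident year i is that initial estimate multiplied by (1 − 1/f), where f is the product of the remaining development factors for that year.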

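Comparison with Chain-ladder
In the Bornhuetter-Ferguson method, the initial estimate of ultimate divided by f replaces the latest cumulative claims for accident year i, to which the usual chain-ladder parameters are applied to obtain the estimate of outstanding claims. For the chain-ladder technique, the estimate of outstanding claims is the latest cumulative claims multiplied by (f − 1).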
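The arithmetic of the two methods can be compared with a short Python sketch. The development factors, latest cumulative claims, and initial estimate of ultimate are hypothetical numbers, and the function names are invented for the example.

```python
from math import prod


def chain_ladder_outstanding(latest_cumulative, f):
    """Chain ladder: ultimate is latest * f, so outstanding is latest * (f - 1)."""
    return latest_cumulative * (f - 1.0)


def bf_outstanding(initial_ultimate, f):
    """Bornhuetter-Ferguson: initial ultimate minus (initial ultimate / f)."""
    return initial_ultimate * (1.0 - 1.0 / f)


# Hypothetical inputs for one accident year.
remaining_factors = [1.5, 1.2, 1.1]  # development factors still to be applied
f = prod(remaining_factors)          # product of development factors
latest_cumulative = 1000.0
initial_ultimate = 2200.0

cl = chain_ladder_outstanding(latest_cumulative, f)
bf = bf_outstanding(initial_ultimate, f)
print(f"f = {f:.2f}, chain ladder outstanding = {cl:.2f}, BF outstanding = {bf:.2f}")

# If the initial ultimate equals the chain-ladder ultimate, the two methods agree.
agree = bf_outstanding(latest_cumulative * f, f)
print(f"BF with chain-ladder ultimate as the initial estimate: {agree:.2f}")
```

The only difference between the two functions is which quantity is trusted: the latest cumulative claims for chain ladder, or the initial estimate of ultimate for Bornhuetter-Ferguson.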

Multiplicative Model for Chain-Ladder

BF as a Bayesian Model
- Put a prior distribution on the row parameters
- The Bornhuetter-Ferguson method assumes there is prior knowledge about these parameters, and therefore uses a Bayesian approach
- The prior information could be summarised as prior distributions for the row parameters

BF as a Bayesian Model
- Using a perfect prior (very small variance) gives results analogous to the BF method
- Using a vague prior (very large variance) gives results analogous to the standard chain ladder model
- In a Bayesian context, uncertainty associated with a BF prior can be incorporated

Parameter Uncertainty in DFA
- Often, in DFA, forecasts are obtained using simulation, assuming the underlying parameters are fixed (for example, a standard application of Wilkie’s model)
- Including parameter uncertainty may not be straightforward in the absence of a Bayesian framework, which includes it naturally
- Ignoring parameter uncertainty will underestimate the true uncertainty!

Summary
- Bayesian modelling using simulation methods can be used to fit complex models
- Focus is on distributions of parameters or forecasts
- Mode is analogous to “maximum likelihood”
- It is a natural way to include parameter uncertainty when forecasting (e.g. in DFA)

References
- Scollnik, DPM (2001) Actuarial Modeling with MCMC and BUGS, North American Actuarial Journal, 5 (2).
- England, PD and Verrall, RJ (2002) Stochastic Claims Reserving in General Insurance, British Actuarial Journal, Volume 8, Part II (to appear).
- Spiegelhalter, DJ, Thomas, A and Best, NG (1999) WinBUGS Version 1.2 User Manual, MRC Biostatistics Unit.