Dependence between mortality and morbidity: is underwriting scoring really different for Life and Health products? Andrey Kudryavtsev, St.Petersburg State.

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Assumptions underlying regression analysis
Structural Equation Modeling
1 Health Warning! All may not be what it seems! These examples demonstrate both the importance of graphing data before analysing it and the effect of outliers.
Inference for Regression
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
The Simple Regression Model
European University at St. Petersburg THE METHOD OF QUANTILE REGRESSION, A NEW APPROACH TO ACTUARIAL MATHEMATICS Authors: Ruslan Abduramanov Andrey Kudryavtsev.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
T-Tests Lecture: Nov. 6, 2002.
Lecture Slides Elementary Statistics Twelfth Edition
7-2 Estimating a Population Proportion
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Data Analysis Statistics. Inferential statistics.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Today Concepts underlying inferential statistics
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Formalizing the Concepts: Simple Random Sampling.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
STA Lecture 201 STA 291 Lecture 20 Exam II Today 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234 Bring a calculator,
Review of normal distribution. Exercise Solution.
Experimental Statistics - week 2
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Estimation and Confidence Intervals
ESTIMATING with confidence. Confidence INterval A confidence interval gives an estimated range of values which is likely to include an unknown population.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
© 2002 Thomson / South-Western Slide 8-1 Chapter 8 Estimation with Single Samples.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimates and Sample Sizes Lecture – 7.4
Abcd 2001 Healthcare Conference Learning from Experience October 2001 © Peter Turvey.
And, now take you into a WORLD of……………...
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7-2 Estimating a Population Proportion.
Estimating a Population Proportion
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Conditions for Constructing a Confidence Interval
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
1 Inferences About The Pearson Correlation Coefficient.
+ “Statisticians use a confidence interval to describe the amount of uncertainty associated with a sample estimate of a population parameter.”confidence.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Estimating a Population Mean:  Known
Statistics for Decision Making Basic Inference QM Fall 2003 Instructor: John Seydel, Ph.D.
Chapter 10 The t Test for Two Independent Samples
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Math 4030 Final Exam Review. Probability (Continuous) Definition of pdf (axioms, finding k) Cdf and probability (integration) Mean and variance (short-cut.
Chapters 6 & 7 Overview Created by Erin Hodgess, Houston, Texas.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
 Assumptions are an essential part of statistics and the process of building and testing models.  There are many different assumptions across the range.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
+ Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Inference about the slope parameter and correlation
Inference: Conclusion with Confidence
Inferences and Conclusions from Data
Presentation transcript:

Dependence between mortality and morbidity: is underwriting scoring really different for Life and Health products? Andrey Kudryavtsev, St.Petersburg State University, Russia 1.A. Stochastic Dependence 9.A. Various Topics

Aim to show that underwriting scores are quite close to each other for different kinds of insurance products, say for life and health insurance If so, there are problems in portfolio construction because of –risks may be more dependent, –possible higher degree of risk accumulation

Idea to compare underwriting scores for life and health risks of a sample population Results –help to understand question how to use and interpret the underwriting scores –do NOT help to solve any questions of statistical estimation

Methodology The sample population used is investigated from medical point of view The medical records and reviews were used to produce the averaging underwriting scores for life and health risks The scores are comparing to estimate the existence and degree of correlations The idea of modelling with copula is analysed

The investigation paper is based on the special study with data collection for real group of people The number of people studied was 769 The study took place in 2000 The basic aim of the study was mostly medical It included two parts: –deep medical investigation –survey about people’s preferences in healthcare

The place of investigation Lyssye Gory – a small town in Central Russia in Saratov Region (downstream river Volga, south-east from Moscow) WHY: –typical agricultural province in Russia with some industrial development –an appropriate professional mix of population

The target group people living in one medical district additional restrictions: –age interval chosen (from 20 to 49 including the latter age) –full set of the covariates (risk factors) investigated

Reasons for age restrictions Young people (younger than 20 year old) are presumably completely healthy: probably no extra life and health risks Old people (50+) are probably quite ill: the dependence observed between life and health risks is basically explained with poor health Only chosen age range (20 to 49) demonstrates balanced mixture of risk sub-groups

The basic risk factor chosen job/profession (with additional information about working conditions) height/weight index existing conditions (current diseases) addictions (tobacco smoking and alcohol drinking) heredity factors (indirectly estimated)

The Underwriting Manuals used Insurers: –Skandia International Insurance Corporation –Munich Re –Cologne Re There are some differences in those company- specific scoring procedures Resulting score was equal to arithmetic average between company-specific scores (all three manuals for life score and Skandia and Cologne Re manuals for health score)

Underwriting scoring Risks estimated –Life (extra mortality score under whole life insurance contract ) –Health (permanent health (income protection) insurance with 4 weeks of waiting periods) The choice of health scoring –it shows quite serious problem with health –too serious (very long) diseases are rare

Rounding the individual scores Score intervalFinal score up to from 101 to from 136 to from 176 to from 226 to from 276 to more then 326>300

The distribution of people investigated Life score Health scoreTotal > >300> Total

The distribution of people investigated there is some form of dependence the coefficient of correlation is 0,6312 quite large – the actual t-test value is 24,6 that is much higher than the critical value nevertheless, it is far from comonotonic (one- to-one functional) dependence the dependence could not be explained only with mortality risks in permanent health (income protection) products as it is too high

Standard/sub-standard proportions Life risksHealth risksTotal standardsub- standard standard sub- standard Total

Standard/sub-standard dependence: conclusions there is large enough dependence between life and health scores even for age intervals where it is not highly expected from the point of view of health dynamics with age actuaries and underwriters should be more careful with assumptions about the existence of independence between different Life and Health products in context of ALM and similar concepts

Standard/sub-standard dependence: analysis The important result is that the proportion of standard risks is 27,5 per cent for life score and 22,69 per cent for health score It is too small The odd of standard and sub-standard risks (1:3) is different from usual odd for life insurance portfolios (9:1)

Standard/sub-standard dependence: explanations The differencies could be explained with a)more conservative estimation under the investigation than one in insurance practice b)self-selection of potential clients with poor health c)full informational support in the investigation vs. informational deficit in practice of insurance The latter explanation is important for insurance practice

Dependence among sub-standard risks Correlation coefficient is 0,84 It is even more than for all risks The idea is to develop more formal model than simple statistical coefficient, say, copulas It helps to understand the character of dependence in more details

Marginal distributions They are conditional as the risks analysed are sub-standard The last two “boxes” (300 and ‘>300’) for health risk scores should be combined Both distributions were fitted using Maximum Likelihood method In both cases, the best goodness-of-fit (measured with χ 2 -test) was achieved on Log-Normal distribution

Marginal distributions Values for life riskshealth risks Distribution parameter μ3,7614,545 Distribution parameter σ1,0432,088 Degrees of freedom43 χ 2 -test8,811,39 p-value0,0660,709

Copula As a first choice, the normal copula could be used where is the bivariate Normal distribution function with zero vector of expected values and covariation matrix

Copula: conclusions As marginal distributions in our case are Log- Normal, the copula simply gives the bivariate Log-Normal distribution Unfortunately, the model is not well calibrated Other copulas tend to bring much more complex formulas Such models may be quite simple tools for portfolio modelling in the context of ALM or similar concepts

Thank You!