Text Mining: Approaches and Applications Claim Severity Case Study 2011 SOA Health Meeting Session 61 Jonathan Polon FSA www.claimanalytics.com.

Slides:



Advertisements
Similar presentations
Simplifications of Context-Free Grammars
Advertisements

1
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
A Transition Matrix Representation of the Algorithmic Statistical Process Control Procedure with Bounded Adjustments and Monitoring Changsoon Park Department.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
1 Balloting/Handling Negative Votes September 11, 2006 ASTM Training Session Bob Morgan Brynn Iwanowski.
UNITED NATIONS Shipment Details Report – January 2006.
David Burdett May 11, 2004 Package Binding for WS CDL.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
State of New Jersey Department of Health and Senior Services Patient Safety Reporting System Module 2 – New Event Entry.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Exit a Customer Chapter 8. Exit a Customer 8-2 Objectives Perform exit summary process consisting of the following steps: Review service records Close.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.
Custom Services and Training Provider Details Chapter 4.
CALENDAR.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 10 second questions
1 Discreteness and the Welfare Cost of Labour Supply Tax Distortions Keshab Bhattarai University of Hull and John Whalley Universities of Warwick and Western.
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
EU market situation for eggs and poultry Management Committee 20 October 2011.
Regression with Panel Data
2 |SharePoint Saturday New York City
IP Multicast Information management 2 Groep T Leuven – Information department 2/14 Agenda •Why IP Multicast ? •Multicast fundamentals •Intradomain.
VOORBLAD.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
 Copyright I/O International, 2013 Visit us at: A Feature Within from Item Class User Friendly Maintenance  Copyright.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Adding Up In Chunks.
Statistical Analysis SC504/HS927 Spring Term 2008
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Before Between After.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Subtraction: Adding UP
What is the value of this coin? How much change is here? 6.
Januar MDMDFSSMDMDFSSS
Analyzing Genes and Genomes
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Exponents and Radicals
Intracellular Compartments and Transport
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Multiple Regression and Model Building
Educator Evaluation: A Protocol for Developing S.M.A.R.T. Goal Statements.
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
1 Percentage: A commonly used relative quantity..
Presentation transcript:

Text Mining: Approaches and Applications Claim Severity Case Study 2011 SOA Health Meeting Session 61 Jonathan Polon FSA

2 Agenda Text Mining for Health Insurers Case Study Overview Text Mining Results Questions

Text Mining For Health Insurers

4 Text Mining for Health Insurance Risk Measurement Underwriting Pricing Claims Management Fraud detection Claim approval Case management 4

5 Sources of Text Application Process Application for insurance Attending physician statements Call center logs Post Claim Claim application Attending physician statements Adjuster notes Call center logs Other correspondence 5

6 Why Use Text Mining? May contain information not available in structured data fields May contain subjective data (eg, expert opinions) May be an early indicator of severity –Lags in receiving treatment –Lags in receiving and processing bills 6

7 Case Study Overview

8 Project Overview Workers compensation business Medical only claims 15 days from First Notice on Loss (FNOL) For each claim predict likelihood that Total Claim Cost will exceed a specified threshold 8

9 Data Sources 9 Claimant: AgeGender Zip CodeSIC Code Marital statusTenure Injury: Loss dateReport date Type of injBody part LocationWitnesses Medical Bills: AmbulanceInpatient TestsProcedures PrescriptionsProviders Text: Claim activityEquipment Inj descriptionOccupation Claim adjuster notes

10 Case Study Text Mining

11 Modeling Approach 1.Exploratory stage: a.Train models without any text mining b.Train models exclusively with text mining 2.Intermediate stage: a.Apply text mining to predict residuals of non-text model 3.Final model: a.Combine text and non-text predictors using the findings from Steps 1a and 2a 11

12 Text Mining Considerations 1.Word frequencies 2.Stemming 3.Exclusion list 4.Phrases 5.Synonyms 6.Negatives 7.Singular value decomposition 12

13 1.Word Frequencies Text mining for predictive modeling: –Identify words or phrases that occur frequently within the text –Test to see if any of these words or phrases are predictive of the event being modeled –Typically limit analysis to words whose frequency in the text exceeds a minimum amount (eg, is contained in at least 3% of all records) 13

14 Word Frequency Example Word% of Records Employee62.3% Doctor47.8% Back23.0% Hand17.2% Contact14.1% Pay11.8% Lift8.7% Pain7.6% Strain5.5% Visit4.2% Clinic3.4% 14

15 2.Stemming Reduce words to their roots so that related words are treated as the same For example: –Investigate, investigated, investigation, investigator –Can all be stemmed to investigat and treated as the same word 15

16 3.Exclusion List Common words that carry little meaning can be defined and excluded from the text mining analysis For example: the, of, and are unlikely to provide predictive value 16

17 4.Phrases Common phrases may be pre-specified by the user to consider as one string –Eg, lower back, lost time N-grams: count frequency of every combination of N consecutive words –May be more effective to identify groups of words that appear together frequently even if not consecutively 17

18 5.Synonyms Words with the same meaning can be considered as the same –Eg, doctor, dr, physician, gp –Eg, acetaminophen, Tylenol, APAP –Eg, return to work, rtw 18

19 6.Negatives Should negatives be isolated? –Eg, no pain vs pain Negatives may be difficult to identify: –MRI not required, no MRI required, does not need an MRI, no need for an MRI The mention of a negative may imply concern In this case study, negatives provided small amount of lift but not isolated for final model due to practical considerations 19

20 7.Singular Value Decomposition Similar to Principal Components Analysis Convert a vector of word counts into lower dimension while maximizing retention of info In essence, a numeric summary of the observed word frequencies for a record Drawback is lack of interpretability of results –End users may wish to understand which word is driving the risk assessment 20

21 Word Frequencies by Record RecordWord 1 Word 2 Word 50 Word 10 0 Word 20 0 Word k

22 Singular Value Decomposition RecordVal 1 Val SVD compresses k-dimensions (one per each word) to lower dimensionality (eg, 1, 2 or 3) The compression algorithm maximizes the information retained Each new dimension is a linear combination of the original k- dimensions

23 Predicting Outcomes with Text Predictor variables are the word frequencies –Or binary variables indicating presence of word May be several hundreds or thousands of these Select a subset to include in final model –Univariate analysis –CART –Stepwise regression 23

24 Stepwise Regression Backward stepwise regression: –Build regression model with all variables –Remove the one var that results in least loss of fit –Continue until marginal decrease in fit > threshold Forward stepwise regression: –Build regression model with one var with best fit –Add the one variable that results in most lift –Continue until marginal increase in lift < threshold 24

25 Case Study Results

26 Text Mining: Phrases Selected 26 CombinedText Only # Total Phrases915 # Phrases: Claims Mgmt Action56 # Phrases: Medical Procedures22 # Phrases: Injury Type14 # Phrases: Type of Medical Provider 12 # Phrases: Time reference01

27 Text Mining: Phrases Selected 27 CombinedText Only # Total Phrases915 # Phrases: Claims Mgmt Action56 # Phrases: Medical Procedures22 # Phrases: Injury Type14 # Phrases: Type of Medical Provider 12 # Phrases: Time reference01

28 Model Evaluation Measuring goodness of fit should be performed on out-of-sample data –Protects against overfit and ensures model is robust –For this project, 10% of data was held back Measures for comparing goodness of fit include: –Gains or lift charts –Squared error 28

29 Cumulative Gains Chart - Baseline 29 % Severe Claims % All Claims Baseline Perfect Area between the two curves is the models lift

30 Cumulative Gains Chart – No Text 30 % Severe Claims % All Claims Baseline No Text Area between the two curves is the models lift

31 Cumulative Gains Chart – Text Only 31 % Severe Claims % All Claims Baseline No Text Text Only Text-only performs slightly better than no text

32 Cumulative Gains Chart – Combined 32 % Severe Claims % All Claims Baseline No Text Text Only Combined Combined text and non-text model performs best

33 Case Study Findings Text-only model slightly better than model without text Combined (text and non-text) model performs best Analyzing text can be simpler than summarizing medical bill transaction data Text mining is easy to interpret: certain words or phrases are correlated with higher or lower risk Text mining may provide extra lift for less experienced modelers –Adding additional strong predictors may compensate for other modeling deficiencies 33

34 Questions 34