Machine Learning Interpretability

Presentation transcript:

Machine Learning Interpretability
  Thuy Nguyen, Mihir Jain, Edward Adcock, Toby Alfred-Jones
  © COPYRIGHT | Delta Capita | CONFIDENTIAL

Contents
  01 Project objective
  02 Use cases
  03 Data overview
  04 Neural Network model description
  05 Machine Learning Interpretability
  06 Results
  07 Visualization
  08 Next steps

01. Project objective

Project objective
  Delta Capita has developed a Neural Network model that estimates the likelihood of mortgage default from a set of features describing an individual loan.
  The aim of the project is to develop knowledge extraction techniques that interpret this mortgage default model.

02. Use cases

Use cases
  The ability to accurately predict loan default is especially useful in two domains:
    Mortgage-backed security investing
    Risk management

03. Data overview
  Overview of data
  Data features
  Data features (provided)
  Data features (created)
  Data features (added)
  Model preparation

Overview of data
  Main dataset: Freddie Mac Single-Family Loans
    Time frame: 1999 to 2016
    Size: 15.3m unique loans with 326m monthly performance updates
    Types of loans: Default: 85k; Fully Paid: 15.2m
    Ratio of Default to Fully Paid loans: 0.6 : 99.4
  Additional datasets:
    Average National Mortgage Interest Rate: monthly national interest rate for standard mortgages, January 1999 to July 2016
    Housing Price Index per State: monthly House Price Index in each U.S. state, January 1999 to July 2016
    Unemployment Rate per State: seasonally adjusted unemployment rate in each U.S. state, January 1999 to July 2016

Data features
  We split the features into three sets:
    1. Data features provided in the main dataset
    2. Data features created from the main dataset
    3. Data features added from external sources
  We later evaluate the model's performance on data from Feature Sets 1, 2 and 3 combined.

Data features (provided)
  Origination features (assessment at the time of the loan application):
    Credit Score
    Original Unpaid Principal Balance
    First Payment Date
    Loan-To-Value Ratio
    First Time Home Buyer Flag
    Interest Rate
    Maturity Date
    Channel of origination of a loan
    Metropolitan Statistical Area
    Prepayment Penalty Mortgage Flag
    Mortgage Insurance Percentage
    Product Type
    Number of Units in a Property
    Property Type
    Occupancy Status
    Property State
    Combined Loan-To-Value Ratio
    Loan Purpose
    Debt-To-Income (DTI) Ratio
    Original Loan Term
    Number of Borrowers
  Monthly performance update features (evaluation of ongoing loans on a monthly basis):
    Monthly Reporting Period
    Current Actual Unpaid Principal Balance
    Loan Age
    Remaining Months to Legal Maturity
    Current Interest Rate

Data features (created)
  Based on the history of the current loan:
    Occurrences of each Loan Status (30-dd, 60-dd, 90-dd, foreclosed, etc.)
    Occurrences of each Loan Status in the last 12 months
    Percentage change between Last Balance and Current Balance
  Based on the history of all loans:
    Number of active loans per State/Zip-code
    Number of loans taken out per State/Zip-code
    Number of loans taken out per State/Zip-code in the last 12 months
    Default Rate per State/Zip-code
    Default Rate per State/Zip-code in the last 12 months
    Occurrences of ‘Paid Off’ and ‘Default’ per State/Zip-code
    Occurrences of ‘Paid Off’ and ‘Default’ per State/Zip-code in the last 12 months
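The two per-loan history features can be illustrated with a minimal sketch. The status strings, the example history and the handling of a zero previous balance are assumptions made for illustration; the slides do not show how the features were actually computed.

```python
from collections import Counter

def status_counts_last_12(statuses):
    """Count each loan status over the most recent 12 monthly records."""
    return Counter(statuses[-12:])

def balance_pct_change(last_balance, current_balance):
    """Percentage change from the last balance to the current balance."""
    if last_balance == 0:
        return 0.0  # assumption: treat a zero previous balance as "no change"
    return 100.0 * (current_balance - last_balance) / last_balance

# Hypothetical monthly status history for one loan, oldest record first.
history = ["current"] * 10 + ["30-dd", "60-dd", "current", "30-dd"]
counts = status_counts_last_12(history)
change = balance_pct_change(200_000.0, 198_000.0)
```

Here `counts` holds the number of occurrences of each status in the last 12 months and `change` is the percentage change in balance.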

Data features (added)
  Economic features:
    Monthly Unemployment Rate per State
    Monthly Housing Price Index per State
    Monthly National Interest Rate
  Extra features created from the added datasets:
    Difference between Current Interest Rate and National Interest Rate
    Number of months in which the Mortgage Interest Rate is less than the National Interest Rate
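As a rough illustration of the two extra features derived from the interest-rate data, the sketch below uses made-up monthly rates. The function names and the sample values are hypothetical.

```python
def rate_spread(current_rate, national_rate):
    """Difference between the loan's current rate and the national rate."""
    return current_rate - national_rate

def months_below_national(loan_rates, national_rates):
    """Number of months in which the loan's rate was below the national rate."""
    return sum(1 for loan, national in zip(loan_rates, national_rates) if loan < national)

# Hypothetical monthly series (in percent), aligned by month, oldest first.
loan_rates = [4.0, 4.0, 4.0, 4.0]
national_rates = [3.8, 4.0, 4.2, 4.5]

spread = rate_spread(loan_rates[-1], national_rates[-1])
months = months_below_national(loan_rates, national_rates)
```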

Model preparation
  Class imbalance:
    Under-sampling applied to the training set
    New ratio of Default to Fully Paid loans: 15 : 85
  Categorical data:
    One-hot encoding; for example, if the property is in New York the value is 1, otherwise 0
  Randomly shuffle the data
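The preparation steps above can be sketched as follows. This is a minimal illustration on synthetic rows, not the project's pipeline: the row layout, the field names `state` and `label`, and the helper names are all assumptions.

```python
import random

def undersample(rows, label_key, minority_label, minority_share, seed=0):
    """Randomly drop majority rows until the minority class makes up minority_share."""
    rng = random.Random(seed)
    minority = [r for r in rows if r[label_key] == minority_label]
    majority = [r for r in rows if r[label_key] != minority_label]
    # Number of majority rows that yields the requested class ratio.
    n_majority = round(len(minority) * (1 - minority_share) / minority_share)
    kept = rng.sample(majority, min(n_majority, len(majority)))
    sample = minority + kept
    rng.shuffle(sample)  # random shuffle, as on the slide
    return sample

def one_hot(rows, key):
    """Replace a categorical column with one 0/1 column per category."""
    categories = sorted({r[key] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != key}
        for category in categories:
            new_row[f"{key}_{category}"] = 1 if r[key] == category else 0
        encoded.append(new_row)
    return encoded

# Synthetic data: 6 defaults in 1,000 loans, close to the 0.6 : 99.4 ratio.
states = ["NY", "CA", "TX"]
loans = [
    {"state": states[i % 3], "label": "Default" if i < 6 else "Fully Paid"}
    for i in range(1000)
]
train = one_hot(undersample(loans, "label", "Default", 0.15), "state")
```

With these inputs the under-sampled set keeps all 6 defaults and 34 fully paid loans, which gives the 15 : 85 ratio.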

04. Neural Network Model Description
  Model architecture
  Performance evaluation metrics
  Model performance

Model Architecture
  We use a Neural Network to build the mortgage classification model.
  Model classes: Default or Fully Paid
  Model output: a value between 0 and 1, which represents the probability of Default
  Threshold value: 0.5 (a value above 0.5 predicts a Default loan)
  Model architecture:
    Layers                                  Number of layers   Number of neurons
    Input layer (number of loan features)   1                  133*
    Hidden layers                           2                  100 : 100
    Output layer (number of classes)
  * Of the 133 input features, only 64 are unique loan features.
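To make the shape of the network concrete, here is a minimal forward-pass sketch with 133 inputs and two hidden layers of 100 neurons. Several details are assumptions because the slide does not state them: the single sigmoid output unit, the ReLU hidden activations, and the random (untrained) weights. It illustrates the architecture and the 0.5 threshold only, not the trained model.

```python
import math
import random

def init_layer(n_in, n_out, rng):
    """Random weights and zero biases for one dense layer (untrained)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def dense(x, layer):
    """Affine transform: one output per row of the weight matrix."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]

def relu(values):
    return [max(0.0, v) for v in values]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_default_probability(x, layers):
    """Hidden layers use ReLU (assumed); the output is one sigmoid unit (assumed)."""
    h = x
    for layer in layers[:-1]:
        h = relu(dense(h, layer))
    return sigmoid(dense(h, layers[-1])[0])

rng = random.Random(0)
sizes = [133, 100, 100, 1]  # input, two hidden layers, assumed single output
layers = [init_layer(n_in, n_out, rng) for n_in, n_out in zip(sizes, sizes[1:])]

x = [rng.random() for _ in range(133)]  # one synthetic loan feature vector
p = predict_default_probability(x, layers)
label = "Default" if p > 0.5 else "Fully Paid"
```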

Performance Evaluation Metrics
  We use four performance metrics:
    Accuracy: overall classification accuracy
    True Positive Rate (TPR): classification accuracy on ‘Default’ loans
    True Negative Rate (TNR): classification accuracy on ‘Fully Paid’ loans
    AUC: computed as (TPR + TNR) / 2, which is the area under the ROC curve at a single threshold (also known as balanced accuracy)
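The four metrics can be computed directly from thresholded predictions. The sketch below uses a small made-up set of labels and probabilities (1 = Default, 0 = Fully Paid) and assumes both classes are present, so neither rate divides by zero.

```python
def evaluate(y_true, y_prob, threshold=0.5):
    """Accuracy, TPR, TNR and (TPR + TNR) / 2 for a binary classifier."""
    tp = fp = tn = fn = 0
    for truth, prob in zip(y_true, y_prob):
        predicted = 1 if prob > threshold else 0
        if truth == 1 and predicted == 1:
            tp += 1
        elif truth == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1
    tpr = tp / (tp + fn)  # accuracy on Default loans
    tnr = tn / (tn + fp)  # accuracy on Fully Paid loans
    return {
        "accuracy": (tp + tn) / len(y_true),
        "tpr": tpr,
        "tnr": tnr,
        "auc": (tpr + tnr) / 2,  # single-threshold AUC, as defined on the slide
    }

# Made-up example: 4 Default loans and 6 Fully Paid loans.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_prob = [0.9, 0.8, 0.7, 0.4, 0.1, 0.2, 0.3, 0.6, 0.2, 0.1]
metrics = evaluate(y_true, y_prob)
```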

Model performance
  Using the data from Feature Sets 1, 2 and 3 combined:
    Performance metric                           %
    Accuracy                                     98.3
    Correct ‘Default’ (True Positive Rate)       98.5
    Correct ‘Fully Paid’ (True Negative Rate)    98.2
    AUC                                          98.4

05. Machine Learning Interpretability
  Knowledge extraction
  TREPAN
  Distilling Soft Decision Tree
  LIME

Knowledge extraction
  Problems:
    Neural Networks: high performance, but black-box
    Decision Trees: easy to interpret, but lower performance
  Approach: combine Neural Networks and Decision Trees to create rules that are human-comprehensible
  Methods:
    Global: TREPAN, Distilling Soft Decision Tree
    Local: LIME

TREPAN
  Key features:
    The Neural Network serves as an oracle that returns class labels
    Constructs models of the underlying distribution of the data
    Tree expansion: best-first expansion to increase fidelity
    Splitting tests: m-of-n (a test passes when at least m of its n conditions hold)
    Stopping criteria:
      Global criteria: size of the tree, highest-fidelity tree
      Local criterion: when to stop expanding an individual node
  Key metrics:
    Accuracy
    Fidelity
    Comprehensibility
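Two of the ideas above lend themselves to a short sketch: an m-of-n splitting test, and fidelity measured as the share of instances on which the extracted tree agrees with the Neural Network. The loan fields and thresholds below are hypothetical and are not taken from the project's extracted tree.

```python
def m_of_n(m, conditions, x):
    """True when at least m of the n conditions hold for instance x."""
    return sum(1 for condition in conditions if condition(x)) >= m

def fidelity(tree_predictions, network_predictions):
    """Fraction of instances on which the tree agrees with the network (the oracle)."""
    agree = sum(1 for t, n in zip(tree_predictions, network_predictions) if t == n)
    return agree / len(network_predictions)

# Hypothetical 2-of-3 test on made-up loan features and thresholds.
conditions = [
    lambda loan: loan["credit_score"] < 620,
    lambda loan: loan["ltv"] > 90,
    lambda loan: loan["dti"] > 45,
]
loan = {"credit_score": 600, "ltv": 95, "dti": 30}
passes = m_of_n(2, conditions, loan)  # the first two conditions hold

fid = fidelity(
    ["Default", "Fully Paid", "Default", "Default"],     # tree output
    ["Default", "Fully Paid", "Fully Paid", "Default"],  # network output
)
```

Note that fidelity compares the tree with the network, while accuracy compares the tree with the true labels, so the two can differ.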

Distilling soft decision tree
  Key features:
    Mimics the input-output function of the Neural Network
    Soft targets: true labels and the predictions of the Neural Network
    Trained with mini-batch gradient descent
    Uses learned filters to make hierarchical decisions
    Selects a particular static probability distribution over classes as output
  Key metrics:
    Accuracy
    Comprehensibility: complexity of the tree
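A rough sketch of how a soft decision tree produces its output may help. Each inner node applies a learned filter (weights and a bias) followed by a sigmoid to get the probability of taking the right branch, and the prediction is the static class distribution stored at the leaf with the highest path probability. The depth-2 tree, its weights and its leaf distributions below are hand-picked for illustration and are not learned; training is not shown.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def right_prob(node, x):
    """Probability of taking the right branch at an inner node (filter + sigmoid)."""
    weights, bias = node
    return sigmoid(sum(w * v for w, v in zip(weights, x)) + bias)

def soft_tree_predict(x, inner, leaves):
    """Depth-2 tree. inner = [root, left child, right child];
    leaves = four class distributions, ordered left to right."""
    p_root = right_prob(inner[0], x)
    p_left = right_prob(inner[1], x)
    p_right = right_prob(inner[2], x)
    path_probs = [
        (1 - p_root) * (1 - p_left),
        (1 - p_root) * p_left,
        p_root * (1 - p_right),
        p_root * p_right,
    ]
    best = max(range(len(leaves)), key=lambda i: path_probs[i])
    return leaves[best], path_probs

# Hand-picked (not learned) parameters for a two-feature input.
inner = [([2.0, 0.0], 0.0), ([0.0, 1.0], 0.0), ([0.0, -3.0], 0.0)]
# Each leaf holds a static distribution over [Default, Fully Paid].
leaves = [[0.1, 0.9], [0.3, 0.7], [0.6, 0.4], [0.9, 0.1]]

distribution, path_probs = soft_tree_predict([1.0, -1.0], inner, leaves)
```

Because every input reaches every leaf with some probability, the path probabilities sum to 1, and the chosen leaf can be read as the tree's explanation for that input.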

LIME
  Key features:
    Creates a local linear model around the prediction
    Assigns weights to the different features in the dataset
    Computes the class probability
    Predicts the class with the highest probability
  Key metrics:
    Accuracy
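The local-linear-model idea can be sketched in plain Python: perturb the instance, query the black box, weight each sample by its proximity to the instance, and fit a weighted linear model whose coefficients act as feature weights. This is a simplified illustration in the spirit of LIME, not the API of the `lime` package. The `black_box` function is a stand-in for the Neural Network, and the sampling scale and kernel width are arbitrary assumptions.

```python
import math
import random

def black_box(x):
    """Hypothetical stand-in for the network: returns a probability of Default."""
    return 1.0 / (1.0 + math.exp(-(3.0 * x[0] - 1.0 * x[1])))

def solve(a, b):
    """Solve the linear system a * w = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for r in reversed(range(n)):
        w[r] = (m[r][n] - sum(m[r][c] * w[c] for c in range(r + 1, n))) / m[r][r]
    return w

def explain(instance, predict, n_samples=500, scale=0.5, kernel_width=1.0, seed=0):
    """Fit a proximity-weighted linear model around one instance."""
    rng = random.Random(seed)
    d = len(instance)
    xtx = [[0.0] * (d + 1) for _ in range(d + 1)]
    xty = [0.0] * (d + 1)
    for _ in range(n_samples):
        z = [v + rng.gauss(0.0, scale) for v in instance]  # perturbed sample
        dist2 = sum((a - b) ** 2 for a, b in zip(z, instance))
        weight = math.exp(-dist2 / kernel_width ** 2)  # closer samples count more
        row = [1.0] + z  # leading 1.0 is the intercept term
        y = predict(z)
        for i in range(d + 1):
            xty[i] += weight * row[i] * y
            for j in range(d + 1):
                xtx[i][j] += weight * row[i] * row[j]
    coefficients = solve(xtx, xty)  # weighted normal equations
    return coefficients[0], coefficients[1:]

intercept, weights = explain([0.0, 0.0], black_box)
```

In this toy setup the first feature pushes the prediction towards Default and the second pushes it away, so the fitted weights come out with opposite signs.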

06. Results
  TREPAN
  Distilling Soft Decision Tree
  LIME

TREPAN
  Uses 400 data points
  Conditions:
    Maximum number of nodes: 10
    Minimum sample: 100
  Model performance:
    Accuracy: 80%
    Fidelity: 88%

Distilling Soft Decision Tree
  Uses 400 data points
  Condition:
    Maximum tree depth: 10
  Accuracy: 95%

LIME
  Uses 400 data points
  Example: prediction for the loan of the 5th customer

07. Visualization

Visualization - Dashboard

08. Next Steps

Next steps
  Interpretability models:
    Use the entire dataset to validate all interpretability models
    Try to interpret other Machine Learning models, such as Random Forest and SVM
  Commercial products:
    Develop a front-end app that is easier to use for people with no data science background
    Make the tool work irrespective of the dataset or Python libraries used
    Suggest recommendations based on the results of the interpretability models