Chapter 1: Introduction to Predictive Modeling 1.1 Applications 1.2 Generalization 1.3 JMP Predictive Modeling Platforms.

Slides:



Advertisements
Similar presentations
Chapter 1 Introduction 1. The Evolution of Data Analysis To Support Business Intelligence 2 Evolutionary Step Business Question Enabling Technologies.
Advertisements

Polynomial Curve Fitting BITS C464/BITS F464 Navneet Goyal Department of Computer Science, BITS-Pilani, Pilani Campus, India.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
QMM 384 – Data Mining Data Mining: Introduction Introduction to Predictive Analytics.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Data Mining – Intro.
Part I: Classification and Bayesian Learning
Operations Research I Lecture 1-3 Chapter 1
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Microsoft Enterprise Consortium Data Mining Concepts Introduction to Directed Data Mining: Decision Trees Prepared by David Douglas, University of ArkansasHosted.
Beyond Opportunity; Enterprise Miner Ronalda Koster, Data Analyst.
Introduction to Directed Data Mining: Decision Trees
Analyzing Data: Bivariate Relationships Chapter 7.
Dr. Awad Khalil Computer Science Department AUC
Overview DM for Business Intelligence.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.
Overview of Data Mining Methods Data mining techniques What techniques do, examples, advantages & disadvantages.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Chapter 9 – Classification and Regression Trees
Chapter 7 Neural Networks in Data Mining Automatic Model Building (Machine Learning) Artificial Intelligence.
Zhangxi Lin ISQS Texas Tech University Note: Most slides are from Decision Tree Modeling by SAS Lecture Notes 5 Auxiliary Uses of Trees.
Knowledge Discovery and Data Mining Evgueni Smirnov.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
Applying Neural Networks Michael J. Watts
Logistic Regression Demo: dmdata2 and dmdata3 Bankloan Assignment: subscribe_training and subscribe_validate.
Business Intelligence and Decision Modeling Week 9 Customer Profiling Decision Trees (Part 2) CHAID CRT.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
Chapter 4: Introduction to Predictive Modeling: Regressions
MKT 700 Business Intelligence and Decision Models Algorithms and Customer Profiling (1)
Copyright © 2010 SAS Institute Inc. All rights reserved. Decision Trees Using SAS Sylvain Tremblay SAS Canada – Education SAS Halifax Regional User Group.
Overview of Methods Data mining techniques What techniques do, examples, advantages & disadvantages.
Customer Relationship Management (CRM) Chapter 4 Customer Portfolio Analysis Learning Objectives Why customer portfolio analysis is necessary for CRM implementation.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 5-1 Data Mining Methods: Classification Most frequently used DM method Employ supervised.
Data Mining and Decision Support
Overview of the Data Mining Process
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
Neural networks (2) Reminder Avoiding overfitting Deep neural network Brief summary of supervised learning methods.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Chapter 11 – Neural Nets © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
Oracle Advanced Analytics
What Is Cluster Analysis?
Chapter 1 Introduction.
Using JMP for the Case Competition
Introduction Machine Learning 14/02/2017.
Introduction to Data Science Lecture 7 Machine Learning Overview
Introduction To Statistics
Stock Market Prediction
Chapter 1 Introduction.
Vincent Granville, Ph.D. Co-Founder, DSC
Azure Machine Learning 101
Machine Learning & Data Science
Classification of Variables
Overview of Machine Learning
Introduction To Statistics
CSCI N317 Computation for Scientific Applications Unit Weka
Model generalization Brief summary of methods
Using JMP for the Case Competition
Machine Learning – a Probabilistic Perspective
Machine Learning in Business John C. Hull
Presentation transcript:

Chapter 1: Introduction to Predictive Modeling 1.1 Applications 1.2 Generalization 1.3 JMP Predictive Modeling Platforms

Chapter 1: Introduction to Predictive Modeling 1.1 Applications 1.2 Generalization 1.3 JMP Predictive Modeling Platforms

Objectives Describe common applications of predictive modeling in business, science, and engineering. Describe typical data that is available for predictive modeling. Define commonly used terms used in predictive modeling. 3

Predictive Modeling Applications 4 Database marketing Financial risk management Fraud detection Process monitoring Pattern detection Healthcare Informatics

The Data 5 ExperimentalOpportunistic Purpose Research Operational Value Scientific Commercial Generation Actively controlled Passively observed Size Small Massive Hygiene Clean Dirty State Static Dynamic

inputstarget Predictive Modeling Data 6 Training Data Training data case: categorical or numeric input and target measurements

Types of Targets Supervised Classification –Event/no event (binary target) –Class label (multiclass problem) Regression –Continuous outcome 7

Continuous Targets Healthcare Outcomes –Target = hospital length of stay, hospital cost Liquidity Management –Target = amount of money at an ATM machine or in a branch vault Process Volatility –Target = moving range of yields Sales –Target = dollar value of sales 8

Measurement Levels Three types in JMP Continuous Ordinal Nominal JMP automatically performs specific types of analyses based on the measurement level of the target. For example, linear regression versus logistic regression. In some platforms, ordinal and nominal variables inputs are handled differently. 9

Chapter 1: Introduction to Predictive Modeling 1.1 Applications 1.2 Generalization 1.3 JMP Predictive Modeling Platforms

Objectives Define generalization. Define honest assessment. Describe how honest assessment can be done in JMP. 11

The Scope of Generalization Model Selection and Comparison –Which model gives the best prediction? Decision/Allocation Rule –What actions should be taken on new cases? Deployment –How can the predictions be applied to new cases? 12

Model Complexity 13...

Model Complexity 14 Not complex enough...

Model Complexity 15 Too complex Not complex enough

Honest Assessment: Data Splitting 16

Data Partitioning 17 Training Data inputstarget...

Data Partitioning 18 Training DataValidation Data inputstargetinputstarget...

Data Partitioning 19 Training DataValidation Data Partition available data into training and validation sets. inputstargetinputstarget

Predictive Model Sequence 20 Create a sequence of models with increasing complexity. Model Complexity Training DataValidation Data inputstargetinputstarget

Model Performance Assessment 21 Validation Assessment Rate model performance using validation data. Training DataValidation Data inputstargetinputstarget Model Complexity

Model Selection 22 Model Complexity Validation Assessment Select the simplest model with the highest validation assessment. Training DataValidation Data inputstargetinputstarget

Chapter 1: Introduction to Predictive Modeling 1.1 Applications 1.2 Generalization 1.3 JMP Predictive Modeling Platforms

Objectives Show the platforms that will used in the class. 24

Accessing the Neural or Partition Platforms 25

Partition Platform Dialog 26

Neural Platform Dialog 27