Trees Lives Temp>30° Lives Dies Temp<=30° Temp<0° Temp>=0° Nodes Terminal or Leaf Nodes.

Slides:



Advertisements
Similar presentations
The Software Infrastructure for Electronic Commerce Databases and Data Mining Lecture 4: An Introduction To Data Mining (II) Johannes Gehrke
Advertisements

Decision Trees Decision tree representation ID3 learning algorithm
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
CART: Classification and Regression Trees Chris Franck LISA Short Course March 26, 2013.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
 Coefficient of Determination Section 4.3 Alan Craig
A Quick Overview By Munir Winkel. What do you know about: 1) decision trees 2) random forests? How could they be used?
Robert Plant != Richard Plant. Sample Data Response, covariates Predictors Remotely sensed Build Model Uncertainty Maps Covariates Direct or Remotely.
Chapter 7 – Classification and Regression Trees
Chapter 7 – Classification and Regression Trees
Krishna Thakur Hu Suk Lee Outline  Introduction  GIS questions?  Objectives  Materials and Methods  Results  Discussion  Conclusions.
Modeling Digital Remote Sensing Presented by Rob Snyder.
Sparse vs. Ensemble Approaches to Supervised Learning
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)
Sparse vs. Ensemble Approaches to Supervised Learning
Ensemble Learning (2), Tree and Forest
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 6 & 7 Linear Regression & Correlation
Model Building III – Remedial Measures KNNL – Chapter 11.
Chapter 9 – Classification and Regression Trees
Introduction to Linear Regression
1 COMP3503 Inductive Decision Trees with Daniel L. Silver Daniel L. Silver.
Why Model? Make predictions or forecasts where we don’t have data.
Business Intelligence and Decision Modeling Week 9 Customer Profiling Decision Trees (Part 2) CHAID CRT.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression Regression Trees.
MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Chapter1: Introduction Chapter2: Overview of Supervised Learning
1 Global Mapping, Approaches, Issues, and Accuracies Russ Congalton & Kamini Yadav University of New Hampshire January 16, 2014 Menlo Park, CA.
DECISION TREE Ge Song. Introduction ■ Decision Tree: is a supervised learning algorithm used for classification or regression. ■ Decision Tree Graph:
Konstantina Christakopoulou Liang Zeng Group G21
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
RECITATION 4 MAY 23 DPMM Splines with multiple predictors Classification and regression trees.
Environmental Modeling Weighting GIS Layers Weighting GIS Layers.
5 a day – Water in California. 1.What part of the state gets the most precipitation? 2.How do we move water around the state? 3.Name at least 3 ways we.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A support system for predicting eBay end prices Presenter.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Random Forests Feb., 2016 Roger Bohn Big Data Analytics 1.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Multiple Regression.
Why Model? Make predictions or forecasts where we don’t have data.
Robert Plant != Richard Plant
Introduction to Machine Learning and Tree Based Methods
More General Need different response curves for each predictor
How Good is a Model? How much information does AIC give us?
Trees Nodes Is Temp>30? False True Temp<=30° Temp>30°
The Least-Squares Regression Line
ECE 471/571 – Lecture 12 Decision Tree.
Direct or Remotely sensed
Multiple Regression.
MIS2502: Data Analytics Classification using Decision Trees
Ninja Trader: Introduction to data mining in financial applications
The Science of Predicting Outcome
Introduction to Predictive Modeling
Multivariable Logistic Regression Split Cohort into Development &
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross-validation of a two-stage generalized.
Statistical Learning Dong Liu Dept. EEIS, USTC.
CART on TOC CART for TOC R 2 = 0.83
Regression and Categorical Predictors
The BRT was made with over 5,000 trees!
MIS2502: Data Analytics Classification Using Decision Trees
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
STT : Intro. to Statistical Learning
Presentation transcript:

Trees Lives Temp>30° Lives Dies Temp<=30° Temp<0° Temp>=0° Nodes Terminal or Leaf Nodes

Trees Classification Trees –Predicted outcome is a class (cover type) Regression Trees –Predicted outcome is a value (percent) Boosted Trees –Combines classification and regression trees Random Forests –Combines many trees to improve fit

Classification Trees Reflectance < 0.1 Water Yes Reflectance > 0.9 Snow or Cloud Yes Ground No

Classification Tree Snow or Ice Ground Water Reflectance

Regression Trees Precipitation < 0.5 Suitability=0.0 Yes Precipitation > 0.9 Suitability=0.5 Yes Suitability=0.0 No Precipitation < 0.1 Suitability=0.3 YesNo

Regression Trees Suitability

Trees Classification and Regression Trees Predictors can be continuous or categorical Easy to interpret and understand Robust Easy to validate Statistical methods well understood Can still make really complex trees that over fit the data!

Regression Trees in GIS Geospatial and regression tree analysis to map groundwater depth for manual well drilling suitability in the Zinder region of Niger

CA Housing Prices

Building Trees Goals: –Find the tree with the least number of “nodes” (branches) that best represents the phenomenon Approach: –Minimize the “deviance” that the samples have from the model

R squared

Regression Trees in GIS Geospatial and regression tree analysis to map groundwater depth for manual well drilling suitability in the Zinder region of Niger Length of branch indicates amount of deviance explained

Regression Trees Analysis of Object Oriented Software, Science Direct

Additional Resources An Introduction to Categorical Data Analysis –By ALAN AGRESTI –Page 85 R Documentation: – project.org/web/packages/tree/tree.pdf