More General Need different response curves for each predictor

Slides:



Advertisements
Similar presentations
What Could We Do better? Alternative Statistical Methods Jim Crooks and Xingye Qiao.
Advertisements

SPM – introduction & orientation introduction to the SPM software and resources introduction to the SPM software and resources.
Overview of SPM p <0.05 Statistical parametric map (SPM)
Chapter 4: Basic Estimation Techniques
A Tale of Two GAMs Generalized additive models as a tool for data exploration Mariah Silkey, Actelion Pharmacueticals Ltd. 1.
Additive Models, Trees, etc. Based in part on Chapter 9 of Hastie, Tibshirani, and Friedman David Madigan.
Regression “A new perspective on freedom” TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A A A A AAA A A.
Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0) ETHEM ALPAYDIN © The MIT Press, 2010
Generalized Additive Models Keith D. Holler September 19, 2005 Keith D. Holler September 19, 2005.
Logistic Regression Example: Horseshoe Crab Data
More General Need different response curves for each predictor Need more complex responses.
Best Model Dylan Loudon. Linear Regression Results Erin Alvey.
Robert Plant != Richard Plant. Sample Data Response, covariates Predictors Remotely sensed Build Model Uncertainty Maps Covariates Direct or Remotely.
Model Assessment, Selection and Averaging
Basis Expansion and Regularization Presenter: Hongliang Fei Brian Quanz Brian Quanz Date: July 03, 2008.
Vector Generalized Additive Models and applications to extreme value analysis Olivier Mestre (1,2) (1) Météo-France, Ecole Nationale de la Météorologie,
Kernel methods - overview
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Basis Expansions and Regularization Based on Chapter 5 of Hastie, Tibshirani and Friedman.
1 An Introduction to Nonparametric Regression Ning Li March 15 th, 2004 Biostatistics 277.
Prelude of Machine Learning 202 Statistical Data Analysis in the Computer Age (1991) Bradely Efron and Robert Tibshirani.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross- validation of a two- stage generalized.
Data Mining Volinsky - Columbia University 1 Chapter 4.2 Regression Topics Credits Hastie, Tibshirani, Friedman Chapter 3 Padhraic Smyth Lecture.
Regression. Population Covariance and Correlation.
Trees Lives Temp>30° Lives Dies Temp
STANDARDIZATION OF CPUE FROM ALEUTIAN ISLANDS GOLDEN KING CRAB FISHERY OBSERVER DATA M.S.M. Siddeek 1, J. Zheng 1, Doug Pengilly 2, and Gretchen Bishop.
Machine Learning with Discriminative Methods Lecture 00 – Introduction CS Spring 2015 Alex Berg.
Discrepancy between Data and Fit. Introduction What is Deviance? Deviance for Binary Responses and Proportions Deviance as measure of the goodness of.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
Gaussian Process and Prediction. (C) 2001 SNU CSE Artificial Intelligence Lab (SCAI)2 Outline Gaussian Process and Bayesian Regression  Bayesian regression.
Linear regression models. Purposes: To describe the linear relationship between two continuous variables, the response variable (y- axis) and a single.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
CHAPTER 8: Nonparametric Methods Alpaydin transparencies significantly modified, extended and changed by Ch. Eick Last updated: March 4, 2011.
Chapter 9 Day 2. Warm-up  If students picked numbers completely at random from the numbers 1 to 20, the proportion of times that the number 7 would be.
1 C.A.L. Bailer-Jones. Machine Learning. Model selection and combination Machine learning, pattern recognition and statistical data modelling Lecture 10.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Who will you trust? Field technicians? Software programmers?
Summary of t-Test for Testing a Single Population Mean (m)
Chapter 4: Basic Estimation Techniques
PREDICT 422: Practical Machine Learning
Confidence Intervals and Sample Size
Chapter 4 Basic Estimation Techniques
Chapter 4.2 Regression Topics
Deep Feedforward Networks
Chapter 12 Simple Linear Regression and Correlation
More General Need different response curves for each predictor
Basic Estimation Techniques
How Good is a Model? How much information does AIC give us?
Project 4: Facial Image Analysis with Support Vector Machines
Statistics in SPSS Lecture 7
Machine learning, pattern recognition and statistical data modelling
Machine learning, pattern recognition and statistical data modelling
Basic Estimation Techniques
The normal distribution
Chapter 12 Simple Linear Regression and Correlation
Regression Model Building
Introduction to Predictive Modeling
Explained and unexplained variance
Topic 1: Statistical Analysis
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross-validation of a two-stage generalized.
Bias-variance Trade-off
ESTIMATION OF THE MEAN AND PROPORTION
Model generalization Brief summary of methods
Parametric Methods Berlin Chen, 2005 References:
Linear Regression and Correlation
The BRT was made with over 5,000 trees!
Generalized Linear Models
Generalized Additive Model
Presentation transcript:

More General Need different response curves for each predictor Need more complex responses

Generalized Additive Models 𝑔 𝑓 𝑥 𝑖 = 𝛽 0 +𝑓 1 𝑥 1𝑖 + 𝑓 2𝑖 𝑥 2𝑖 +… Adds functions to linearize each predictor variable 𝐸 𝑌 𝑖 = 𝑔 −1 ( 𝑓 1 𝑥 1𝑖 + 𝑓 2𝑖 𝑥 2𝑖 +…) Functions can be parametric or non-parametric: Including splines Makes GAMS: Very general Prone to over-fitting

Spline Curves 𝑓 𝑥 = 1 4 (𝑥+2) 3 −2≤𝑥≤−1 1 4 3 𝑥 3 −6 𝑥 2 +4 −1≤𝑥≤1 1 4 2−𝑥 3 1≤𝑥≤2 Knots Bell-shaped Irwin-Hall spline

Spline Curves in R Wrap predictors in a spline function: s(predictor) Use “gamma” parameter to set the number of knots Controls over-fitting 1.4 is recommended In R: TheModel=gam(Height~s(AnnualPrecip), data=TheData,gamma=1.4)

Reading When you have time: For our next meeting (on web site): “The Elements of Statistical Learning” by Friedman Generalized Additive Models by Hastie and Tibshirani For our next meeting (on web site): Read Martinez-Rincon (wahoo) Jensen (crabs)

Which Approach? GAM Kernel Smoother Age Income Age Income Z-axis shows the proportion of families with a telephone at home Hastie and Tibshirani 1986, Generalized Additive Models

GAM Plots in R “Partial” = 1 Covariate Modeled Response Curve 95% CI Sample point “Grass” FIA Doug-Fir height data vs. BioClim Annual Precipitation

Brown Shrimp in GOM Data from SeaMap and NOAA SeaMap Data, brown shrimp prefer muddy bottoms. Also, they spawn in shallow waters and then migrate to deeper water as they mature. The reason the density goes down as the depth goes to 0 is that the size of the net allows the smaller shrimp to escape. Data from SeaMap and NOAA

Gamma=1.4 Explained Deviance: 59%, AIC=57807 Data from FIA and BioClim Models for Doug-Fir in California from FIA data Explained Deviance: 59%, AIC=57807 Data from FIA and BioClim

Gamma=10 Explained Deviance: 59%, AIC=57961 Data from FIA and BioClim

Gamma=20 Explained Deviance: 57%, AIC=58081 Data from FIA and BioClim

Gamma=20 Explained Deviance: 51%, AIC=58796 Data from FIA and BioClim

Gamma=0.1 Explained Deviance: 59%, AIC=57811 Data from FIA and BioClim

GAM Model Runs Layers Gamma Explained Deviance AIC All 6 1.4 59 57807 10 58 57961 20 57 58081 Best 3 51 58796 0.1 57811

Best Model? Best 3 predictors, gamma=20 Data from FIA and BioClim

Gamma in GAMs 𝑛 = number of training points 𝑥 = degrees of freedom 𝑛 – number of estimated parameters gam() chooses smoothing parameters to minimize: Note: The reason the effect of gamma reverses itself at large values is that 𝑔𝑎𝑚𝑎 ∗𝑥 becomes larger than 𝑛 ( 𝑦 − 𝑦 𝑖 ) 2 (𝑛−𝑔𝑎𝑚𝑎 ∗𝑥) 2

Additional Resources Generalized Additive Models: an introduction with R Copyrighted book Includes: Linear models GLMs GAMs Examples in R Some matrix algebra

Additional Resources Geospatial Analysis with GAMs: http://www.casact.org/education/annual/2011/handouts/C3-Guszcza.pdf Disease mapping using GAMs (workshop): http://www.cireeh.org/pmwiki.php/Main/Gam-mapWorkshop Mapping population based studies: http://www.ij-healthgeographics.com/content/5/1/26