Maxent interface.

Slides:



Advertisements
Similar presentations
Introduction to GRCP Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Advertisements

Random Forest Predrag Radenković 3237/10
Linear Regression.
Logistic Regression Psy 524 Ainsworth.
Modeling species distribution using species-environment relationships Istituto di Ecologia Applicata Via L.Spallanzani, Rome ITALY
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
Maximum Entropy spatial modeling with imperfect data.
Environmental GIS Nicholas A. Procopio, Ph.D, GISP Some slides from Lyna Wiggins (Rutgers University)
Climate Predictability Tool (CPT)
Robert Plant != Richard Plant. Sample Data Response, covariates Predictors Remotely sensed Build Model Uncertainty Maps Covariates Direct or Remotely.
Latent Growth Curve Modeling In Mplus:
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
Today Linear Regression Logistic Regression Bayesians v. Frequentists
Neural Networks. R & G Chapter Feed-Forward Neural Networks otherwise known as The Multi-layer Perceptron or The Back-Propagation Neural Network.
Ensemble Learning: An Introduction
Project  Now it is time to think about the project  It is a team work Each team will consist of 2 people  It is better to consider a project of your.
Modeling Species Distribution with MaxEnt
Maxent Implements “Maximum Entropy” modeling –Entropy = randomness –Maximizes randomness by removing patterns –The pattern is the response Website with.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Introduction to ArcGIS for Environmental Scientists Module 2 – GIS Fundamentals Lecture 5 – Coordinate Systems and Map Projections.
Radial Basis Function Networks
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
ESRM 250 & CFR 520: Introduction to GIS © Phil Hurvitz, KEEP THIS TEXT BOX this slide includes some ESRI fonts. when you save this presentation,
FW364 Ecological Problem Solving Lab 4: Blue Whale Population Variation [Ramas Lab]
Climate Predictability Tool (CPT) Ousmane Ndiaye and Simon J. Mason International Research Institute for Climate and Society The Earth.
Using ESRI ArcGIS 9.3 Spatial Analyst
StAR web server tutorial for ROC Analysis. ROC Analysis ROC Analysis: This module allows the user to input data for several classifiers to be tested.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Designing Interface Components. Components Navigation components - the user uses these components to give instructions. Input – Components that are used.
Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.
Intro to Raster GIS GTECH361 Lecture 11. CELL ROW COLUMN.
Using Climatic data in Diva GIS Franck Theeten, Royal Museum for central Africa Cabin training 2013.
Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
Uncertainty How “certain” of the data are we? How much “error” does it contain? How well does the model match reality? Goal: –Understand and document uncertainties.
Design and Implementation of a Dynamic Data MLP to Predict Motion Picture Revenue David A. Gerasimow.
Machine Learning CUNY Graduate Center Lecture 4: Logistic Regression.
Aquarius Level-3 Binning and Mapping Fred Patt. Definitions Projection - any process which transforms a spatially organized data set from one coordinate.
Climate Predictability Tool (CPT) Ousmane Ndiaye and Simon J. Mason International Research Institute for Climate and Society The Earth.
Maximum Entropy Models and Feature Engineering CSCI-GA.2590 – Lecture 6B Ralph Grishman NYU.
ArcGIS: ArcCatalog. Agenda The ArcCatalog Interface General ArcCatalog ▫Navigating directories ▫Preview data ▫View metadata Advanced ArcCatalog ▫Modifying.
1 Overview Importing data from generic raster files Creating surfaces from point samples Mapping contours Calculating summary attributes for polygon features.
Radiometric Normalization Spring 2009 Ben-Gurion University of the Negev.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
About GGum Zhao Shouying. General on IRT Item response theory models have become increasingly popular measurement tools in the past thirty-five years.
Postgraduate Computing Lectures PAW 1 PAW: Physicist Analysis Workstation What is PAW? –A tool to display and manipulate data. Learning PAW –See ref. in.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Giansalvo EXIN Cirrincione unit #4 Single-layer networks They directly compute linear discriminant functions using the TS without need of determining.
Nonparametric Statistics
Downloading the MAXENT Software
Use of Maxent for predictive habitat mapping of CWC in the Bari canyon Bargain Annaëlle Foglini Federica, Bonaldo Davide, Pairaud Ivane & Fabri Marie-Claire.
Chapter 11 – Neural Nets © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
GML in CDI and CSR ISO using Ends&Bends
BINARY LOGISTIC REGRESSION
HGEN Thanks to Fruhling Rijsdijk
Maximum Entropy Models and Feature Engineering CSCI-GA.2591
Probability Theory and Parameter Estimation I
An Interactive Tutorial for SPSS 10.0 for Windows©
Just the basics: Learning about the essential steps to do some simple things in SPSS Larkin Lamarche.
How Good is a Model? How much information does AIC give us?
Lab1 Instruction Georeferencing a raster
Modeling Species Distribution with MaxEnt Bryce Maxell, Acting Director, Montana Natural Heritage Program & Scott Story, Nongame Data Manager, Montana.
Random walk initialization for training very deep feedforward networks
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
ESRM 250/CFR 520 Autumn 2009 Phil Hurvitz
More on Maxent Env. Variable importance:
Presentation transcript:

Maxent interface

Maximum Entropy (Maxent) Deterministic Precise mathematical definition Continuous and categorical environmental data Continuous output

Maxent can be downloaded at: http://www. cs. princeton For this slide ask them to start Maxent to follow the presentation more easily Note: when downloading Maxent, make sure that maxent.jar is saved as is, and not as a .zip file

Input data: Samples: a .csv file with 3 fields (species label, longitude, latitude) and a header as first line. Can have multiple species in a single file Environmental layers*: ASCII files (ESRI or DIVA-GIS formats) grouped in a folder. No mask file is needed * also possible to use SWD format (sample-with-data): a .csv file containing the environmental variables’ values for each occurrence point

2.Quadratic → square of variable 3.Product → product of two variables Classes of features: 1.Linear* → variable itself 2.Quadratic → square of variable 3.Product → product of two variables 4.Threshold → binary transformation (0, 1) of a continuous variable using a threshold 5.Hinge → like a linear feature, but constant below a threshold * Categorical data: Binary feature → variable itself

What are the features used for? to constrain the probability distribution of maximum entropy (most spread out) which determines a species probability distribution (output prediction) Constraints: Linear* → mean Quadratic → variance Product → covariance Threshold → fit an arbitrary response Hinge → like linear (but constant below a threshold) * Categorical data: Binary feature → proportion

Auto features setting optimizes the use of a set of features based on the number of presence records for the species Linear features if <10 presence points available Linear + quadratic if 10-14 presence points available Linear + quadratic + hinge if 15-79 presence points available All features if >80 presence points available In order to override this default setting, it is necessary to use the command line flags described in help menu. However, the beta regularization value has to be adjusted too.

Outputs: SpeciesName.html contains response curves, pictures of predictions and jackknife to measure variable importance if chosen prediction can be saved as cumulative, logistic, or raw Output file types available: ASCII grid and DIVA-GIS grid (.mxe is not a grid output) model can be projected on different climatic datasets (different geographic region or different period of time

Examples of response curves (how each environmental variable affects Maxent model) Picture of Maxent prediction

Used to be the default type New default type Raw values (very small) Cumulative output Logistic output Raw output Used to be the default type New default type Raw values (very small) Each value is the sum of Non-linear scale up Sum over all cells used probabilities of cells < the of raw values for training is 1 cell grid, times 100 General notes: Thresholding (binning) can change the look of the map significantly Care in interpreting the thresholds (e.g. a cumulative value of 80% doesn’t mean that the probability of a species’ occurrence is 80%) Grids have floating points values, thus they should be imported as floating point grids this in GIS software in order to preserve the fine details in classifying cells as suitable

Lastly... (more) Settings button: opens a new window with more settings Random test/train partition of occurrence data for each run; same for background data % of occurrence data randomly set aside as test points (default is 0) modifies the regularization value (higher value gives a more spread out distribution); works only if auto features option is off. Sampling is assumed to be biased according to sampling distribution Occurrence data from a file (rather than a random sample of training data) is used to test AUC, omission, etc

To summarize in a few words... To run Maxent: Occurrence data in a .csv file Training environmental dataset (no mask needed) containing ASCII grids Optional: environmental dataset for projecting models Maxent outputs (predictions): ASCII grids, floating point (not integers) Can be raw, logistic, or cumulative predictions Additional files, including an .html summary file