Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Slides:



Advertisements
Similar presentations
Brief introduction on Logistic Regression
Advertisements

AP Statistics.  Least Squares regression is a way of finding a line that summarizes the relationship between two variables.
Spatial Dependency Modeling Using Spatial Auto-Regression Mete Celik 1,3, Baris M. Kazar 4, Shashi Shekhar 1,3, Daniel Boley 1, David J. Lilja 1,2 1 CSE.
Pattern Recognition and Machine Learning: Kernel Methods.
TNO orbit computation: analysing the observed population Jenni Virtanen Observatory, University of Helsinki Workshop on Transneptunian objects - Dynamical.
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
The General Linear Model. The Simple Linear Model Linear Regression.
Bayesian statistics – MCMC techniques
BAYESIAN INFERENCE Sampling techniques
A PARALLEL FORMULATION OF THE SPATIAL AUTO-REGRESSION MODEL FOR MINING LARGE GEO-SPATIAL DATASETS HPDM 2004 Workshop at SIAM Data Mining Conference Barış.
C.T. LuSpatial Data Mining1 Spatial Data Mining: Three Case Studies Presented by: Chang-Tien Lu Spatial Database Lab Department of Computer Science University.
Spatial Data Mining: Teleconnections Shashi Shekhar Mcknight Distinguished University Professor, U of Minnesota only in old plan Only in new plan In both.
Introduction to Spatial Data Mining
Linear Models Tony Dodd January 2007An Overview of State-of-the-Art Data Modelling Overview Linear models. Parameter estimation. Linear in the.
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
Introduction to Spatial Data Mining 강홍구 데이타베이스연구실 CHAPTER 7.
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
Spatial Data Mining: Three Case Studies For additional details Shashi Shekhar, University of Minnesota Presented.
Panelist: Shashi Shekhar McKnight Distinguished Uninversity Professor University of Minnesota Cyber-Infrastructure (CI) Panel,
1 Ensembles of Nearest Neighbor Forecasts Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside Dennis DeCoste.
Amos Storkey, School of Informatics. Density Traversal Clustering and Generative Kernels a generative framework for spectral clustering Amos Storkey, Tom.
Prof. Ram and I By Shashi Shekhar (not Sushi Shekhar!) May 6 th, 2006.
Bayesian Learning Rong Jin.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Lecture 3 Properties of Summary Statistics: Sampling Distribution.
Robin McDougall, Ed Waller and Scott Nokleby Faculties of Engineering & Applied Science and Energy Systems & Nuclear Science 1.
The Paradigm of Econometrics Based on Greene’s Note 1.
Part 1: Introduction 1-1/22 Econometrics I Professor William Greene Stern School of Business Department of Economics.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
WSEAS AIKED, Cambridge, Feature Importance in Bayesian Assessment of Newborn Brain Maturity from EEG Livia Jakaite, Vitaly Schetinin and Carsten.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Statistical Sampling-Based Parametric Analysis of Power Grids Dr. Peng Li Presented by Xueqian Zhao EE5970 Seminar.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
CS Statistical Machine learning Lecture 18 Yuan (Alan) Qi Purdue CS Oct
Markov Random Fields Probabilistic Models for Images
Loan Default Model Saed Sayad 1www.ismartsoft.com.
Spatial Data Mining Satoru Hozumi CS 157B. Learning Objectives Understand the concept of Spatial Data Mining Understand the concept of Spatial Data Mining.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Accurate Robot Positioning using Corrective Learning Ram Subramanian ECE 539 Course Project Fall 2003.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
EED 401: ECONOMETRICS COURSE OUTLINE
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Machine Learning in CSC 196K
Bayesian Modelling Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Introduction to emulators Tony O’Hagan University of Sheffield.
Proposed Courses. Important Notes State-of-the-art challenges in TV Broadcasting o New technologies in TV o Multi-view broadcasting o HDR imaging.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Biointelligence Laboratory, Seoul National University
Machine Learning and Data Mining Clustering
Stat 223 Introduction to the Theory of Statistics
Limited Dependent Variables
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Ch3: Model Building through Regression
Accurate Robot Positioning using Corrective Learning
The Least-Squares Regression Line
Location Prediction and Spatial Data Mining (S. Shekhar)
POINT ESTIMATOR OF PARAMETERS
Shashi Shekhar Weili Wu Sanjay Chawla Ranga Raju Vatsavai
Chengyaun yin School of Mathematics SHUFE
Stat 223 Introduction to the Theory of Statistics
Outline Texture modeling - continued Julesz ensemble.
Machine Learning and Data Mining Clustering
Spatial Data Mining: Three Case Studies
Outline Texture modeling - continued Markov Random Field models
Presentation transcript:

Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS, Univ. of Minnesota) Uygar Ozesmi(Ericyes University, Turkey)

Outline Motivation Application Domain Distinguishing characteristics of spatial data mining Problem Definition Spatial Statistics Approach Our approach: PLUMS Experiments, Results, Conclusion and Future Work

Motivation Historical Examples of Spatial Data Exploration –Asiatic Cholera, 1855 –Theory of Gondwanaland –Effect of fluoride on Dental Hygiene A potential application in news –Tracking the West Nile Virus

Application Domain Wetland Management: Predicting locations of bird(red-winged blackbird) nests in wetlands Why we choose this application ? –Strong spatial component –Domain Expertise –Classical Data Mining techniques(logistic regression, neural nets) had already been applied

Application Domain: Continued.. Nest Locations Distance to open water Vegetation DurabilityWater Depth

Unique characteristics of spatial data mining Spatial Autocorrelation Property

Unique characteristics…cont Average Distance to Nearest Prediction(ADNP):

Location Prediction:Problem Formulation Given: A spatial framework S. – Explanatory functions, – Dependent function F –A family F of learning model function mappings Find an element Objective: maximize (map_similarity = classification_accuracy + spatial accuracy) Constraints: spatial autocorrelation exists

Spatial Statistics Approach ”Logistic Regression:

Spatial Stat: Solution Techniques Least Square Estimation: Biased and Inconsistent Maximum Likelihood: Involve computation of large determinant(from W) Bayesian: Monte Carlo Markov Chain(e.g. Gibbs Sampling)

Our Approach

Experiment Setup

Result(1)

Result(2)

Conclusion and Future work PLUMS >> Classical Data Mining techniques PLUMS State-of-the-art Spatial Statistics approaches Better performance(two orders of magnitude) Try other configurations of the PLUMS framework and formalize!