Mining for Spatial Patterns
Shashi Shekhar, Department of Computer Science, University of Minnesota

Presentation transcript:

Mining for Spatial Patterns
Shashi Shekhar, Department of Computer Science, University of Minnesota
Collaborators:
  U. of Minnesota: V. Kumar, G. Karypis, C.T. Lu, W. Wu, Y. Huang, V. Raju, P. Zhang, P. Tan, M. Steinbach
  NASA Ames Research Center: C. Potter
  California State University, Monterey Bay: S. Klooster
This work was partially funded by NASA and the Army High Performance Computing Center.

Background
NSF workshop on GIS and DM (3/99)
Spatial data: traffic, bird habitats, global climate, logistics, ...
Spatial patterns: outliers, location prediction, associations, sequential associations, clustering, trends, ...

Framework
Problem statement: capture special needs
Data exploration: maps, new methods
Try reusing classical methods from data mining and spatial statistics
If reuse is not possible, invent new methods
Validation, performance tuning

Research Goals
Modeling of ecological data: event modeling, zone modeling.
Finding spatio-temporal patterns: associations, predictive models.
A key interest is finding connections between the ocean and the land.

Sources of Earth Science Data
Before 1950: very sparse, unreliable data.
Since 1950: reliable global data.
  Ocean temperature and pressure are based on data from ships.
  Most land data (solar, precipitation, temperature and pressure) come from weather stations.
Since 1981: data has been available from Earth-orbiting satellites.
  FPAR, a measure related to plant ...
Since 1999: TERRA, the flagship of the NASA Earth Observing System, has been providing much more detailed data.

Example Pattern: Teleconnections
Teleconnections are the simultaneous variation in climate and related processes over widely separated points on the Earth.
For example, El Nino is the anomalous warming of the eastern tropical region of the Pacific, and has been linked to various climate phenomena:
  Droughts in Australia and Southern Africa
  Heavy rainfall along the western coast of South America
  Milder winters in the Midwest

Net Primary Production (NPP)
Net Primary Production (NPP) is the net assimilation of atmospheric carbon dioxide (CO2) into organic matter by plants.
NPP is driven by solar radiation and can be constrained by precipitation and temperature.
NPP is a key variable for understanding the global carbon cycle and ecological dynamics of the Earth.
Keeping track of NPP is important because it includes the food source of humans and all other organisms.
Sudden changes in the NPP of a region can have a direct impact on the regional ecology.
An ecosystem model for predicting NPP, CASA (the Carnegie Ames Stanford Approach), provides a detailed view of terrestrial productivity.

Benefits of Data Mining
Data mining provides earth scientists with tools that allow them to spend more time choosing and exploring interesting families of hypotheses.
However, statistics is needed to provide methods for determining the "statistical" significance of results.
By applying the proposed data mining techniques, some of the steps of hypothesis generation and evaluation will be automated, facilitated and improved.
Association rules provide a "new" framework for detecting relationships between events.

Approaches

Clustering
Interested in relationships between regions, not "points."
For land, clustering is based on NPP or other variables, e.g., precipitation, temperature.
For ocean, clustering is based on SST (Sea Surface Temperature).
When "raw" NPP and SST are used, clustering can find seasonal patterns.
Anomalous regions have plant growth patterns that are reversed from those typically observed in the hemisphere in which they reside, and are easy to spot.

Clustering
SNN clusters of SST that are highly correlated with El Nino indices.
[Figure: El Nino regions]

Spatial Association Rule
Citation: Symp. on Spatial Databases 2001
Problem: Given a set of Boolean spatial features, find subsets of co-located features, e.g. (fire, drought, vegetation)
Data: continuous space, partition not natural, no reference feature
Classical data mining approach: association rules
  But, Look Ma! No transactions! No support measure!
Approach: work with continuous data without transactionizing it
  confidence = Pr[fire at s | drought in N(s) and vegetation in N(s)]
  support: cardinality of the spatial join of instances of fire, drought, dry veg.
  participation: min. fraction of instances of a feature in the join result
  new algorithm using spatial joins and apriori_gen filters

Event Definition
Convert the time series into a sequence of events at each spatial location.
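The slide does not say how events are derived from a time series. As a minimal sketch only, the following assumes one common choice: a time step becomes a "Hi" (or "Lo") event when its value lies more than `z_thresh` standard deviations above (or below) the series mean. The function name `to_events`, the threshold of 1.5, and the sample FPAR values are all hypothetical.

```python
from statistics import mean, pstdev

def to_events(series, name, z_thresh=1.5):
    """Turn one location's time series into (time_index, event_label) pairs.

    Assumption (not from the slides): an event is a value whose z-score
    against the whole series exceeds z_thresh in absolute value.
    """
    mu = mean(series)
    sigma = pstdev(series)
    events = []
    if sigma == 0:
        return events  # a constant series produces no events
    for t, value in enumerate(series):
        z = (value - mu) / sigma
        if z > z_thresh:
            events.append((t, f"{name}-Hi"))
        elif z < -z_thresh:
            events.append((t, f"{name}-Lo"))
    return events

# Hypothetical FPAR values for a single grid cell.
fpar = [0.50, 0.52, 0.49, 0.51, 0.90, 0.50, 0.48, 0.10, 0.51, 0.50]
events = to_events(fpar, "FPAR")
print(events)
```

Running the same function at every grid cell would give the per-location event sequences that the association rules on the following slides operate on.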

Interesting Association Patterns
Use domain knowledge to eliminate uninteresting patterns.
A pattern is less interesting if it occurs at random locations.
Approach:
  Partition the land area into distinct groups (e.g., based on land-cover type).
  For each pattern, find the regions for which the pattern can be applied.
  If the pattern occurs mostly in a certain group of land areas, then it is potentially interesting.
  If the pattern occurs frequently in all groups of land areas, then it is less interesting.

Association Rules
Intra-zone non-sequential patterns
Shrubland regions: FPAR-Hi → NPP-Hi (support ≥ 10)
The region corresponds to semi-arid grasslands, a type of vegetation that is able to take advantage of high precipitation more quickly than forests.
Hypothesis: FPAR-Hi events could be related to unusual precipitation conditions.

Co-location
Can you find co-location patterns from the following sample dataset?
[Figure: sample dataset with the answer patterns marked]

Co-location
Can you find co-location patterns from the following sample dataset?
[Figure: sample dataset]

Spatial Co-location
A set of features frequently co-located
Given:
  A set $T$ of $K$ Boolean spatial feature types, $T = \{f_1, f_2, \ldots, f_K\}$
  A set $P$ of $N$ locations, $P = \{p_1, \ldots, p_N\}$, in a spatial framework $S$; each $p_i \in P$ is of some spatial feature in $T$
  A neighbor relation $R$ over locations in $S$
Find: $T_c$, the subsets of $T$ that are frequently co-located
Objective: correctness, completeness, efficiency
Constraints:
  $R$ is symmetric and reflexive
  Monotonic prevalence measure
[Figure panels: Reference Feature Centric, Window Centric, Event Centric]

Co-location: Participation Index
1. Participation ratio $pr(f_i, c)$ of feature $f_i$ in co-location $c = \{f_1, f_2, \ldots, f_k\}$: fraction of instances of $f_i$ with features $\{f_1, \ldots, f_{i-1}, f_{i+1}, \ldots, f_k\}$ nearby
2. Participation index $= \min_i \{pr(f_i, c)\}$
Algorithm: Hybrid Co-location Miner
Comparison with association rules:

                                  Association rules       Co-location rules
  underlying space                discrete sets           continuous space
  item-types                      item-types              events / Boolean spatial features
  collections                     transactions            neighborhoods
  prevalence measure              support                 participation index
  conditional probability measure Pr[A in T | B in T]     Pr[A in N(L) | B at L]
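Written out as formulas, the two measures above read as follows. The notation $\mathrm{inst}(f_i)$ (all instances of feature $f_i$) and $\mathrm{rows}(c)$ (the neighbor-connected instance tuples of co-location $c$) is introduced here for illustration and does not appear on the slide.

```latex
\[
pr(f_i, c) =
  \frac{\left|\{\, o \in \mathrm{inst}(f_i) : o \text{ appears in some tuple of } \mathrm{rows}(c) \,\}\right|}
       {\left|\mathrm{inst}(f_i)\right|},
\qquad
PI(c) = \min_{f_i \in c} \; pr(f_i, c)
\]
```

Adding a feature to $c$ can only shrink the set of participating instances, so $PI$ never increases as $c$ grows. This is the monotonic prevalence property that apriori-style candidate pruning relies on.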

Spatial Co-location Patterns
Spatial features A, B, C and their instances
Possible associations are (A, B), (B, C), etc.
Neighbor relationship includes the following pairs: (A1, B1), (A2, B1), (A2, B2), (B1, C1), (B2, C2)
[Figure: dataset]

Spatial Co-location Patterns
Spatial features A, B, C and their instances
Partition approach [Yasuhiko, KDD 2001]
[Figure: dataset under two different partitionings]
  First partitioning: Support(A,B) = 2, Support(B,C) = 2
  Second partitioning: Support(A,B) = 1, Support(B,C) = 2
Support is not well defined, i.e. it is not independent of the execution trace
Has a fast heuristic which is hard to analyze for correctness/completeness

Spatial Co-location Patterns
Spatial features A, B, C and their instances
Reference feature approach [Han SSD 95]
  C as reference feature to get transactions
  Transactions: (B1) (B2)
  Support(A,B) = ∅ from the Apriori algorithm
Note: Neighbor relationship includes the following pairs: (A1, B1), (A2, B1), (A2, B2), (B1, C1), (B2, C2)
[Figure: dataset]
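The reference feature construction above can be reproduced in a few lines. This is a sketch, not the cited algorithm: it assumes one transaction per instance of the reference feature, holding that instance's neighbors, and it reads a feature type from the first letter of an instance name. The helper names `feature_of`, `reference_transactions`, and `support` are hypothetical.

```python
# Neighbor pairs as listed on the slide.
NEIGHBOR_PAIRS = [("A1", "B1"), ("A2", "B1"), ("A2", "B2"),
                  ("B1", "C1"), ("B2", "C2")]

def feature_of(instance):
    """Feature type of an instance name such as 'B2' (assumed naming scheme)."""
    return instance[0]

def reference_transactions(pairs, reference):
    """Build one transaction per reference-feature instance from its neighbors."""
    transactions = {}
    for u, v in pairs:
        # The neighbor relation is symmetric, so check both orientations.
        for ref, other in ((u, v), (v, u)):
            if feature_of(ref) == reference:
                transactions.setdefault(ref, set()).add(other)
    return transactions

def support(transactions, features):
    """Count transactions that contain every feature type in `features`."""
    wanted = set(features)
    return sum(
        1 for items in transactions.values()
        if wanted <= {feature_of(i) for i in items}
    )

transactions = reference_transactions(NEIGHBOR_PAIRS, "C")
print(transactions)                      # one transaction per C instance
print(support(transactions, ["A", "B"]))
```

With C as the reference feature, no A instance ever enters a transaction, so the (A, B) co-location visible in the neighbor pairs cannot be found. That is the limitation the slide points out.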

Spatial Co-location Patterns
Spatial features A, B, C and their instances
Our approach (Event Centric)
  Neighborhood instead of transactions
  Spatial join on neighbor relationship
  Support → Prevalence
    Participation index = min. p_ratio
    P_ratio(A, (A,B)) = fraction of instances of A participating in join(A, B, neighbor)
  Examples
    Support(A,B) = min(2/2, 3/3) = 1
    Support(B,C) = min(2/2, 2/2) = 1
[Figure: dataset]
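A small sketch of the event centric computation for pairs of features follows. The dataset figure is not available in this transcript, so the instance table `INSTANCES` is an assumption: each feature is given exactly the two instances named in the slide's neighbor pairs. The function and variable names are hypothetical, and the code handles size-2 co-locations only.

```python
# Neighbor pairs as listed on the slides.
NEIGHBOR_PAIRS = [("A1", "B1"), ("A2", "B1"), ("A2", "B2"),
                  ("B1", "C1"), ("B2", "C2")]

# Assumed instance table (the slide's dataset figure is not reproduced here).
INSTANCES = {"A": ["A1", "A2"], "B": ["B1", "B2"], "C": ["C1", "C2"]}

def feature_of(instance):
    """Feature type of an instance name such as 'A1' (assumed naming scheme)."""
    return instance[0]

def participation_index(pairs, instances, f, g):
    """Participation index of the size-2 co-location {f, g}.

    The spatial join is the list of neighbor pairs whose feature types are
    exactly {f, g}; each participation ratio is the fraction of a feature's
    instances that appear in that join.
    """
    joined = [p for p in pairs if {feature_of(p[0]), feature_of(p[1])} == {f, g}]
    ratios = {}
    for feat in (f, g):
        participating = {x for p in joined for x in p if feature_of(x) == feat}
        ratios[feat] = len(participating) / len(instances[feat])
    return min(ratios.values()), ratios

pi_ab, ratios_ab = participation_index(NEIGHBOR_PAIRS, INSTANCES, "A", "B")
pi_bc, ratios_bc = participation_index(NEIGHBOR_PAIRS, INSTANCES, "B", "C")
pi_ac, ratios_ac = participation_index(NEIGHBOR_PAIRS, INSTANCES, "A", "C")
print(pi_ab, pi_bc, pi_ac)
```

Under the assumed instance table, (A, B) and (B, C) both get a participation index of 1, matching the slide's example values, while (A, C) gets 0 because no A and C instances are neighbors. The result depends only on the neighbor relation, not on any partitioning or choice of reference feature.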

Spatial Co-location Patterns
Spatial features A, B, C and their instances
Partition approach:
  Support(A,B) = 2, Support(B,C) = 2
  Support(A,B) = 1, Support(B,C) = 2
Reference feature approach:
  C as reference feature
  Transactions: (B1) (B2)
  Support(A,B) = ∅
Our approach:
  Support(A,B) = min(2/2, 3/3) = 1
  Support(B,C) = min(2/2, 2/2) = 1
[Figure: dataset]

Spatial Outliers
Spatial outlier: a data point that is extreme relative to its neighbors
Case study: traffic stations different from their neighbors [SIGKDD 2001]
Data: space-time plot, distribution of f(x), S(x)
Distribution of base attribute:
  spatially smooth
  frequency distribution over value domain: normal
Classical test: Pr[item in population] is low
Q? Distribution of diff[f(x), neighborhood agg{f(x)}]
Insight: this statistic is distributed normally!
Test: (z-score on the statistic) > 2
Performance: spatial join, clustering methods

Spatial Outlier Detection
Given:
  A spatial graph $G = \{V, E\}$
  A neighbor relationship ($K$ neighbors)
  An attribute function $f: V \to R$
  An aggregation function: $R^k \to R$
  A comparison function
  A confidence level threshold $\theta$
  A statistic test function $ST: R \to \{T, F\}$
Find: $O = \{v_i \mid v_i \in V,\ v_i \text{ is a spatial outlier}\}$
Objective:
  Correctness: the attribute value of $v_i$ is extreme compared with its neighbors
  Computational efficiency
Constraints:
  The comparison function and ST are algebraic aggregate functions of the attribute and aggregation functions
  Computation cost is dominated by I/O operations

Spatial Outlier Detection Test
1. Choice of spatial statistic: $S(x) = f(x) - E_{y \in N(x)}[f(y)]$
   Theorem: $S(x)$ is normally distributed if $f(x)$ is normally distributed
2. Test for outlier detection: $\left| (S(x) - \mu_s) / \sigma_s \right| > \theta$
Hypothesis: I/O cost is determined by clustering efficiency
[Figure panels: f(x), S(x)]
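The two-step test can be sketched directly from these formulas. The example below is illustrative only: the ring-shaped neighbor graph, the attribute values, and the function name `spatial_outliers` are assumptions, and $\mu_s$ and $\sigma_s$ are taken as the population mean and standard deviation of $S$ over all nodes.

```python
from statistics import mean, pstdev

def spatial_outliers(values, neighbors, theta=2.0):
    """Return nodes whose |z-score| of S(x) = f(x) - mean_{y in N(x)} f(y) exceeds theta."""
    # Step 1: difference between each node and the average of its neighbors.
    s = {v: values[v] - mean([values[n] for n in neighbors[v]]) for v in values}
    mu_s = mean(list(s.values()))
    sigma_s = pstdev(list(s.values()))
    if sigma_s == 0:
        return []  # every node matches its neighborhood average
    # Step 2: z-score test on the difference statistic.
    return sorted(v for v in s if abs((s[v] - mu_s) / sigma_s) > theta)

# Hypothetical data: 12 stations on a ring, each with two neighbors.
n = 12
neighbors = {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}
values = {i: 10 for i in range(n)}
values[5] = 40  # one station reads far above its neighbors

outliers = spatial_outliers(values, neighbors, theta=2.0)
print(outliers)
```

Station 5 is flagged. Its two neighbors also deviate from their own neighborhood averages because station 5 pulls those averages up, but their z-scores stay below the threshold of 2 used in the traffic case study.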

Spatial Outlier Detection: Results
1. CCAM achieves higher clustering efficiency (CE)
2. CCAM has lower I/O cost
3. High CE => low I/O cost
4. Big page => high CE
[Figure panels: I/O cost, CE value; methods compared: Z-order, CCAM, Cell-Tree]

A Unified Approach: Spatial Outliers
[Figure panels: Original Data, Our Approach, Scatter Plot]
Tests: quantitative, graphical
Results:
  Computation = spatial self-join
  Tests: algebraic functions of the join
  Join predicate: neighbor relations
  I/O cost: f(clustering efficiency)
Our algorithm is I/O-efficient for algebraic tests

Graphical Spatial Tests
[Figure panels: Original Data, Variogram Cloud, Moran Scatter Plot]

Location Prediction
Citations: IEEE Trans. on Multimedia 2002, SIAM DM Conf. 2001, SIGKDD DMKD 2000
Problem: predict nesting sites in marshes given vegetation, water depth, distance to edge, etc.
Data: maps of nests and attributes
  spatially clustered nests, spatially smooth attributes
Classical methods: logistic regression, decision trees, Bayesian classifier
  but the independence assumption is violated! They miss auto-correlation!
  Spatial auto-regression (SAR), Markov random field Bayesian classifier
Open issue: spatial accuracy vs. classification accuracy
Open issue: performance (SAR learning is slow!)

Location Prediction
Given:
  1. Spatial framework
  2. Explanatory functions
  3. A dependent class
  4. A family of function mappings
Find: classification model
Objective: maximize classification_accuracy
Constraints: spatial autocorrelation exists
[Figure panels: Nest locations, Distance to open water, Vegetation durability, Water depth]

Motivation and Framework

Spatial Autoregression Model (SAR)
$y = \rho W y + X\beta + \epsilon$
  $W$ models neighborhood relationships
  $\rho$ models the strength of spatial dependencies
  $\epsilon$ is the error vector
Solution procedures
  $\rho$ and $\beta$ can be estimated using ML or Bayesian statistics, e.g., a spatial econometrics package uses a Bayesian approach based on the sampling-based Markov Chain Monte Carlo (MCMC) method.
  Likelihood-based estimation requires $O(n^3)$ operations.
  Other alternatives: divide and conquer, sparse matrix, LU decomposition, etc.
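To make the roles of $W$, $\rho$, $\beta$ and $\epsilon$ concrete, the sketch below solves the SAR equation for $y$ when all parameters are already known. It does not perform the ML or Bayesian estimation mentioned on the slide. It assumes a row-normalized $W$ and $|\rho| < 1$, so that the fixed-point iteration $y \leftarrow \rho W y + X\beta + \epsilon$ converges. The four-node path graph and every numeric value are hypothetical.

```python
def row_normalize(adj):
    """Scale each row of a 0/1 adjacency matrix so that it sums to 1."""
    normalized = []
    for row in adj:
        total = sum(row)
        normalized.append([v / total if total else 0.0 for v in row])
    return normalized

def matvec(m, v):
    """Plain matrix-vector product on nested lists."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def solve_sar(W, X, beta, eps, rho, iters=200):
    """Solve y = rho*W*y + X*beta + eps by fixed-point iteration (needs |rho| < 1)."""
    base = [xb + e for xb, e in zip(matvec(X, beta), eps)]
    y = base[:]
    for _ in range(iters):
        y = [rho * wy + b for wy, b in zip(matvec(W, y), base)]
    return y

# Hypothetical example: four locations on a path 0-1-2-3.
adjacency = [[0, 1, 0, 0],
             [1, 0, 1, 0],
             [0, 1, 0, 1],
             [0, 0, 1, 0]]
W = row_normalize(adjacency)
X = [[1.0, 0.2], [1.0, 0.4], [1.0, 0.6], [1.0, 0.8]]  # intercept + one attribute
beta = [1.0, 2.0]
eps = [0.05, -0.02, 0.03, -0.01]
rho = 0.5

y = solve_sar(W, X, beta, eps, rho)

# Check that y satisfies the SAR equation.
rhs = [rho * wy + xb + e for wy, xb, e in zip(matvec(W, y), matvec(X, beta), eps)]
residual = max(abs(a - b) for a, b in zip(y, rhs))
print(y, residual)
```

Setting `rho = 0` reduces the model to ordinary linear regression, $y = X\beta + \epsilon$, which is the baseline that ignores spatial autocorrelation.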

Evaluation
[Figure panels: Linear Regression, Spatial Regression]
Spatial model is better

Markov Random Field based Bayesian Classifiers
$\Pr(l_i \mid X, L_i) = \Pr(X \mid l_i, L_i)\,\Pr(l_i \mid L_i) / \Pr(X)$
  $\Pr(l_i \mid L_i)$ can be estimated from training data
  $L_i$ denotes the set of labels in the neighborhood of $s_i$, excluding the label at $s_i$
  $\Pr(X \mid l_i, L_i)$ can be estimated using kernel functions
Solution procedures
  Stochastic relaxation [Geman]
  Iterated conditional modes [Besag]
  Graph cut [Boykov]
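Of the listed solution procedures, iterated conditional modes (ICM) is the simplest to sketch. The example below is a generic ICM on a one-dimensional chain and is not the authors' configuration. It swaps in a squared-distance data term with known class means in place of the kernel-estimated $\Pr(X \mid l_i, L_i)$, and a Potts-style agreement bonus `beta` in place of a $\Pr(l_i \mid L_i)$ learned from training data. All names and values are hypothetical.

```python
def icm(x, means, beta=0.5, max_sweeps=10):
    """Iterated conditional modes on a 1-D chain of sites.

    Energy of label l at site i (assumed form, for illustration):
        (x[i] - means[l])**2 - beta * (number of neighbors with label l)
    Each sweep sets every site to its lowest-energy label given the
    current labels of its neighbors.
    """
    n = len(x)
    num_labels = len(means)
    # Start from the per-site best label, ignoring neighbors.
    labels = [min(range(num_labels), key=lambda l: (xi - means[l]) ** 2) for xi in x]
    initial = labels[:]
    for _ in range(max_sweeps):
        changed = False
        for i in range(n):
            nbrs = [j for j in (i - 1, i + 1) if 0 <= j < n]

            def energy(l):
                agree = sum(1 for j in nbrs if labels[j] == l)
                return (x[i] - means[l]) ** 2 - beta * agree

            best = min(range(num_labels), key=energy)
            if best != labels[i]:
                labels[i] = best
                changed = True
        if not changed:
            break  # a full sweep with no change: a local optimum
    return initial, labels

# Hypothetical observations: class 0 centered at 0.0, class 1 at 1.0.
x = [0.1, 0.0, 0.7, 0.2, 0.1, 0.9, 1.0, 0.3, 0.8, 1.1]
ml_labels, icm_labels = icm(x, means=[0.0, 1.0], beta=0.5)
print(ml_labels)
print(icm_labels)
```

The two isolated labels produced by the neighbor-blind initial classification are smoothed away once neighboring labels enter the decision, which is the spatial-context effect that the MRF prior contributes.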

Comparison
SAR can be rewritten as $y = (QX)\beta + Q\epsilon$, where $Q = (I - \rho W)^{-1}$, which can be viewed as a spatial smoothing operation.
This transformation shows that SAR is similar to a linear logistic model and thus suffers from the same limitations, i.e., the SAR model assumes linear separability of classes in the transformed feature space.
The SAR model also makes more restrictive assumptions about the distribution of features and class shapes than MRF.
The relationship between SAR and MRF is analogous to the relationship between logistic regression and Bayesian classifiers.
Our experimental results show that the MRF model yields better spatial and classification accuracies than SAR predictions.
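The rewritten form follows from the SAR equation in a few steps, sketched here under the assumption that $I - \rho W$ is invertible:

```latex
\begin{align*}
y &= \rho W y + X\beta + \epsilon \\
(I - \rho W)\, y &= X\beta + \epsilon \\
y &= (I - \rho W)^{-1} X\beta + (I - \rho W)^{-1} \epsilon \\
  &= (QX)\beta + Q\epsilon, \qquad Q = (I - \rho W)^{-1}
\end{align*}
```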

MRF vs. SAR
[Tables: Confusion Matrix, Spatial Confusion Matrix]

Experiment Design

Conclusion and Future Directions
Spatial domains may not satisfy the assumptions of classical methods
  data: auto-correlation, continuous geographic space
  patterns: global vs. local, e.g. spatial outliers vs. outliers
  data exploration: maps and albums
Open issues
  patterns: hot-spots, blobology (shape), spatial trends, ...
  metrics: spatial accuracy (predicted locations), spatial contiguity (clusters)
  spatio-temporal datasets
  scale and resolution sensitivity of patterns
  geo-statistical confidence measures for mined patterns

Reference
1. S. Shekhar, S. Chawla, S. Ravada, A. Fetterer, X. Liu and C.T. Lu, "Spatial Databases: Accomplishments and Research Needs", IEEE Transactions on Knowledge and Data Engineering, Jan.-Feb.
2. S. Shekhar and Y. Huang, "Discovering Spatial Co-location Patterns: a Summary of Results", In Proc. of 7th International Symposium on Spatial and Temporal Databases (SSTD01), July
3. S. Shekhar, C.T. Lu, P. Zhang, "Detecting Graph-based Spatial Outliers: Algorithms and Applications", the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
4. S. Shekhar, C.T. Lu, P. Zhang, "Detecting Graph-based Spatial Outlier", Intelligent Data Analysis, to appear in Vol. 6(3)
5. S. Shekhar, S. Chawla, the book "Spatial Database: Concepts, Implementation and Trends", Prentice Hall
6. S. Chawla, S. Shekhar, W. Wu and U. Ozesmi, "Extending Data Mining for Spatial Applications: A Case Study in Predicting Nest Locations", Proc. Int. Conf. on 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2000), Dallas, TX, May 14
7. S. Chawla, S. Shekhar, W. Wu and U. Ozesmi, "Modeling Spatial Dependencies for Mining Geospatial Data", First SIAM International Conference on Data Mining
8. S. Shekhar, P.R. Schrater, R.R. Vatsavai, W. Wu, and S. Chawla, "Spatial Contextual Classification and Prediction Models for Mining Geospatial Data", to appear in IEEE Transactions on Multimedia
9. S. Shekhar, V. Kumar, P. Tan, M. Steinbach, Y. Huang, P. Zhang, C. Potter, S. Klooster, "Mining Patterns in Earth Science Data", IEEE Computing in Science and Engineering (submitted)

Reference
10. S. Shekhar, C.T. Lu, P. Zhang, "A Unified Approach to Spatial Outliers Detection", IEEE Transactions on Knowledge and Data Engineering (submitted)
11. S. Shekhar, C.T. Lu, X. Tan, S. Chawla, "Map Cube: A Visualization Tool for Spatial Data Warehouses", chapter in Geographic Data Mining and Knowledge Discovery, Harvey J. Miller and Jiawei Han (eds.), Taylor and Francis, 2001, ISBN
12. S. Shekhar, Y. Huang, W. Wu, C.T. Lu, "What's Spatial about Spatial Data Mining: Three Case Studies", chapter in Data Mining for Scientific and Engineering Applications, V. Kumar, R. Grossman, C. Kamath, R. Namburu (eds.), Kluwer Academic Pub., 2001, ISBN
13. Shashi Shekhar and Yan Huang, "Multi-resolution Co-location Miner: a New Algorithm to Find Co-location Patterns in Spatial Datasets", Fifth Workshop on Mining Scientific Datasets (SIAM 2nd Data Mining Conference), April 2002