Spatial Database & Spatial Data Mining Shashi Shekhar Dept. of Computer Sc. and Eng. University of Minnesota



Spatial Data
Location-based Services
–E.g.: MapPoint, MapQuest, Yahoo/Google Maps, …
(Courtesy: Microsoft Live Search)

Spatial Data
In-car Navigation Device
Emerson In-Car Navigation System (Courtesy: Amazon.com)

Book

Outline
Spatial Databases
–Conceptual Modeling
    Pictograms enhanced Entity Relationship Model
–Logical Data Model
    Direction predicates and queries
–Physical Data Model
    Query Processing – Shortest Paths, Evacuation Routes, Correlated time-series
    Storage – Connectivity Clustered Access Method
Spatial Data Mining
–Location Prediction – fast algorithms
–Co-location patterns – definition, algorithms
–Spatial outliers – algorithms
–Hot-spots – new work on “mean streets”

Geo-Spatial Databases: Management and Mining
1. Recent book from our group!
2. Parallelize Range Queries
3. Shortest Path Queries
4. Storing roadmaps in disk blocks
5. Location prediction to characterize nesting grounds (figure panels: Nest locations, Distance to open water, Vegetation durability, Water depth)
6. Spatial outliers: detect bad sensor (#9) on Highway I

Spatial Data Mining (SDM)
The process of discovering
–interesting, useful, non-trivial patterns
    patterns: non-specialist
    exception to patterns: specialist
–from large spatial datasets
Spatial pattern families
–Spatial outlier, discontinuities
–Location prediction models
–Spatial clusters
–Co-location patterns
–…

Spatial Data Mining - Example
(Figure panels: Nest locations, Distance to open water, Vegetation durability, Water depth)

Spatial Autocorrelation (SA)
First Law of Geography
–“All things are related, but nearby things are more related than distant things.” [Tobler, 1970]
Spatial autocorrelation
–Nearby things are more similar than distant things
–Traditional i.i.d. assumption is not valid
–Measures: K-function, Moran’s I, Variogram, …
(Figure panels: Pixel property with independent identical distribution; Vegetation Durability with SA)
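Moran's I, one of the measures listed on this slide, can be computed in a few lines. The following is a minimal pure-Python sketch, assuming binary neighbor weights on a hypothetical strip of four cells; the function name `morans_i` and the sample values are illustrative and do not come from the slides.

```python
def morans_i(values, weights):
    """Moran's I for a list of values and a dict {(i, j): w_ij} of
    spatial weights (pairs that are not listed count as weight 0)."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(weights.values())  # sum of all spatial weights
    cross = sum(w * dev[i] * dev[j] for (i, j), w in weights.items())
    return (n / s0) * cross / sum(d * d for d in dev)

# Hypothetical strip of four cells; each cell neighbors the next one.
strip = {}
for i in range(3):
    strip[(i, i + 1)] = 1.0
    strip[(i + 1, i)] = 1.0

smooth = morans_i([1, 2, 3, 4], strip)   # neighbors have similar values
checker = morans_i([1, 4, 1, 4], strip)  # neighbors alternate
print(smooth, checker)
```

On this toy strip the smoothly increasing values give a positive I (about 0.33) and the alternating values give I = -1, which matches the intuition that positive values indicate nearby things being more similar.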

Implication of Auto-correlation
Computational Challenge: computing the determinant of a very large matrix in the Maximum Likelihood Function. [equation not preserved in this transcript]
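The likelihood formula on this slide is not preserved in the transcript. As a hedged illustration only, assuming the model in question is the spatial autoregressive (SAR) model, the log-likelihood is commonly written as below; the symbols (ρ for the spatial dependence parameter, W for the n × n neighborhood matrix, X and β for the regression part, σ² for the noise variance) are assumptions and may differ from the slide's notation.

```latex
y = \rho W y + X\beta + \varepsilon, \qquad \varepsilon \sim N(0, \sigma^2 I)

\ln L(\rho, \beta, \sigma^2)
  = \ln\lvert I - \rho W \rvert
  - \frac{n}{2}\ln(2\pi\sigma^2)
  - \frac{1}{2\sigma^2}\,(y - \rho W y - X\beta)^{\top}(y - \rho W y - X\beta)
```

Under this assumption the term ln|I − ρW| is the determinant the slide refers to: W has one row and one column per location, so evaluating it repeatedly while searching over ρ dominates the cost for large datasets.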


Spatio-temporal Query Processing
Teleconnection
–Find (land location, ocean location) pairs with correlated climate changes
    Ex. El Nino affects climate at many land locations
(Figures: Global Influence of El Nino during the Northern Hemisphere Winter (D: Dry, W: Warm, R: Rainfall); Average Monthly Temperature. Courtesy: NASA, Prof. V. Kumar)

Auto-correlation saves computation cost
Challenge
–high dimensional (e.g., 600) feature space
–67k land locations and 100k ocean locations (degree by degree grid)
–50-year monthly data
Computational Efficiency
–Spatial autocorrelation
    Reduce Computational Complexity
–Spatial indexing to organize locations
    Top-down tree traversal is a strong filter
    Spatial join query: filter-and-refine
–save 40% to 98% computational cost at θ = 0.3 to 0.9
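The filter-and-refine idea can be sketched for a correlation join with threshold θ. This is a minimal illustration, not the authors' implementation: it assumes locations are already grouped into spatial blocks (here hand-made lists rather than a tree index), treats each normalized time series as a unit vector so that correlation equals the cosine of the angle between vectors, and bounds the correlation between two blocks using a cone around each block. All names and the sample series are hypothetical, and series are assumed to be non-constant.

```python
import math

def normalize(series):
    """Center and scale to unit length; the dot product of two
    normalized series is then their Pearson correlation."""
    mean = sum(series) / len(series)
    centered = [v - mean for v in series]
    norm = math.sqrt(sum(v * v for v in centered))
    return [v / norm for v in centered]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def angle(a, b):
    return math.acos(max(-1.0, min(1.0, dot(a, b))))

def cone(vectors):
    """Center direction and half-angle of a cone containing all vectors."""
    summed = [sum(col) for col in zip(*vectors)]
    norm = math.sqrt(dot(summed, summed))
    center = [v / norm for v in summed] if norm > 0 else vectors[0]
    # Small slack guards the bound against floating-point rounding.
    return center, max(angle(center, v) for v in vectors) + 1e-9

def correlation_join(land_groups, ocean_groups, theta):
    """Groups are dicts {location_id: series}. Returns the qualifying
    (land_id, ocean_id) pairs and the number of pairwise checks made."""
    result, refined = set(), 0
    land = [{k: normalize(s) for k, s in g.items()} for g in land_groups]
    ocean = [{k: normalize(s) for k, s in g.items()} for g in ocean_groups]
    for lg in land:
        lc, ld = cone(list(lg.values()))
        for og in ocean:
            oc, od = cone(list(og.values()))
            gamma = angle(lc, oc)
            max_corr = math.cos(max(0.0, gamma - ld - od))
            min_corr = math.cos(min(math.pi, gamma + ld + od))
            if max_corr < theta:
                continue  # filter: no pair in these blocks can qualify
            if min_corr >= theta:
                result.update((l, o) for l in lg for o in og)
                continue  # filter: every pair qualifies
            for l, lv in lg.items():  # refine: check pairs one by one
                for o, ov in og.items():
                    refined += 1
                    if dot(lv, ov) >= theta:
                        result.add((l, o))
    return result, refined

land_groups = [{"l1": [1, 2, 3, 4, 5, 6], "l2": [1, 2, 3, 4, 5, 7]},
               {"l3": [6, 5, 4, 3, 2, 1], "l4": [7, 5, 4, 3, 2, 1]}]
ocean_groups = [{"o1": [2, 4, 6, 8, 10, 12], "o2": [1, 2, 3, 4, 6, 6]}]
pairs, refined = correlation_join(land_groups, ocean_groups, 0.9)
print(sorted(pairs), refined)
```

Because spatially close locations tend to have similar series, their cones are narrow, so many block pairs are accepted or rejected as a whole and only the undecided ones need pairwise correlation. In the toy data the second land block is anti-correlated with the ocean block and is discarded without any pairwise check.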

Evacuation Route Planning - Motivation
No coordination among local plans means
    Traffic congestion on all highways
        e.g. 60-mile congestion in Texas (2005)
    Great confusion and chaos
"We packed up Morgan City residents to evacuate in the a.m. on the day that Andrew hit coastal Louisiana, but in early afternoon the majority came back home. The traffic was so bad that they couldn't get through Lafayette." Mayor Tim Mott, Morgan City, Louisiana
(Photos: Florida, Louisiana (Andrew, 1992); I-45 out of Houston, Houston (Rita, 2005). Credits: National Weather Service, FEMA.gov)

A Real Scenario Nuclear Power Plants in Minnesota Twin Cities

Monticello Emergency Planning Zone
Emergency Planning Zone (EPZ) is a 10-mile radius around the plant divided into sub areas.
Monticello EPZ
    Subarea   Population
    2         4,675
    5N        3,994
    5E        9,645
    5S        6,749
    5W        2,236
    10N E     1,785
    10SE      1,390
    10S       4,616
    10SW      3,408
    10W       2,354
    10NW      707
    Total     41,950
Estimated EPZ evacuation time:
    Summer/Winter (good weather): 3 hours, 30 minutes
    Winter (adverse weather): 5 hours, 40 minutes
Data source: Minnesota DPS & DHS

A Real World Testcase
(Map legend: Source cities; Destination; Monticello Power Plant; Routes used only by old plan; Routes used only by result plan of capacity constrained routing; Routes used by both plans)
Congestion is likely in the old plan near the evacuation destination due to capacity constraints. Our plan has richer routes near the destination to reduce congestion and total evacuation time.
Twin Cities Experiment Result
Total evacuation time:
- Existing Plan: 268 min.
- New Plan: 162 min.
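To make the role of capacity constraints concrete, here is a simplified greedy sketch in the spirit of capacity constrained routing. It is not the planner used for the results above: it assumes integer travel times of at least 1, a per-time-step capacity on each edge, unlimited waiting at nodes (node capacities are ignored), and a single destination. The network and all names are hypothetical.

```python
import heapq

def earliest_arrival_path(edges, reserved, source, dest, horizon):
    """Dijkstra over (node, time) states. Entering edge (u, v) at time t
    needs unreserved capacity on that edge at t; waiting is always allowed."""
    heap = [(0, source, [])]  # (time, node, [(u, v, departure_time), ...])
    seen = set()
    while heap:
        t, node, path = heapq.heappop(heap)
        if node == dest:
            return t, path
        if (node, t) in seen or t >= horizon:
            continue
        seen.add((node, t))
        heapq.heappush(heap, (t + 1, node, path))  # wait one time step
        for (u, v), (travel, cap) in edges.items():
            if u == node and cap - reserved.get((u, v, t), 0) > 0:
                heapq.heappush(heap, (t + travel, v, path + [(u, v, t)]))
    return None

def plan_evacuation(edges, evacuees, dest, horizon=1000):
    """edges: {(u, v): (travel_time, capacity)}; evacuees: {source: count}.
    Returns (total evacuation time, list of scheduled groups)."""
    reserved, remaining, schedule = {}, dict(evacuees), []
    while any(remaining.values()):
        best = None
        for src, count in remaining.items():
            if count > 0:
                found = earliest_arrival_path(edges, reserved, src, dest, horizon)
                if found and (best is None or found[0] < best[0]):
                    best = (found[0], src, found[1])
        if best is None:
            raise RuntimeError("no route found within the time horizon")
        arrival, src, path = best
        # Send as many people as the tightest edge on the path still allows.
        flow = min([remaining[src]] +
                   [edges[(u, v)][1] - reserved.get((u, v, t), 0)
                    for u, v, t in path])
        for u, v, t in path:
            reserved[(u, v, t)] = reserved.get((u, v, t), 0) + flow
        remaining[src] -= flow
        schedule.append((src, flow, arrival, path))
    return max((a for _, _, a, _ in schedule), default=0), schedule

# Hypothetical network: one direct road, then the same road plus a detour.
single = {("A", "D"): (1, 2)}
richer = {("A", "D"): (1, 2), ("A", "B"): (1, 2), ("B", "D"): (1, 2)}
time_single, _ = plan_evacuation(single, {"A": 5}, "D")
time_richer, _ = plan_evacuation(richer, {"A": 5}, "D")
print(time_single, time_richer)
```

With five evacuees and a capacity of two per time step, the single road needs three time steps, while the network with an extra route finishes in two. This mirrors, on a toy scale, the slide's point that richer routes near the destination reduce total evacuation time.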


Resource Description Framework (RDF)
Physical model
–Representation
    Directed Acyclic Graph, TAGs
–Storage method
    Connectivity-Clustered Access Method (CCAM)
–Frequent Operations
    Breadth First Search
    Path Computation
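A small sketch can show why clustering connected nodes on the same disk page helps breadth first search. It is an illustration only, not CCAM itself: the graph, the two page assignments, and the one-page buffer used for counting page loads are all assumptions made for the example.

```python
from collections import deque

def bfs_order(adj, start):
    """Return nodes in breadth-first order from `start`."""
    seen = {start}
    order = []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        order.append(node)
        for nxt in adj.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return order

def page_loads(order, page_of):
    """Count page loads when nodes are read in `order`, assuming a buffer
    that holds only the most recently loaded page."""
    loads, current = 0, None
    for node in order:
        if page_of[node] != current:
            current = page_of[node]
            loads += 1
    return loads

# Hypothetical directed graph stored as adjacency lists.
adj = {"a": ["b", "c"], "b": ["d"], "c": ["e"]}
order = bfs_order(adj, "a")

clustered = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1}  # neighbors share pages
scattered = {"a": 0, "b": 1, "c": 0, "d": 1, "e": 0}  # neighbors split up
print(order, page_loads(order, clustered), page_loads(order, scattered))
```

For this traversal the clustered placement needs two page loads and the scattered one needs five, which is the kind of saving a connectivity-based storage method aims for.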

Semantics in Databases
Ontology - Shared conceptualization of knowledge in a specific domain.
Resource Description Framework (RDF) - Representation of resource information on the World Wide Web.
Patterns

Ontology based Semantic Computing
Example Query
    SELECT * FROM travelmode
    WHERE ONT_RELATED (transport, 'IS_A', 'Road', 'Transport_Ontology', 123) = 1;
    Result: All walk and drive modes.
(Ontology figure labels: Transport, Road, Commuter Rail, Bus, Drive, Walk, …)
Applications
    Homeland Security, Life Sciences, Web Services
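The effect of the IS_A test in the example query can be imitated with a few lines of Python. This is a sketch under stated assumptions, not the database operator: the IS_A links below are hypothetical (the transcript does not preserve the structure of Transport_Ontology), the table rows are invented, and treating a term as related to itself is a choice made here. The links are assumed to be acyclic.

```python
# Hypothetical IS_A links (child -> parent); the real Transport_Ontology
# is not reproduced in the transcript.
PARENT = {
    "Drive": "Road",
    "Walk": "Road",
    "Road": "Transport",
    "Commuter Rail": "Transport",
}

def is_a(term, ancestor, parent=PARENT):
    """True if `term` equals `ancestor` or reaches it via IS_A links."""
    while term is not None:
        if term == ancestor:
            return True
        term = parent.get(term)
    return False

# Invented rows of a travelmode table: (row id, transport mode).
travelmode = [("trip1", "Walk"), ("trip2", "Drive"), ("trip3", "Commuter Rail")]
road_trips = [trip for trip, mode in travelmode if is_a(mode, "Road")]
print(road_trips)
```

Rows whose mode is Walk or Drive are selected because both reach Road through the hierarchy, while Commuter Rail does not, which parallels the "all walk and drive modes" result on the slide.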

Resource Description Framework (RDF)
Multimodal Transportation System
Commonwealth Ave. and Subway (Green Line), Boston
Graph Representation (between BU Central and Blandford St)
(Figure legend: Subway Stations; Road Intersections; Transition Edge. Node labels: N1, N2, N3, N4, N5, R1, R2, R3)

Resource Description Framework (RDF)
Multimodal Transportation System
(Schema figure. Subsystems: Road System, Light Rail System, joined by Transit Edges(*). Classes: Street(s), TrafficLight, RailRoute, bus, busTerminals, busStops, Rail_line, Terminals, Stations, Trains. Relations: crosscuts, used_by, parallel, has, start/end, halts, serves.)
Find all routes from Commonwealth Avenue to Logan Airport using bus and subway systems.
    SELECT S.street, S.busStop, R.Stations, R.RailRoute, R.Terminal
    FROM TABLE(SDO_RDF_MATCH(
        ‘(?x :halts ?b)
        (?rr :serves ?z),
        (?rr :start/end ?tr),
        SDO_RDF_Models(‘rail_line R’,’street S’)),
    WHERE S.b hasTransitTo R.z and S.Street = ‘Commonwealth’
        and R.terminal = ‘Logan airport’;
*Note: A subset of possible transition edges is shown.
