Knowledge Discovery from Mobile Phone Communication Activity Data Streams
Fergal Walsh

Data Stream

Anonymised Customer Data Records (CDR) from Meteor, Ireland's third-largest mobile phone network.
More than 1 million customers.
One record per call or SMS sent or received.
About 40 million records per day.
About 7000 cells (spatial areas). Cell areas range from <1 km² to ~50 km².
Records are ordered by time and independent of each other, making this data ideally suited to stream processing.

Data Exploration

Pipeline stages: Raw CDR Data, Stream Processor, Indexed Database, Exploratory Query Tool.
Data stream processor for pre-processing each record and computing aggregates.
Spatial, temporal and user indices for efficient querying.
1 week of data (> 200 million records).
Web-based tool for ad hoc spatio-temporal queries.

Figure: Communication event counts per cell per hour (weekday average), shown at 00:00, 08:00, 12:00 and 18:00.
Figure: Trajectories of 2 sample users.
Figure: Location of caller and callee for 2 sample users.

Current Work

Information retrieval using stream data mining and machine learning techniques.
Find users similar to some example users (classification using Support Vector Machines):
    Users who travel from Maynooth to Dublin daily
    Users who travel to Dublin from rural areas daily (using semantics of spatial areas)
    Groups of users who are planning a meet-up (using communication motifs)
Find areas with similar phone usage activity profiles (clustering): nightlife, business, residential, rural.
Find clusters of users with similar activity profiles (clustering).
Development of ncg.nuim.ie/i2maps/

Future Work

Learn activity chains (probabilistic models) of each user's communication and movement events. These will use semantic labels rather than raw spatial locations.
Predict movement and communication events from learned models.

Publications

Pozdnoukhov A., Walsh F., Exploratory Novelty Identification in Human Activity Data Streams, ACM SIGSPATIAL International Workshop on GeoStreaming at 18th ACM SIGSPATIAL GIS.
Pozdnoukhov A., Walsh F., Kaiser F., Statistical Machine Learning from VGI, Position paper at Role of Volunteered Geographic Information in Advancing Science Workshop at GIScience'10.
Kaiser C., Walsh F., Farmer C. and Pozdnoukhov A., User-centric time-distance representation of road networks. In Springer LNCS proc. of the GIScience'10 (full paper).

Acknowledgements

Research presented in this poster was funded by a Strategic Research Cluster Grant (07/SRC/I1168) by Science Foundation Ireland under the National Development Plan. The authors gratefully acknowledge this support.
The authors gratefully acknowledge the support of Meteor for providing the data used in this poster, in particular Mr. John Bathe and Mr. Adrian Whitwham.
Thanks to Ronan Farrell (IMWS) for obtaining the data from Meteor for StratAG.
Thanks to John Doyle for providing the cell tessellation used in the examples above.
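
Illustrative sketch: stream aggregation

The poster describes a stream processor that pre-processes each record and computes aggregates such as communication event counts per cell per hour (weekday average). The Python sketch below shows one way such an aggregate could be maintained in a single pass over time-ordered records. It is not the authors' implementation: the record layout (an ISO timestamp plus a cell identifier), the class name HourlyCellAggregator, and the sample dates and cell names are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime


class HourlyCellAggregator:
    """Hypothetical one-pass aggregator: events per (cell, hour of day) on weekdays."""

    def __init__(self):
        self.counts = defaultdict(int)   # (cell_id, hour) -> weekday event count
        self.weekdays_seen = set()       # distinct weekday dates observed so far

    def update(self, timestamp, cell_id):
        """Process a single record; records can be fed one at a time, in time order."""
        t = datetime.fromisoformat(timestamp)
        if t.weekday() >= 5:             # 5 = Saturday, 6 = Sunday: not a weekday
            return
        self.weekdays_seen.add(t.date())
        self.counts[(cell_id, t.hour)] += 1

    def weekday_average(self, cell_id, hour):
        """Average number of events in this cell and hour per observed weekday."""
        if not self.weekdays_seen:
            return 0.0
        return self.counts.get((cell_id, hour), 0) / len(self.weekdays_seen)


# Made-up records (assumed layout; the poster does not give the CDR fields).
sample_records = [
    ("2010-03-01T08:05:00", "cell_17"),  # Monday
    ("2010-03-01T08:40:00", "cell_17"),  # Monday
    ("2010-03-01T12:10:00", "cell_42"),  # Monday
    ("2010-03-02T08:15:00", "cell_17"),  # Tuesday
    ("2010-03-06T08:30:00", "cell_17"),  # Saturday, ignored
]

agg = HourlyCellAggregator()
for timestamp, cell_id in sample_records:
    agg.update(timestamp, cell_id)

print(agg.weekday_average("cell_17", 8))
```

Because each record only increments one counter, the aggregator never needs to hold the raw stream in memory, which fits the poster's observation that the records are time-ordered and independent of each other.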
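
Illustrative sketch: clustering cells by activity profile

The Current Work section mentions finding areas with similar phone usage activity profiles by clustering, without naming an algorithm. The sketch below uses plain k-means as a stand-in, which is an assumption and not necessarily the method used in the project. The profiles are made-up event counts at four times of day, the cell names are hypothetical, and the deterministic choice of the first k profiles as initial centroids is only there to keep the example reproducible.

```python
from math import dist


def normalise(profile):
    """Scale a count profile to proportions so that clusters reflect shape, not volume."""
    total = sum(profile)
    return [v / total for v in profile] if total else list(profile)


def kmeans(points, k, iterations=20):
    """Plain k-means; the first k points serve as initial centroids (an assumption)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Made-up event counts at 00:00, 08:00, 12:00 and 18:00 for four hypothetical cells.
profiles = {
    "cell_A": [5, 40, 50, 10],   # daytime-heavy
    "cell_B": [60, 5, 10, 80],   # evening- and night-heavy
    "cell_C": [8, 80, 90, 25],   # daytime-heavy, higher volume
    "cell_D": [30, 2, 6, 45],    # evening- and night-heavy, lower volume
}

points = [normalise(p) for p in profiles.values()]
cluster_of = dict(zip(profiles, kmeans(points, k=2)))
print(cluster_of)
```

Normalising each profile before clustering groups cells by when activity happens rather than by how busy they are, which seems closer to the nightlife, business, residential and rural categories listed on the poster.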