Using mobile network big data for land use classification Kaushalya Madhawa, Sriganesh Lokanathan, Danaja Maldeniya, Rohan Samarajiva CPRsouth 2015 Taipei.

Slides:



Advertisements
Similar presentations
David J. Sailor1 and Hongli Fan2 1. Portland State University
Advertisements

The methodology and applications of Agricultural Landscape monitoring in Estonia Kalev Sepp, Institute of Environmental Protection Estonian Agricultural.
Literature Review Kathryn Westerman Oliver Smith Enrique Hernandez Megan Fowler.
Enhancing Policy Decision Making with Large-Scale Digital Traces Vanessa Frias-Martinez University of Maryland NFAIS, February 2014.
Baseline knowledge about ICT in Nepal Rohan Samarajiva Nagarkot, March 2015 This work was carried out with the aid of a grant from the International.
Representativity of the Iowa Environmental Mesonet Daryl Herzmann and Jeff Wolt, Department of Agronomy, Iowa State University The Iowa Environmental Mesonet.
THE AUSTRALIAN NATIONAL UNIVERSITY Infrasound Technology Workshop, November 2007, Tokyo, Japan OPTIMUM ARRAY DESIGN FOR THE DETECTION OF DISTANT.
Unit Seven: Cities and Urban Land Use Advanced Placement Human Geography Session 1.
Improving intraurban land use characterization with nighttime imagery Sharolyn Anderson, Assistant Professor, University of Denver Paul C. Sutton, Associate.
“Real-time” Transient Detection Algorithms Dr. Kang Hyeun Ji, Thomas Herring MIT.
Avatar Path Clustering in Networked Virtual Environments Jehn-Ruey Jiang, Ching-Chuan Huang, and Chung-Hsien Tsai Adaptive Computing and Networking Lab.
Unit Seven: Cities and Urban Land Use Advanced Placement Human Geography Session 8.
SocalBSI 2008: Clustering Microarray Datasets Sagar Damle, Ph.D. Candidate, Caltech  Distance Metrics: Measuring similarity using the Euclidean and Correlation.
Alain Bertaud Urbanist Module 2: Spatial Analysis and Urban Land Planning The Spatial Structure of Cities: International Examples of the Interaction of.
Regional and urban Policy Measuring access to public transport in European cities Hugo Poelman REGIO-GIS DG Regional and Urban Policy.
Data Mining Techniques
Data Mining for Intrusion Detection: A Critical Review Klaus Julisch From: Applications of data Mining in Computer Security (Eds. D. Barabara and S. Jajodia)
Urban land-use models provide valuable tools for studying the internal structure of cities, but their applicability to large cities of the world has been.
1 The Online Issue Tracker 7th PPD Global Workshop: Public-Private Dialogue for Sustainable Business. Frankfurt, Germany Véronique Salze-Lozac’h, Economic.
What is the significance of ICTs to legislators? Rohan Samarajiva Yangon, 26 July 2014 This work was carried out with the aid of a grant from the International.
Seasonal Decomposition of Cell Phone Activity Series and Urban Dynamics Blerim Cici, Minas Gjoka, Athina Markopoulou, Carter T. Butts 1.
Optimal use of new satellite resources. Research funded by NERC/CEH and JNCC. Rapid Land Cover Mapping.
Where did you come from? Where did you go? Robust policy relevant evidence from mobile network big data Danaja Maldeniya, Amal Kumarage, Sriganesh Lokanathan,
BUILDING EXTRACTION AND POPULATION MAPPING USING HIGH RESOLUTION IMAGES Serkan Ural, Ejaz Hussain, Jie Shan, Associate Professor Presented at the Indiana.
Visualizing the Uncertainty of Urban Ontology Terms
THE SOCIO-SPATIAL SYNERGY IN LAND GOVERNANCE: A CASE OF INFORMAL SETTLEMENTS IN BAHIR DAR, ETHIOPIA BERHANU KEFALE ALEMIE, ROHAN BENNETT, JAAP ZEVENBERGEN.
Using cluster analysis for Identifying outliers and possibilities offered when calculating Unit Value Indices OECD NOVEMBER 2011 Evangelos Pongas.
LV Network Templates for a Low Carbon Future Mark Dale & Gavin Shaddick 26 th October 2012.
Big Data for Development: New opportunities for emerging markets Presentation to Access to Information Unit, Bangladesh Prime Minister’s Office, Dhaka.
YOU ARE WHAT YOU EAT (AND DRINK): IDENTIFYING CULTURAL BOUNDARIES BY ANALYZING FOOD AND DRINK HABITS IN FOURSQUARE Presenter: LEUNG Pak Him.
NOVEMBER 19, 2015 STUDY SESSION R IDGEFIELD J UNCTION S UBAREA P LAN DRAFT NOVEMBER 2015.
Where is the Centre of Hua Hin’s CBD?
Blue Grass Energy Cooperative Corporation 2006 Load Forecast Prepared by: East Kentucky Power Cooperative, Inc. Forecasting and Market Analysis Department.
Big and Open Data: Challenges and Issues
Using geolocated Twitter traces to infer residence and mobility Nigel Swier, Bence Kormaniczky, and Ben Clapperton.
1 Work carried out by SCOT and KUL presented at VEGETATION 2000 Conference with the support of CNES and contribution of JRC and VTT.
Clustering Patrice Koehl Department of Biological Sciences National University of Singapore
Categories of urban Land Use
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Effects of Highway Capacity Increases on Urban Development Patterns Land Use and Growth Impacts from Highway Capacity IncreasesLand Use and Growth Impacts.
Urban Land Use. Residential – Includes all places where people live – Generally the largest land use in most cities often taking up to 40% or more of.
Demonstration How to create meaningful Maps - using graduated symbols - using graduated colours - using Classification methods Analyzing techniques.....
Unit #2 – Human Geography Population. Demographics statistics based on population related factors such as age, sex, education, etc. Birthrate number of.
Urban Land Use Chapter Major Land Uses 1. Residential (40%) 2. Transportation (33%) 3. Commercial (5%) 4. Industrial (6%) 5. Institutional and Public.
2013 IEEE 14th International Conference on Mobile Data Management Authors: 1. Jiansu Pu 2. Siyuan Liu 3. Ye Ding 4. Huamin Qu 5. Lionel Ni By: Farah Kamw.
CBD Characteristics You will need to be able to describe and where appropriate explain the main characteristics of the CBD. Where possible always try and.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
Topic 4: Cluster Analysis Analysis of Customer Behavior and Service Modeling.
A Spectral Approach for Large Scale Data Traffic Load: Analysis and Application HONGZHI SHI, YONG LI (TSINGHUA UNIVERSITY) DI WU (HUNAN UNIVERSITY) YING.
Big data Analytics for Tourism Destination management
Data Partnerships: LIRNEasia’s experience in Sri Lanka
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Handbook on Residential Property Price Indices
Clustering Patrice Koehl Department of Biological Sciences
Student handout.
Session 3: Innovative solutions
A Personal Tour of Machine Learning and Its Applications
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Unit Seven: Cities and Urban Land Use Advanced Placement Human Geography Session 1.
Baselining PMU Data to Find Patterns and Anomalies
Unit Seven: Cities and Urban Land Use Advanced Placement Human Geography Session 8.
National inventories for Air Quality Management
Twitter as a novel source of mobility indicators
Network Science: A Short Introduction i3 Workshop
Special Topics in Geo-Business Data Analysis
University of Washington, Autumn 2018
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Benjamin Scholl, Daniel E. Wilson, David Fitzpatrick  Neuron 
Presentation transcript:

Using mobile network big data for land use classification Kaushalya Madhawa, Sriganesh Lokanathan, Danaja Maldeniya, Rohan Samarajiva CPRsouth 2015 Taipei City, Taiwan 25 th August 2015 This work was carried out with the aid of a grant from the International Development Research Centre, Canada and the Department for International Development UK..

Implications of using mobile network big data for urban policy Almost real-time monitoring of urban land use Help align master plan to reality Complement infrequent and expensive surveys 2

The data: historical and anonymized Call Detail Records (CDRs) from Sri Lanka Call Detail Record (CDR): – Records of all calls made and received by a person created mainly for the purposes of billing – Similar records exist for all SMS-es sent and received as well as for all Internet sessions – The Cell ID in turn has a lat-long position associated with it CDR data for 1 month in 2013 – Covers under 10 million SIMs – Nearly 1.5 billion records of calls made and received 3

The hourly loading of base stations reveals distinct patterns We can use this insight to group base stations into different groups, using unsupervised machine learning techniques 4 Type Y: ? Type X: ?

Methodology The time series of users connected at a base station contains variations, that can be grouped by similar characteristics A month of data is collapsed into an indicative week (Sunday to Saturday), with the time series normalized by the z-score Principal Component Analysis(PCA) is used to identify the discriminant patterns from noisy time series data Each base station’s pattern is filtered into 15 principal components (covering 95% of the data for that base station) Using the 15 principal components, we cluster all the base stations into 3 clusters in an unsupervised manner using k-means algorithm 5

Three spatial clusters in Colombo District 6 Cluster-1 exhibits patterns consistent with commercial area Cluster-3 exhibits patterns consistent with residential area Cluster-2 exhibits patterns more consistent with mixed-use

Our results show Central Business District (CBD) in Colombo city has expanded 7

Small area in NE corner of Colombo District classified as belonging to Cluster 1? 8 Seethawaka Export Processing Zone Photo ©Senanayaka Bandara - PanoramioPanoramio

We use silhouette coefficients to understand the quality of the clustering Silhouette coefficient indicates quality of clustering a(i) - average distance of i with all other data within the same cluster b(i) - average distance of i with all other data within the neighboring cluster Based on the s-values, Cluster 3 is the least coherent amongst the three 9 ClusterAvg. Silhouette Coefficient 1 – Commercial – Residential – Mixed-use0.22

Internal variations in mixed use regions: More commercial or more residential? 10 Blue dots: more residential than commercial Red dots: more commercial than residential To evaluate the relative closeness to the other two clusters, we define extent of commercialization as:

Plans & reality 1985 Plan 2013 reality UDA Plan

Policy implications and future work Almost real-time monitoring of urban land use – We are currently working on understanding finer temporal variations in zone characteristics (especially the mixed-use areas) Help align master plan to reality Can complement infrequent & expensive surveys LIRNEasia is working to unpack the identified categories further, e.g., – Entertainment zones that show evening activity 12