1 OLAP for heterogeneous socio-economic data – the challenge of integration, analysis and crime prevention: a Czech case study. Jiří HORÁK, Igor IVAN,

Slides:



Advertisements
Similar presentations
Joint meeting with National Statistical Offices and National Mapping Agencies of the Phare Candidate Countries Report on the use of geographical information.
Advertisements

Urban Statistics serving the evolving European Urban Agenda Presented by Jagdev Virdee Prepaired by Teodora Brandmuller, Eurostat unit E4 IAOS 2014, Da.
Counting the Dutch, The Future of the Virtual Census in the Netherlands Presentation at the seminar Counting the 7 Billion 24 February 2012 * Geert Bruinooge.
Application for presenting census results in the context of statistical data confidentiality in Poland Amelia Wardzińska-Sharif Central Statistical Office.
Hiroyuki KITADA, Yumi SEKINE National Statistics Center Japan.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Geographic Information Systems
Lab3 CPIT 440 Data Mining and Warehouse.
The Registry of Territorial Identification, Addresses and Real Estates Jaroslav Bačina.
OLAP OPERATIONS. OLAP ONLINE ANALYTICAL PROCESSING OLAP provides a user-friendly environment for Interactive data analysis. In the multidimensional model,
Producing migration data using household surveys Experience of the Republic of Moldova UNECE Work Session on Migration Statistics, Geneva, October.
Seminar on “Spatial statistics” Session 1: Use of statistical grids in official statistics Conference of European Statisticians, Paris, Fifty-eighth plenary.
DATA PROTECTION ISSUES COMBINING OF PERSONAL DATA STORED IN DIFFERENT INSTITUTIONS 9th Meeting of Central and Eastern European Commissioners June 4-6 th.
Enabling a national road and street database in population statistics Pasi Piela Q2014 Vienna Conference.
Working in The Czech Republic Citizens of EU/EEA countries do not need a work permit Registration at Labour Office – made by employer Residence permit.
Dutch Virtual Census Presentation at the International Seminar on Population and Housing Censuses; Beyond the 2010 Round November, 2012 Egon Gerards,
Nikola Šimandlová Alternativa 50+, o.p.s Age management: tools for breaking.
Igor Kuzma, Statistical Office of the Republic of Slovenia Tomaž Žagar, Geodetic Institute of Slovenia GIS Portal – dissemination of geostatistics
VESTA GIS WORKSHOP – Salzburg1 Experiences with vocational training on Geographical Information from VSB-TU Ostrava Jiri Horak, Bronislava Horakova,
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
CITY CENTER DELIMITATION Olomouc case study Jaroslav Burian First StatGIS conference.
More and better Improvement of official statistics through the Swedish Geodata Cooperation Jerker MOSTRÖM Senior Advisor, Regions and Environment Department,
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
CS 157B: Database Management Systems II March 20 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
USE OF GIS TECHNOLOGY IN STATISTICAL OFFICE OF ESTONIA NORDIC FORUM FOR GEO-STATISTICS , COPENHAGEN Inge Nael HEAD OF GIS SECTION MARKETING.
Geneva, 21 May 2012 Snezana Lakcevic Statistical Office of the Republic of Serbia Head of Population Census Division Workshop on Censuses Using Registers.
Data Warehousing.
The Dutch Virtual Census based on registers and already existing surveys Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
New and easier ways of working with aggregate data and geographies from UK censuses Justin Hayes UK Data Service Census Support.
MODERN CENSUS in POLAND Janusz Dygaszewicz Central Statistical Office in Poland Group of Experts on Population and Housing Census Geneva, October.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
CZECH STATISTICAL OFFICE | Na padesátém 81, Prague 10 | czso.cz1/X Ing. Jaroslav Kraus, Ph.D. Mgr. Štěpán Moravec DISAGGREGATION METHODS FOR GEOREFERENCING.
The availability of Dutch census microdata Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands Division Social.
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
5 Marzo 2007 INNOVATIONS IN CENSUS MAPPING AND CENSUS DATA GEOCODING Fabio Crescenzi Istat, Central Directorate on General Censuses Joint UNECE/Eurostat.
Sharing experience Attribution (by) Licensees may copy, distribute, display and perform the work and make derivative works based on it only if they give.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
OLAP On Line Analytic Processing. OLTP On Line Transaction Processing –support for ‘real-time’ processing of orders, bookings, sales –typically access.
Enabling a national road and street database in population statistics: Commuting distances for all employed persons and other accessibility statistics.
EFGS 2015 European Forum for Geography and Statistics 2015 Conference
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
UN Workshop for South Asian Countries on Collection and Dissemination of Socio-economic Data from Population and Housing Censuses May 2012 Rudra.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
1 OVERVIEW OF THE WORLD PROGRAMME FOR THE CENSUS OF AGRICULTURE 2010 FAO Statistics Division November 2005.
REPUBLIC OF TURKEY TURKISH STATISTICAL INSTITUTE TurkStat Demography Statistics Department Population and Migration Statistics Group EXPERIENCES.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Regional Policy Towards indicators of proximity to services in Europe's major cities Enhancing the analytical use of the GMES Urban Atlas in combination.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Statistics, Geodata and Our Way to Geoinformation Zdeňka Udržalová Head of Unit of Statistical Territorial Units Statistical Registers Department Czech.
A growing demand for small area statistics. How to make demand and supply meet? Asta Manninen, Pilar Martin-Guzmán and Derek Bond CESS Budapest, 20 – 21.
Chapter 13 Business Intelligence and Data Warehouses
Analysis of demographical and economical statistical data on the basis of a trans-national hierarchical grid Igor Kuzma Statistical Office of the Republic.
The regional and urban dimension of crime in the EU
Working Group - Geographic Information Systems for statistics
Country report - Finland
Census Planning and Management
Country report - Sweden
Urban Statistics on a national scale in the Netherlands
Plans for the 2021 Population and Housing Census
Local Administrative Units
Key Considerations for Planning and Management of Census Operations
Key Considerations for Planning and Management of Census Operations
Presentation transcript:

1 OLAP for heterogeneous socio-economic data – the challenge of integration, analysis and crime prevention: a Czech case study. Jiří HORÁK, Igor IVAN, Bronislava HORÁKOVÁ VSB-Technical University of Ostrava Intergraph CS Ltd. Czech Republic

2 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Big Spatial Data Features: – Volume beyond the limit of usual geo-processing, – Velocity higher than available by usual processes, – Variety, combining more diverse geodata sources than usual. traditional methods of geodata collection, storing, processing, controlling, analysing, modelling, validating and visualizing fail to provide effective solutions how to exploit the big spatial data?

3 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 part of Business intelligence On-line analytical processing - provide an effective and intuitive access to consolidated data (harmonized and aggregated) stored in multidimensional data structures. OLAP operations: – Drill-down (success in hierarchy down, towards more details), – Roll - Up (success in hierarchy up, obtaining more aggregated data) – Drill-Across (link several fact tables with the same granularity) – Slice-and-Dice (splitting data) – Pivot (exchange of dimension in designed view) multidimensional database as a Data Warehouse: subject-oriented, integrated, time-variant and non-volatile collection of data Multidimensional database and OLAP

4 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 dimensional modelling elementary items in fact tables contain aggregated data (counts, sums etc.) organised according to dimensions (features) dimensions usually contain hierarchical structure Granularity – the level of detail for facts Additivity - possibility to summarize data according to dimensions Fact tables and dimensions

5 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Data sources: population data – grid 1km, 100 m Census 2011(CZSO), municipal IS reg. of land identif., addresses and properties - buildings (NMCA) central crime register (Police CZ) - events offence register (city police) – local, central is planned register of schools (Min. of Education, Youth and Sports) - contact register of health service providers (Min. of Health) – contact, beds register of unemployed (Labour office) register of gambling machines (Min. of Finance) register of companies (CZSO, or others) DWH & OLAP for social environment (crime, human factors)

6 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 ETL processes: Data differs in quality, formats, accesses, legal and ethical aspects (license policy, sensitivity), and maintenance control procedures - integrity constrains, check validity of time range, geographical range, referential integrity etc. harmonisation – referential time of event from time interval, harmonisation of addresses, classification of facilities, buildings etc. Geocoding for missing or bad coordinates aggregation – according to multidimensional modelling data anonymization – filtering, scramble, rounding, projection ETL processes for DWH & OLAP for social environment

7 Fact tables: CRIME POPULATION UNEMPLOYED HEALTH BUILDING FACILITIES Dimensional tables: DATE SQUARE ADMIN_UNITS AGE SEX and more Structure

8 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Grid – 100 x 100 m (4 th level of the scale system for communes and urban districts, Bacler), 500 m, 1 km, 5 km Administrative units - part of municipality, municipality, MEA, LAU1, NUTS3 temporal dimension - one day unit, week, month, year day-cycle hours – hour unit, morning time, rush hours age - 5-years basic categories, 10-years, 20-years, “30 and more”. crime (& offences) - standard 3-level classification system facilities - purpose and the hierarchical structure Dimensions and hierarchy

9 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Pivoting Place of commitment X Resid. of offenders OLAP pivoting, selections, relationships Scatter plot, regres.a. Gambling machines X Population

10 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Data grid view

11 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Number of burglaries per 100 flats (2014)

12 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 # burglaries to dwellings, # residential buildings (2014) 3 towns: CB Ceske Budejovice KO Kolin OV Ostrava Differences: density of buildings density of burglaries dependencies

13 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Number of gambling machines per 1km 2

14 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Number of gambling clubs per 100 inhabitants

15 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 # sprayer crimes per 1 school (2014)

16 European Forum for Geography and Statistics 2015 Conference Vienna, Austria, 10 – 12 November 2015 Classification tree for sprayer crimes Dependency – second.schools + regions; no second.schools + gambling m. + districts No dependency – population, buildings, basic schools, property offences

Thank you for your attention! 17 Data is provided by the courtesy of the Czech Statistical Office, Police of the Czech Republic, Czech Office for Surveying, Mapping and Cadaster, Czech Ministry of Finance, Labour offices, Czech Ministry of Health and Municipal Police departments in Ostrava, Kolín and České Budějovice. The research is supported by the research of the Czech Ministry of Interior, project “Geoinformatics as a tool to support integrated activities of safety and emergency units”, No. MV /VZ-2012.