David Martin Department of Geography University of Southampton 2001 Census: the emergence of a new geographical framework.



Overview
Background issues
Postcode building blocks
Output areas by automated zone design
–Zone design experiments
–Illustrative results
–Demonstrator project
Application to SAM specification
A new project…
Conclusions

Background issues
1991: enumeration districts (EDs) designed for data collection, but used for both data collection and output
2001: separation of collection and output geographies, giving purpose-specific geographies
New output areas built from synthetic unit postcode polygons
Application of automated zone design (after Openshaw, 1977)

Postcode building blocks
Approx. 1.7m unit postcodes
Aggregation of these small building blocks into output areas (OAs) ensures the best census-postal geography match
No pre-existing polygons (exc. Scotland)
NISRA to digitize, ONS to generate
OS to create separate new product!

Generation of postcode polygons (1)
Thiessen polygons around individual ADDRESS-POINTs, clipped to statutory boundaries and topographic features

Generation of postcode polygons (2)
Boundaries dissolved between adjacent address polygons with a common postcode, to form postcode polygons
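The two generation steps above can be illustrated with a small sketch. It swaps in a raster nearest-neighbour approximation for true vector Thiessen polygons, and it ignores clipping to statutory boundaries and topographic features. The address coordinates and the postcode labels (PC-A, PC-B, PC-C) are invented for illustration only.

```python
from math import hypot

# Hypothetical address points: (x, y, unit postcode label). Invented data.
ADDRESSES = [
    (1.0, 1.0, "PC-A"),
    (2.0, 1.5, "PC-A"),
    (6.0, 1.0, "PC-B"),
    (6.5, 5.0, "PC-B"),
    (1.5, 6.0, "PC-C"),
]

def nearest_address(x, y, addresses):
    """Return the index of the address closest to (x, y).

    The set of locations sharing the same nearest address is that
    address's Thiessen cell.
    """
    return min(
        range(len(addresses)),
        key=lambda i: hypot(addresses[i][0] - x, addresses[i][1] - y),
    )

def postcode_raster(addresses, width, height):
    """Label each unit grid cell with the postcode of its nearest address.

    Labelling cells by address index would give a raster Thiessen
    tessellation; labelling by postcode instead has the effect of
    dissolving boundaries between addresses that share a postcode.
    """
    return [
        [addresses[nearest_address(col + 0.5, row + 0.5, addresses)][2]
         for col in range(width)]
        for row in range(height)
    ]

grid = postcode_raster(ADDRESSES, width=8, height=8)
for row in grid:
    # Print only the final letter of each label to keep the map compact.
    print(" ".join(code[-1] for code in row))
```

In a production setting the same idea would be carried out on vector geometry with a GIS library; the raster form is used here only because it needs nothing beyond the standard library.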

OA design methodology
Automated zoning procedures derived from Openshaw (1977)…
Variety of alternative approaches
Computationally intensive, iterative search for the ‘best’ solution to the zoning problem, given a set of constraints
Not feasible in previous data and computing environments

Output areas by automated zone design
Flow diagram: initial random aggregation of building blocks → iterative recombination, subject to design constraints (contiguity, thresholds, shape, size, homogeneity) → 2001 output areas

OA design (1)
Initial random aggregation of postcodes into potential output areas

OA design (2)
Choose one postcode at random as a candidate for swapping into a different output area

OA design (3)
Make the swap and evaluate the impact on the overall solution

OA design (4)
If the swap does not result in an improvement, go back to the previous configuration

OA design (5)
Choose another postcode at random as a candidate for swapping into another output area

OA design (6)
If the swap results in an overall improvement, keep it as part of the solution and examine a new potential swap…
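The six steps above can be sketched as a simple swap loop. This is a minimal illustration under strong assumptions, not the procedure actually used for the 2001 output areas: postcodes are arranged along a line so that contiguity is trivial to check, the only objective is the size criterion, and the populations in POPS are invented.

```python
import random

# Hypothetical postcode populations, ordered along a street (invented data).
POPS = [21, 35, 18, 40, 27, 22, 31, 19, 44, 25, 30, 23]

def size_score(assignment, pops, target):
    """Sum of squared deviations of zone populations from the target."""
    totals = {}
    for zone, pop in zip(assignment, pops):
        totals[zone] = totals.get(zone, 0) + pop
    return sum((t - target) ** 2 for t in totals.values())

def azp_like_search(pops, n_zones, target, iterations=2000, seed=0):
    """Random swap search: keep a swap only if the score improves."""
    rng = random.Random(seed)
    n = len(pops)
    # Step 1: initial random aggregation (random cut points on the line).
    cuts = sorted(rng.sample(range(1, n), n_zones - 1))
    assignment = [sum(i >= c for c in cuts) for i in range(n)]
    start = best = size_score(assignment, pops, target)
    for _ in range(iterations):
        # Steps 2 and 5: choose a postcode at random as a swap candidate.
        i = rng.randrange(n)
        options = [assignment[j] for j in (i - 1, i + 1)
                   if 0 <= j < n and assignment[j] != assignment[i]]
        # Contiguity: only boundary postcodes may move, and the donor
        # zone must keep at least one postcode.
        if not options or assignment.count(assignment[i]) < 2:
            continue
        previous = assignment[i]
        # Step 3: make the swap and evaluate the overall solution.
        assignment[i] = rng.choice(options)
        score = size_score(assignment, pops, target)
        if score < best:
            best = score               # Step 6: keep the improvement.
        else:
            assignment[i] = previous   # Step 4: revert to the old state.
    return assignment, start, best

zones, start, final = azp_like_search(POPS, n_zones=3, target=sum(POPS) / 3)
print(zones, start, final)
```

Because only improving swaps are accepted, a loop like this can stall in a local optimum; that is one reason the search is described as computationally intensive and why alternative approaches exist.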

Constraints (1)
Contiguity: output areas from adjacent postcodes (NB problem of stacks)
Thresholds: output areas above population thresholds (NB problem of sub-threshold parishes)
Shape: output areas should be as compact as possible: minimize perimeter²/area

Constraints (2)
Size: output areas should be as uniformly sized as possible, avoiding very large and very small populations → minimize ∑(OApop − target)²
Homogeneity: output areas should be as socially uniform as possible → existing ONS tenure-based measure: maximize intra-area correlations
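The shape and size criteria are simple enough to write down directly. The sketch below assumes perimeter and area are in consistent units; the function names and the example target of 125 are illustrative choices, not values from the project.

```python
from math import pi

def compactness(perimeter, area):
    """Shape criterion: perimeter squared over area (lower is more compact)."""
    return perimeter ** 2 / area

def size_penalty(oa_pops, target):
    """Size criterion: sum over output areas of (OApop - target) squared."""
    return sum((p - target) ** 2 for p in oa_pops)

# A circle is the most compact shape: (2*pi*r)^2 / (pi*r^2) = 4*pi.
circle = compactness(2 * pi * 1.0, pi * 1.0 ** 2)
# A unit square scores 16; a long thin rectangle of the same area scores more.
square = compactness(4.0, 1.0)
strip = compactness(2 * (4.0 + 0.25), 4.0 * 0.25)

# Three hypothetical output areas measured against a target of 125 people.
penalty = size_penalty([120, 130, 125], target=125)
print(circle, square, strip, penalty)
```

Both measures are to be minimized, so they can be combined into a single weighted score when several constraints are applied together.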

Intra-area correlation
Measures similarity of values within any area of interest (Holt et al., 1996; Tranmer and Steel, 1998)
Higher correlation: greater homogeneity (theoretical maximum of 1.0)
Can be computed for a single category (e.g. ‘owner occupied’) or for multi-category variables
Tenure and dwelling type tested in project
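As a rough illustration of the idea for a single category, the sketch below uses the standard one-way ANOVA intraclass correlation estimator on 0/1 indicators. This is an assumption made for illustration: the exact estimator in the cited papers, and the ONS tenure-based measure, may differ in detail. The household data are invented.

```python
def intra_area_correlation(areas):
    """One-way ANOVA intraclass correlation for 0/1 indicators.

    `areas` is a list of lists, one inner list per area, holding 1 where
    a household is in the category (e.g. owner occupied) and 0 otherwise.
    """
    k = len(areas)
    sizes = [len(a) for a in areas]
    total_n = sum(sizes)
    grand_mean = sum(sum(a) for a in areas) / total_n
    means = [sum(a) / len(a) for a in areas]
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m in zip(sizes, means))
    ss_within = sum((v - m) ** 2 for a, m in zip(areas, means) for v in a)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (total_n - k)
    # Effective area size, which allows for unequal numbers of households.
    n0 = (total_n - sum(n * n for n in sizes) / total_n) / (k - 1)
    return (ms_between - ms_within) / (ms_between + (n0 - 1) * ms_within)

# Each area is internally uniform: the correlation reaches its maximum.
uniform = intra_area_correlation([[1, 1, 1, 1], [0, 0, 0, 0], [1, 1, 1, 1]])
# Every area has the same mix: no more alike within areas than between.
mixed = intra_area_correlation([[1, 0, 1, 0], [1, 0, 1, 0], [0, 1, 0, 1]])
print(uniform, mixed)
```

The first case gives the theoretical maximum of 1.0 mentioned above; the second gives a slightly negative value, which is the usual behaviour of this estimator when areas are no more homogeneous than a random grouping.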

Zone design experiments
ONS postcode polygons for test areas
Populated with plausible synthetic populations by iterative sampling of SAR individuals (PCs structured by dwelling type)
Test OAs constructed using alternative combinations of design constraints: OApop only; OApop+shape; OApop+homog; OApop+shape+homog

Illustrative results (urban/rural)
          ED     OA        OA(SH)    PC
n
x(s)      308    166(44)   162(54)   21
d(ten)
d(dwe)
          ED     OA        OA(SH)    PC
n
x(s)      427    154(37)   155(42)   17
d(ten)
d(dwe)

1991-style EDs

Unit postcodes

2001-style Output Areas

Project website

Demonstrator data…

Application to SAM specification
Proposal for small area microdata (SAM): more spatial, less attribute detail than SARs
Use wards as building blocks, target SAM areas of 7-10k population
Same procedures as for postcode to OA
Subsequent splitting of ‘superwards’

Hampshire wards
n = 235, mean = 5872, min = 996, max =
Map labels: Portsmouth, Basingstoke, Southampton

Hampshire SAM 5k
n = 176, mean = 8230, min = 5035, max = 15684

Hampshire SAM 15k
n = 66, mean = , min = , max = 51368

A new project…
Problem of matching two sets of areal units:
–1991 ED data for 1981 EDs?
–2001 OA data for 1991 EDs?
Various approaches possible:
–Individual-level data within Census Offices
–Lookup table approximations
–Areal interpolation (various)
Which is the best matching configuration?

A new project: automated zone matching
More general computational problem: given two boundary sets and some target zone characteristics, find the optimal match
Can be conceptualized as a modified automated zoning procedure (AZP) process (iterative, computationally intensive, general-purpose problem)
Automatic tool for use when no lookup tables etc. are available

First boundary set
Take a familiar area: boundary set A, e.g. EDs
Zones: A1, A2, A3, A4

Secondary boundary set
For the same area: boundary set B, e.g. OAs
Zones: B1, B2, B3, B4, B5

Full intersection
Intersect A and B
Clean topology
Resulting pieces: A1B1, A2B1, A2B2, A2B3, A1B4, A3B1, A3B4, A4B5
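The intersection step can be illustrated without any GIS software by treating each zone as a set of small atomic cells. The cell numbers below are invented so that the overlay reproduces the eight pieces listed above; real boundary sets would be intersected as polygons, and the topology cleaning step (e.g. removing slivers) is not modelled here.

```python
# Hypothetical zones expressed as sets of atomic cell IDs (invented layout).
SET_A = {"A1": {0, 1}, "A2": {2, 3, 4, 5}, "A3": {6, 7}, "A4": {8}}
SET_B = {"B1": {0, 2, 6}, "B2": {3}, "B3": {4, 5}, "B4": {1, 7}, "B5": {8}}

def intersect(set_a, set_b):
    """Return every non-empty overlap between a zone of A and a zone of B."""
    pieces = {}
    for a_name, a_cells in set_a.items():
        for b_name, b_cells in set_b.items():
            shared = a_cells & b_cells
            if shared:
                pieces[a_name + b_name] = shared
    return pieces

pieces = intersect(SET_A, SET_B)
print(sorted(pieces))
```

Each piece lies wholly inside exactly one zone of A and one zone of B, which is what makes the pieces usable as building blocks for the matching step.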

Set up automated zone matching
Set up design criteria: equality of population size, area, density, etc.
Adjust weight for ancillary variable
Set one zone as source which must be maintained (e.g. that for which data are available)
Set up initial random aggregation incorporating true matches
Over to (modified) AZP…

Alternative solutions…
Solution 1: perfect match maintaining all zones complete, e.g. creation of census tracts
O1 = A1+A2+A3 = B1+B2+B3+B4
O2 = A4 = B5
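For the perfect-match case there is a direct way to find the smallest areas that keep every zone of both sets whole: link each A zone to each B zone it overlaps and take the connected groups. This is a swapped-in technique (union-find over the overlap list), not the modified AZP described in the slides, and it is shown only to make the example concrete. The overlap list is taken from the intersection pieces named earlier.

```python
# Overlapping zone pairs, read off the intersection pieces (A1B1, A2B1, ...).
OVERLAPS = [
    ("A1", "B1"), ("A2", "B1"), ("A2", "B2"), ("A2", "B3"),
    ("A1", "B4"), ("A3", "B1"), ("A3", "B4"), ("A4", "B5"),
]

def exact_matches(overlaps):
    """Group zones into the smallest areas that split no zone of A or B."""
    parent = {}

    def find(zone):
        """Return the representative of the group containing `zone`."""
        parent.setdefault(zone, zone)
        while parent[zone] != zone:
            parent[zone] = parent[parent[zone]]  # shorten the path
            zone = parent[zone]
        return zone

    for a_zone, b_zone in overlaps:
        # Zones that share any piece must end up in the same output area.
        parent[find(a_zone)] = find(b_zone)

    groups = {}
    for zone in list(parent):
        groups.setdefault(find(zone), set()).add(zone)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])

matches = exact_matches(OVERLAPS)
for number, group in enumerate(matches, start=1):
    print(f"O{number} =", "+".join(group))
```

On this example the grouping reproduces the two output areas of Solution 1. With real boundary sets a single sliver overlap would chain zones together, so slivers need to be cleaned out before a grouping like this is meaningful.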

Alternative solutions…
Solution 2: boundary set B unbroken, closest match to A, e.g. creation of lookup tables, local approximations
O1 = B1 ≈ A1
O2 = B2+B3 ≈ A2
O3 = B4 ≈ A3
O4 = A4 = B5
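A common way to build this kind of lookup table is best-fitting: assign each whole B zone to the A zone that holds the largest share of it. The sketch below assumes population is the weighting variable and uses invented piece populations, chosen so that the result matches the configuration in Solution 2; it is a simple one-pass rule rather than the iterative AZP search.

```python
# Hypothetical population of each intersection piece (invented values).
PIECE_POP = {
    ("A1", "B1"): 120, ("A2", "B1"): 10, ("A3", "B1"): 5,
    ("A1", "B4"): 15, ("A3", "B4"): 140,
    ("A2", "B2"): 90, ("A2", "B3"): 110,
    ("A4", "B5"): 160,
}

def best_fit_lookup(piece_pop):
    """Map each B zone to the A zone containing most of its population."""
    best = {}
    for (a_zone, b_zone), pop in piece_pop.items():
        if b_zone not in best or pop > best[b_zone][1]:
            best[b_zone] = (a_zone, pop)
    return {b_zone: a_zone for b_zone, (a_zone, _) in best.items()}

def group_by_target(lookup):
    """List the B zones that approximate each A zone."""
    grouped = {}
    for b_zone, a_zone in sorted(lookup.items()):
        grouped.setdefault(a_zone, []).append(b_zone)
    return grouped

lookup = best_fit_lookup(PIECE_POP)
approximation = group_by_target(lookup)
for a_zone, b_zones in sorted(approximation.items()):
    print("+".join(b_zones), "≈", a_zone)
```

The population that falls outside the chosen A zone (here the 10 and 5 people in the minor pieces of B1, and the 15 in the minor piece of B4) is the approximation error of the lookup table.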

Conclusions
Major application of a geographical technique developed 20+ years ago
Multiple purpose-specific geographies, generated from existing spatial data
Multiple applications of the same approach:
–Census output areas
–SAM areas
–Generic geography matching

Demonstrator RSS meeting: Nov 2000