Using Research to Inform Geographic Policy Best-fitting from Output Areas to Higher Geographies.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Will 2011 be the last Census of its kind in England and Wales? Roma Chappell, Programme Director Beyond 2011 Office for National Statistics, July 2011.
Healthy Lifestyles Synthetic Estimates Project Shaun Scholes, Kevin Pickering and Claire Deverill.
U.S. Census and American Community Survey Overview Open a web browser and go to:
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
1 CDBG Income Survey Requirements For Grant Administrators.
The future of output geography - geography policy for NeSS Robert Heyward ONS.
State of the Cities Database Philippa Robinson Neighbourhoods, Cities and Regions Communities and Local Government.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds Understanding Population Trends and Processes.
A Language Analysis of Atlantic County, NJ using ACS The following “language analysis” takes a spatial look at the diverse languages spoken by Atlantic.
Population Estimates 2012 Texas State Data Center Conference for Data Users May 22, 2012 Austin, TX.
Developing Geographical Information Systems In A Cohort Study Andy Boyd ALSPAC, Social Medicine University of Bristol.
March 2013 ESSnet DWH - Workshop IV DATA LINKING ASPECTS OF COMBINING DATA INCLUDING OPTIONS FOR VARIOUS HIERARCHIES (S-DWH CONTEXT)
Developments with ONS’ Small Area Population Estimates Project Andy Bates.
Review: What influences confidence intervals?
Correlation and Autocorrelation
Week 17GEOG2750 – Earth Observation and GIS of the Physical Environment1 Lecture 14 Interpolating environmental datasets Outline – creating surfaces from.
Methods of Geographical Perturbation for Disclosure Control Division of Social Statistics And Department of Geography Caroline Young Supervised jointly.
Lecture 4. Interpolating environmental datasets
David Martin Department of Geography University of Southampton 2001 Census: the emergence of a new geographical framework.
RGS-IBG Online CPD course in GIS Analysing Data using WebGIS: The Office of National Statistics Session 3.
The Spatial Scale of Residential Segregation in Northern Ireland, 1991–2001 Chris Lloyd 1, Ian Shuttleworth 1 and David Martin 2 1 School of Geography,
Geographical Data Products Carol Blackwood UKBORDERS 3 rd July 2012.
GEOG3025 Census and administrative data sources 2: Outputs and access.
Determining Sample Size
111 American Community Survey Fundamentals 2009 Population Association of America ACS Workshop April 29, 2009.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Adaptive Kernel Density in Demographic Analysis Richard Lycan Institute on Aging Portland State University.
1 How much tenure mix is there in England? How has this changed ? What are the implications for past and future policy? ENHR conference Toulouse,
Exploring Error and the American Community Survey.
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
An Introduction to The Network for a Healthy California GIS Viewer Welcome to Webinar Anthony Barnes Bhavdeep Sachdev 9:00am to 10:30am.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
1 Statistical Disclosure Control for Communal Establishments in the UK 2011 Census Joe Frend Office for National Statistics.
Coverage assessment and adjustment methodology Owen Abbott Methodology Directorate, ONS.
2011 CENSUS Coverage Assessment – What’s new? OWEN ABBOTT.
Understanding Geographies Statistics for the Voluntary Sector Alison Peacock Mission Planning Officer Wednesday 8 July 2009.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Providing Access to Census- based Interaction Data in the UK: That’s WICID! John Stillwell School of Geography, University of Leeds Leeds, LS2 9JT, United.
GEOG3025 Confidentiality and social implications.
Sustainable rural populations: the case of two National Park areas Alan Marshall Ludi Simpson Cathie Marsh Centre for Census and Survey Research.
ICP Workshop, Tunis Nov. 03 Overview of the Sample Framework.
New and easier ways of working with aggregate data and geographies from UK censuses Justin Hayes UK Data Service Census Support.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Teaching Research Methods: Resources for HE Social Sciences Practitioners Workshop 2: Using Census 2011.
Demographic change at small area level Small area statistics to develop public policy Paul Norman School of Geography, University of Leeds ESRC RES
1 ACS Statistical Issues and Challenges: One-, Three-, and Five-year Period Estimates Alfredo Navarro U.S. Census Bureau Association of Professional Data.
Design of the 2011 Census Coverage Survey Owen Abbott (ONS) James Brown (Institute of Education)
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Characterizing Rural England using GIS Steve Cinderby, Meg Huby, Anne Owen.
Exploiting census workplace data to build a daytime grid map of England and Wales. David Martin, Samantha Cockings, Alan Smith European Forum for Geostatistics,
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds understanding population trends and processes.
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
Census 2011 – A Question of Confidentiality Statistical Disclosure control for the 2011 Census Carole Abrahams ONS Methodology BSPS – York, September 2011.
Research Team Sport England 1 Taking forward your segmentation profile Other ways to geographically analyse the segmentation data in your area.
The complexities of publishing gridded data for the UK European Forum for Geostatistics Krakow – October 2014 Ian Coady Geography Policy and Research Manager.
South West Public Health Observatory Using GIS in Public Health Luke Hounsome South-West Public Health Observatory.
Business case development and benefits quantification Neil Townsend 25 November 2012.
Developments with ONS’s Small Area Population Estimates Project Andy Bates, Office for National Statistics.
Or boundary data available on Nottingham Insight
Travelling to School.
Geographies of Poverty:
Geographic Definitions and Linking
England and Wales Grid Map
Expert Expert Group Meeting on Statistical Methodology for Delineating Cities and Rural Areas Iven M. Sikanyiti 28th-30th January 2019 United Nations:
Rebasing and Linking of National Accounts
UK Labour Market Areas and Rural Urban Characteristics
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Using Research to Inform Geographic Policy Best-fitting from Output Areas to Higher Geographies

Introduction The Geography Policy for National Statistics sets out the methodology for calculating areal statistics – “best-fitting”. Methodology was adopted for 2011 Census, but for small areas, this approach is not always as accurate as previous estimates. The results of ONS Geography research on these cases will feed into a review of GSS geographic policy.

Context ONS has a legal duty under the Statistics and Registration Service Act (2007) not to disclose “information which relates to and identifies a particular person”. Output Areas were designed for the 2001 Census with this requirement in mind.

Census Output Areas (OAs) Designed to minimise within-area and maximise between-area socio- economic variations. Individual OAs, and aggregations of complete OAs, are non-disclosive. For 2011 Census, revision of OAs was kept to a minimum. Only 2.6% of 2001 OAs were changed. Over 181,000 cover England and Wales.

Census Output Areas (OAs) Designed to minimise within-area and maximise between-area socio- economic variations. Individual OAs, and aggregations of complete OAs, are non-disclosive. For 2011 Census, revision of OAs was kept to a minimum. Only 2.6% of 2001 OAs were changed. Over 181,000 cover England and Wales. South Brent, Devon (pop. 3,000)

Creation of Output Areas Formed from groups of Thiessen polygons generated from households and snapped to residential postcode unit centroids. Target of 125 households per OA.

Census Households and Unit Postcode Centroids

Household/ Unit Postcode Thiessen polygon building blocks

Census Output Areas from building blocks

Exact estimates Where statistics for an area are derived directly from the Census households located within them. Exact estimates were produced in 2001 for all geographies. Exact estimates can produce slivers when incompatible boundaries (here, parishes) are overlaid. Small cell adjustment was applied to the dissatisfaction of users OAs

Exact estimates Even before 2001 Census published, demand expressed for release on future boundaries. We needed a methodology that can accommodate this demand OAs

Statistical building blocks Statistical outputs should be constructed from the smallest geographical areas for which data are available. For Census univariate data, these building-blocks are OA; for Census multivariate data, LSOA. Allows consistent and comparable statistics, even if the geography changes.

Best-fitting to higher geographies For ‘higher’geographies, Census statistics are ‘best- fitted’ from the OA or LSOA pop-weighted centroids that a target area contains. For geographies made up of aggregations of complete OAs or LSOAs (e.g. MSOA / LAD) this equates to exact fit. The National Statistics Postcode Lookup (NSPL) provides a pre-built best-fitting tool.

Output Area PWCs from Census Households (median position)

OAs and OAPWCs

OA and higher geography e.g. LSOA

The Geography Policy for National Statistics allows for exceptions to be requested from the Statistical Policy Committee. The irregular boundaries and very small populations of National Parks makes them unsuitable for best-fitting as very few OAPWCs fall within them. Statistics for National Parks are therefore exact-fitted as an exception to the policy Exception - National Parks

Usually, whatever small populations located within them are associated with OAPWCs that fall outside the park.. Statistics for National Parks are therefore always under-counted if best fit. Exception - National Parks

Best-fitting to higher geographies Otherwise, best-fitting using the OAPWC contained within the target geography works very well. Statistics are non-disclosive, on a consistent, reliable and relatively simple basis. Research by Ralphs (2011) determined that best fitting is a dependable methodology and it was adopted as fundamental to the policy. Suitable also for non-nesting geographies. Usually…

OA, OAPWC and non-nesting geography

Non-nesting Geography unit without OAPWC

Statistics for areas without OAPWCs? Because we cannot release sub-OA statistics, no data may be published for these areas. But there is demand.

Statistics for areas without OAPWCs? One option: Best-fit, to the target area, the statistics for the OA pop-weighted centroid (OAPWC) which is nearest to the target area’s geometric centroid and which is within the same LAD.

One option: Best-fit, to the target area, the statistics for the OA pop-weighted centroid (OAPWC) which is nearest to the target area’s geometric centroid and which is within the same LAD. Statistics for areas without OAPWCs?

Best-fitting using adjacent OAPWC This does produce non-disclosive statistics for the target area. But: Statistics are indicative rather than precise; Statistics for the nearby OAPWC serve for two or more areas; Statistics cannot be aggregated up to compile statistics for higher areas.

The ‘areas with a small population’ problem There are a number of administrative and statistical geographies that can in principle be smaller than an Output Area Parishes; Wards; Built-up areas (BUA); The problem is also clustered around particular areas such as the Isles of Scilly and the City of London. Civil parishes are a particularly good geography for the research due to the large number that are smaller than an Output Area

There is a real demand for parish data There is a very real demand for parish data, demonstrated by the fact that they are one of the most popular geographies on the Neighbourhood Statistics site. Many parishes in England are very recent creations driven by the government’s localism agenda. For 2011 Census, some of the previous parish level outputs were removed creating a greater requirement for accuracy on the parish outputs that were produced. Yet we can not currently provide statistics for parishes that do not contain an OAPWC.

Parish area varies enormously Chester Castle (0.04 km 2) Stanhope, County Durham (255km 2 ) Over 6,000x larger. 28

Parish population varies enormously Some old parishes are completely uninhabited; more recent creations are home to up to 75,000 people. By contrast, OA population falls within strict thresholds (100 – 625).

Coverage The majority of the population of England live in “Unparished Areas”. All of Wales is covered by “communities”, the local equivalent to parishes. 30

Outline methodology Fictitious data as Census 2011 data proxy linked to postcodes as household proxy. One copy aggregated to OA pop-weighted centroids as “model” (best fit) data. A second copy aggregated to parishes as “true” (exact fit) data. Stats for “model” and “true” data compared.

The datapoint ratio and model power The “datapoint ratio” is the ratio of the number of postcodes in the OA that would be supplying “model” data to the number of postcodes in the parish that would be receiving those data).

The datapoint ratio and model power Where the “datapoint ratio” was close to 1:1, correlations between the “model” statistics for the external OAPWC and the equivalent “true” statistics for the receiving parish were strong. Elsewhere, correlations were generally weak and raw error and percentage errors were high. For parishes without an OAPWC, the datapoint ratio was always less than 1:1 and the mean datapoint ratio was 0.3 : 1.

Conclusions - parishes At least eleven possible types of spatial relationship between parishes and OAs. Most drive the datapoint ratio away from 1:1. No structural link between parishes and the output area hierarchy of geographies. Estimating statistics for a parish on the basis of adjacent OAPWCs unlikely to succeed. Statistics for the 1,142 parishes without OAPWCs are on average likely to be over-estimated.

Conclusions – more general Using an external OAPWC to estimate statistics for an area that does not contain an OAPWC is only likely to produce statistics close to the ‘truth’ where the following conditions apply: The target geography is similar in number of postcodes/ households to the supplying geography and The target geography is structurally similar to the OA/LSOA/MSOA geography. The pilot project was successful in classifying the types of geography and scenarios that make best-fitting via adjacent OAPWCs problematic. The procedure appears to be especially weak for count/sum variables.

What’s next? We are about to embark upon the project proper, with dedicated high-performance hardware and household-level Census data that will provide a finer degree of granularity and realism to the results. This will allow us to develop measures to mitigate the problems identified, and this will assist in our aim to publish robust and non-disclosive estimates for the small geographies that users are most at home with. The research will feed into a review of the Geography Policy for National Statistics planned for

Links: GEOGRAPHY POLICY FOR NATIONAL STATISTICS national-statistics.pdf Best-fit Policy: Coady: An overview of best-fitting: Building 2011 Census estimates from Output Areas: fitting.pdf fitting.pdf Ralphs: Exploring the performance of best fitting to provide ONS data for non standard geographical areas of-best-fitting-to-produce-ons-data-for-non--standard-geographical-areas.pdf Contact ONS Geography at: Access Open Geography products at: Access as Linked Data at: Follow ONS Geography at