How to Rezone. A method for redistributing census populations to create bespoke geographies

Similar documents
Module 8. Use of Double Geography datasets within SASPAC

Mapping Census data in QGIS

Module 3a. Selection of Areas - by Text Listing

Module 1. Selection of Variables

User Guidance Note GE-04

Converting census data into GIS data

1. Open the New American FactFinder using this link:

Appendix Interaction Data: Classroom Activities

How to calculate population and jobs within ½ mile radius of site

Module 3b. Selection of Areas - Using Mapping

Quick Guide to American FactFinder

Information for Bulk Users

Exercise 4.1. Create New Variables in a Shapefile. GIS Techniques for Monitoring and Evaluation of HIV/AIDS and Related Programs

Basic Tasks in ArcGIS 10.3.x

Census Small Area Population Statistics (SAPS) A Guide to using the SAPMAP web application

Quantitative Mapping Using Census Data

Analysing crime data in Maps for Office and ArcGIS Online

Downloading 2010 Census Data

Lesson 8 : How to Create a Distance from a Water Layer

GateKeeper Mapping Creating Zone Layers & Utilising the Grid Generator GateKeeper Version 3.5 February 2015

ClubSelect Accounts Receivable Meeting Member Needs with Special Services

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University

GEOG4017 Geographical Information Systems Lab 3 Data Collection and Integration (I)

ESRI stylesheet selects a subset of the entire body of the metadata and presents it as if it was in a tabbed dialog.

US Geo-Explorer User s Guide. Web:

1. Use the Add Data button to add each of the datasets you wish to convert to the map document.

User s Manual. Version

Using Spatial Data in a Desktop GIS; QGIS 2.8 Practical 2

GIS Basics for Urban Studies

Job Aid File Management

Using the SABINS Data Finder

Introduction to GIS & Mapping: ArcGIS Desktop

Printing Batch Unofficial Transcripts

Georeferencing and Digitizing

IPEDS Human Resources Survey

SilvAssist 3.5 Instruction Manual Instruction Manual for the SilvAssist Toolbar For ArcGIS. Version 3.5

v CMS-Wave Analysis SMS 12.2 Tutorial Prerequisites Requirements Time Objectives

Mapping 2001 Census Data Using ArcView 3.3

1.1 ing an Individual Contact or Selected Contacts (Non Synchronous)

How to Get Census Data from E-STAT

Creating Mailing Labels (2010)

Working with 2000 Census Data in ArcGIS: Using the US Census Bureau Web Site for SF1 and SF3 Data

User Manual.

Project Zip Code. Version CUNA s Powerful Grassroots Program. User Manual. Copyright 2018, All Rights Reserved

Working with International Census Data in GIS

Question: How do I move my mobile account from the Corporate to my Personal Account?

The Timetabler. Export V8.1 Export to VASS V E R S I O N 8.1

Data Visualisation with Google Fusion Tables

Help Guide Version 1.2.1

How to Use the Cancer-Rates.Info/NJ

Navigate to Cognos Cognos Analytics supports all browsers with the exception of Microsoft Edge.

Geocoding Reference USA data in ArcMap 9.3

Oracle Sourcing Support Helpdesk: Telephone: > Option

USING 2001 CENSUS DATA IN ArcMap 8.2

Simple sets of data can be expressed in a simple table, much like a

Select the Parks within Forest Acres

OneView. User s Guide

Getting Started with MapInfo Professional Hands On Session 1D

Downloading shapefiles and using essential ArcMap tools

v TUFLOW-2D Hydrodynamics SMS Tutorials Time minutes Prerequisites Overview Tutorial

Tutorial: Getting Census Data from Geolytics

Entering and Managing Data in EvaluationWeb for CDC Program Announcement PS

User Guide. Web Intelligence Rich Client. Business Objects 4.1

Broker ASSESS administrator s guide

CMBI. Australian Census Power Bi Sample. Business intelligence consultancy CMBI Business Intelligence Services. Power BI Samples.

HOWTO Make an Initial Connection to the Office Anywhere Desktop using a Windows 7 based PC v1

Chapter 7. Joining Maps to Other Datasets in QGIS

BIO8014 GIS & Remote Sensing Practical Series. Practical 1: Introduction to ArcGIS Desktop

From Non-Spatial Data to Spatial Data. Geocoding & Georeferencing in ArcGIS

Market Insight Wizards & Virtual Variables Training Manual v1.0

Massachusetts Institute of Technology Department of Urban Studies and Planning

MicroStrategy Analytics Desktop

Creating a Microdata Extract

LloydsLink Commercial Banking Online (CBO) Migration Tool User Guide

Microsoft Access Database How to Import/Link Data

Objectives Learn how to work with projections in GMS, and how to combine data from different coordinate systems into the same GMS project.

Building Self-Service BI Solutions with Power Query. Written By: Devin

ArcGIS Online (AGOL) Quick Start Guide Fall 2018

Joining data from an Excel spreadsheet

More Skills 11 Export Queries to Other File Formats

Visual Analytics User s Guide

Online Membership System Renewals Summary Guide

Manually Uploading Pupils to ParentPay

Session 3: Cartography in ArcGIS. Mapping population data

Highway Performance Monitoring System

GeoLytics. User Guide. Online Demographic Database

Entering and Managing Data in EvaluationWeb for CDC Program Announcement PS

ADD/EDIT A JOURNAL ENTRY

How to Import More Than 256 Columns of Data into a GIS

How to Export a Report in Cognos Analytics

Electronic Owner s Manual User Guide

Making Yield Contour Maps Using John Deere Data

Lab #7: Mapping US Census Data

Geographical Inequalities, Spatial Scale and Small Area Statistics for England and Wales

How to Mail Merge a file with Microsoft Word 2003

Basic facts about Dudley

ENERGY COMPANY OBLIGATION (ECO) ELIGIBILITY SYSTEM

Market Intelligence Portal. User Guidance

Tutorial 3 - Performing a Change-Point Analysis in Excel

Working with Census Data Excel 2013

Transcription:

How to Rezone A method for redistributing census populations to create bespoke geographies January 2014

Contents Contents... 2 Introduction... 3 Limitations of this method... 4 Extract the zip file... 6 Part 1: Select Postcodes... 7 Part 2: Create Gazetteer File... 9 Part 3: Rezone Census tables using Gazetteer File... 11 Notes... 12

Introduction Census data are published by the Office for National Statistics (ONS) in England & Wales and National Records for Scotland (NRS) in Scotland. These data are published at a number of geographies from the national level right down to output area. Output areas themselves are built from postcodes and typically contain around 125 households in England and Wales but are somewhat smaller in Scotland. Rezoning is necessary when data are required for non-standard geographies; areas for which ONS/NRS do not publish data. In some cases these areas can be created quite simply by aggregating existing geographies. In other cases the boundary of the area in question does not align with any existing geography. In these cases an output area, may sit partly within and partly without the new zone and decisions must be made of how best to distribute data to the new area. There are a number of ways to approach this task and this document outlines a method using postcode population estimates. Methodology Outline The first stage of the process is to identify those postcodes which fall within the new zone. This is achieved using GIS (Geographic Information System) and the result should be a list of postcodes in CSV format. The postcode CSV file is used in SASPAC along with a specially designed System File to produce a list of Output Areas and the proportion of the population of each that falls within the new zone (a Gazetteer file). The Gazetteer file is then used in SASPAC to rezone the census data so that statistics for the new zone are available. Process Select postcodes in new area Create a gazeteer file in SASPAC Rezone Census tables

Limitations of this method Inaccurate allocation of populations This method distributes all variables in the same way: An output area that which is found to lie half inside the new zone and half outside will have each variable halved. The issue here is that variables are likely to be distributed in a variety of ways within an output area and so a characteristic may be attributed to the new zone when no one with that characteristic resides within the boundary. An example of this is as follows. An output area contains 300 people. It is bisected by the boundary of a new zone so that half of the population is within the new zone and half is outside. Of the total population 200 are female and 100 are male. All of the men live in the part of the output area within the new zone while the female population is split so that 50 live within the new zone and 150 live outside. The results of the rezone are shown below alongside the actual population of the area. Actual population of Rezoned population area of area Males 100 50 Females 50 100 Because the rezone distributes half of the population of each variable the result is inaccurate when compared to the actual population of the area. Inconsistent Totals A second issue is inconsistencies in totals which arises from the rounding of data in SASPAC and so is not specific to this method of rezoning. Census data deals in whole integers, each number representing a member of the population or a household. This means that it is necessary to round, both up and down, when a rezoning task takes place. As each number is rezoned and rounded in isolation of all others this can mean that variables within a table no longer add up to the totals they should. An example is an output area that has 99 males and 201 females giving a total population of 300. Again, the output area is divided into two. The total population in the new zone is 150. The number of men calculated is 49.5 which is rounded to 50. The calculated number of women is 100.5, rounded to 101. As a result the calculation males plus females equals 151, not 150. This same process is shown in the table below. Total population Half of population Rezoned population (rounded) Males 99 49.5 50 Females 201 100.5 101 Total 300 150 150

Mitigation There are two factors which mitigate this. Firstly, inaccuracies are unlikely to be as significant as those shown in the above example, populations will tend to be distributed more evenly within an output area than is the case in this example. Secondly, the original base census data is not intended to be an accurate count of a population but rather an estimate of the characteristics. The underlying uncertainty in the estimates means that problems of distribution are less significant that they may appear. The only way to truly mitigate this issue would be to rezone each variable independently but this is not possible with the information available and would be massively time consuming to attempt if the data were available. Note on Detailed Characteristics This method uses Output Areas as its base unit. Detailed Characteristics data are not available at output area level and so rezoning using this method is not possible. It would therefore be necessary to use MSOA data rather than OA. The MSOA geography is relatively large (containing an average of 7,200 people) and so issues of accuracy and data distribution would be magnified. As a result the rezoning of Detailed Characteristics using this method is not recommended.

Download Rezone.zip If you have not already downloaded the Rezone.zip file click here: http://data.london.gov.uk/datastorefiles/datafiles/demographics/2011-census/rezone.zip The rezone.zip file (60Mb) file contains the files and folders that you will need to undertake the rezoning task. If clicking the link fails to download the zip file copy the link into your web browser directly. Extract the zip file Next, extract the contents of Rezone.zip. 1. Right click on the file Rezone.zip 2. Click Extract All 3. Extract the file to C:\SASPAC 4. Click Next Note: This is necessary to ensure that the file paths referred to in these instructions are correct. 5. A folder named Rezone has been created in your SASPAC folder. 6. The following folders are now in the Rezone folder: Shape files Tab files Scaling Factors

Part 1: Select Postcodes The first stage of the process is to identify the postcodes that sit within the new zone using GIS. This process requires the new zone to have already been mapped as a boundary file in your GIS software. Using ESRI Arc GIS 1. Open Arc GIS and a blank map 2. Add the boundary of the new zone to the table of contents 3. Add the postcode point data for your region. (C:\SASPAC\Rezone\Shape Files\[region name].shp) NOTE: It is necessary to use the Shape files provided as the zone identifiers in these shape files are consistent with those in the SASPAC system file that will be used in stage 2 of this process. For further information see end note on postcodes. i 4. Select those postcodes that lie within the new zone boundary a. Go to Selection b. Select by location c. Selection method: select features from d. Target layer: [region name] i.e. postcode data e. Source layer: [name of your shape file] f. Spatial selection method: Target layer(s) features are within the Source layer feature g. OK 5. Export the selected postcodes to a CSV file a. Right click on the postcode layer in the Table of Contents b. Open Attribute Table c. From the dropdown list in the top left corner choose Export d. For the Export dropdown list choose: Selected records e. Click on the browse button f. Change the file type to Text file g. Navigate to C:\SASPAC\Rezone\ h. Name the file Postcodes for Rezone.CSV i. Click Save j. Do you want to add the new table to the current map?: Click No 6. Edit the CSV file a. Open the CSV file in Excel b. Delete columns A C and D c. Delete the first row d. Save the file and exit Excel.

Using MapInfo 1. Open MapInfo and open the.tab file for the new zone in a map window using File Open... 2. Go to File Open and open the postcode point data for your region in the Current Mapper window. (C:\SASPAC\Rezone\Tab Files\[region name].tab) NOTE: It is necessary to use the Tab files provided as the zone identifiers in these files are consistent with those in the SASPAC system file that will be used in stage 2 of this process. For further information see end note on postcodes. ii 3. Select those postcodes that lie within the new zone boundary a. Go to Query SQL Select b. In the from tables window add the two tables currently open from the Tables drop-down box (making sure to select the region table first and the new zone second) c. This should automatically create the statement [Region name].obj Within [New zone].obj d. In the Select Columns window add the [Region].Postcode column from the Columns dropdown box. e. Now select OK (with the Browse Results tick box checked) 4. Export the selected postcodes to a CSV file a. A new query window will appear and present a list of all the unit postcodes contained within the new zone b. Select Table Export from the main menu bar c. Select the name of the query ( Query[X] ) just undertaken from the list and Export d. Browse to C:\SASPAC\Rezone\ e. Name the file Postcodes for Rezone and save in a.csv format f. Click Save g. Leave the File Character Set to Windows US & W.Europe ( ANSI ) and do not check the Use First Line for Column Titles h. Select OK

Part 2: Create Gazetteer File The second stage of the process is to use the list of postcodes that has just been output from GIS to create a Gazetteer File. This is done in SASPAC. 1. Open a new Export CSV task in SASPAC 2. Input the scaling factors System File: C:\SASPAC\Rezone\scaling factors\scaling FACTORS.sys 3. Click Select Areas and choose the Use Mapshore tab a. Click Open Zoneslist File b. Select C:\SASPAC\Rezone\Postcodes_for_Rezone.CSV c. Choose Add All and then Include Selected Zones d. Close the Select Areas window Note: A maximum of 4,000 zones can be input this way. 4. Choose to Output the file to C:\SASPAC\Rezone\Gazetteer file.csv 5. In the Selected Variable Name(s) box type: zlabel sfactor 6. Click OK and when prompted to include zoneid choose No, then run the task. Note: Depending on the number of zones that you have input this process may take some time 7. Click view in Excel, delete the first row 8. Insert a column at the beginning so that your data starts in B1 9. In cell A1 type a zone id for your new area Note: Zone ids must be alpha-numeric and contain no spaces 10. Copy the zone id down all of the used rows in the CSV file 11. Save the file and close Excel Edit the file path 1. Navigate to C:\SASPAC\Rezone\ in Windows Explorer 2. Change the file extension of the Gazetteer file from csv to gaz: Gazetteer file.csv to Gazetteer file.gaz

In SASPAC 1. Go to File and Open and choose Open Gazetteer File 2. Navigate to C:\SASPAC\Rezone\Gazetteer file.gaz 3. Press Enter twice to create two blank rows 4. Return to the first row and type your zone id followed by a space and then a name for your new area. 5. Save the file and close

Part 3: Rezone Census tables using Gazetteer File This stage of the process must be re-run for each dataset that is being rezoned (Key Statistics, Quick Statistics or Local Characteristics) this example uses Key Statistics. 1. Open a new Create New Zones task in SASPAC (File > New Task > Create New Zones ) 2. Choose the dataset you wish to rezone by inputting a System File (in this case Key Statistics). The geographic level of the System File MUST be output areas. 3. Click Output to and navigate to c:\saspac\rezone\ 4. Save your file as 2011 Key Statistics for [name of zone].sys 5. Click save 6. In the System File Label type: a. KS2011 (For Quick Statistics type QS2011, for Local characteristics type LC2011) 7. Click on Gazetteer File: Input from a. Navigate to c:\saspac\rezone\ b. Choose Gazetteer file.gaz and open Note: The following values are dependent on the length of your zone id. For example if you gave your zone the identifier area1, then your id would be 5 characters long (x=5). c. Input the following values in the boxes provided Existing Zone (x+2) to (x+10) New Zone 1 to x Scale Factor x+12 to (x+21) d. Tick the With Labels box e. Save and Close 8. Click OK and run the task Note: If you are rezoning Key Statistics the task will complete with warnings because SASPAC will have recalculated the percentages in these tables. 9. A new System file(s) containing Census tables for your new zone(s) has now been created ready for data extraction.

Notes i UK postcodes contain up to seven alpha-numeric characters. Some also contain embedded spaces. In order to use postcodes as zone identifiers in SASPAC these spaces must be removed. In both the GIS Shape files and the SASPAC system files the spaces have been replaced with the character x. For example Postcode LE3 0JE E3 2PP NW10 5SN Becomes LE3x0JE E3xx2PP NW105SN In addition, some postcodes were split by ONS when creating the output areas. As a result these postcode need to have multiple entries in the system file. However, each zone must have a unique zone identifier and so where multiples exist they are identified using xb, xc, xd, etc. For example postcode E1 6PU is split five times and so appears in the GIS Shape file and SASPAC System file five times as: E1xx6PU E1xx6PUxb E1xx6PUxc E1xx6PUxd E1xx6PUxe More Information For assistance please contact the SASPAC helpdesk: info@saspac.org 0207 983 5523 ii UK postcodes contain up to seven alpha-numeric characters. Some also contain embedded spaces. In order to use postcodes as zone identifiers in SASPAC these spaces must be removed. In both the GIS Shape files and the SASPAC system files the spaces have been replaced with the character x. For example Postcode LE3 0JE E3 2PP NW10 5SN Becomes LE3x0JE E3xx2PP NW105SN In addition, some postcodes were split by ONS when creating the output areas. As a result these postcode need to have multiple entries in the system file. However, each zone must have a unique zone identifier and so where multiples exist they are identified using xb, xc, xd, etc. For example postcode E1 6PU is split five times and so appears in the GIS Shape file and SASPAC System file five times as: E1xx6PU E1xx6PUxb

E1xx6PUxc E1xx6PUxd E1xx6PUxe More Information For assistance please contact the SASPAC helpdesk: info@saspac.org 0207 983 5523