A population grid for Andalusia Year 2013 Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
1. Highlights & main information sources The Institute of Statistics and Cartography of Andalusia (IECA), official body member of the Regional Government of Andalusia, has developed a project to allocate the population of Andalusia A layer of cells has been defined using the Eurostat Grid Generation Tool for ArcGIS. We ran this add-on in the Spatial Analysis module of ArcGIS 10. Our region has been divided generating a grid of cells sized 250 x 250 m We have used as main source of information for population, administrative data coming from Population Register of Andalusia (RPA), dated January 1 st, 2013
1. Highlights & main information sources Population Register of Andalusia (RPA): It accounts for all households and individuals settled in Andalusia. It provides information about postal address and demographic characteristics such as sex, age and nationality of household members. This database is created and maintained by IECA and its data come from administrative files Street & Building Directory of Andalusia: Directory created and maintained by IECA in cooperation with town and city halls, municipalities and province governments. It contains info on street and building entrances coming from National Statistical Institute (Spain) files, cadastre files, etc. Urban Cadastre: Administrative files provided by Cadastre Unit from the Ministry of Finance and Public Administration (Spain) containing cadastre information, related to residential area, building characteristics, road characteristics, etc.
NUTS 2 in Spain
Andalusia, NUTS 2 ES-61
Andalusia is the second largest region of Spain and the most populated one in the country Surface: 87.597 km 2 Population: 8.523.567 inhabitans
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
2.1. Bottom-up approach: the geocoding of buildings main entrances Georeferentiation of population living in buildings is possible through GIS once coordinates are assigned. The linking process informs about the degree of coincidence (reliability of matching) RPA Postal address for each individual and group of individuals Matching techniques Street & building directory Postal address and geographic coordinates of main entrances A layer of points is generated representing buildings main entrances and associated number of inhabitants and their demographic characteristics RPA postal addresses linked to its corresponding main entrance and geographic coordinates
Population Register of Andalusia Street & building directory 7,453,107 inhabitants (87.4% of the population registered in RPA)
2.1. Bottom-up approach: the geocoding of buildings main entrances Next step is to aggregate the information within a cell, i.e. add all points (main entrance and its population) located in a cell. This is a spatial join operation Each main entrance is added within a cell according to its spatial location. Each cell ends up with a total number of inhabitants and main entrances located inside the cell according to the location of the points (main entrances) This procedure assigned 7,453,107 inhabitants (87.4% of the population registered in RPA) to their corresponding cells. A total of 28,806 cells registered population at this stage of the process
An example in the Metropolitan Area of Granada
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Question People in non georeferenced households 87.4% Georeferenced Non georeferenced 68% of the censustracts have nongeoreferenced households 12.6% 12.6% of the population living in households could not be exactly georeferenced via bottom-up approach. Therefore a top-down approach was designed to allocate them spatially to a cell Census-tract level is the maximum territorial disaggregation available for non-georeferenced households Based on statistical techniques and ancillary information, the aim of the top-down approach is to transfer these data (non-georeferenced households) from the census-tracts zoning system to the grid zoning system
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Alternatives To allocate spatially to a cell non georeferenced households from a census-tract Basic option: Assuming population distributes uniformly along the territory (choropleth) Alternative: Modelling settlement patterns for non georeferenced households based on ancilliary information. A descriptive analysis of georeferenced and non georeferenced households shows differences in settlement patterns and spatial correlation
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Cartogram- Percentage of non-georeferenced households
2.2. Top-down approach: the spatial disaggregation of non georeferenced households
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Information sources: Census-tracts layer (National Statistical Institute, Spain) And information processing census-tract/cells In order to transfer non-georeferenced households from the census-tracts zoning system to the grid zoning system a transferring spatial unit was defined: Census-tract/cells Census-tract/cells result from the intersection between the layer of census-tracts and the grid Census-tract/cells are the reference spatial unit for the analysis. The allocation model assigns population from census-tract to census-tract/cells Cells are built afterwards by adding all censustract/cells composing a cell
Census-tracts
Census-tract/cells
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Information sources: - Urban Cadastre layer (Cadastre Unit, Ministry of Finance and Public Administration, Spain) And information processing census-tract/cell/parcels Cadastre territorial delimitation unit is parcel. In order to transfer parcel information to censustract/cell unit a transferring unit was defined: census-tract/cell/parcel Census-tract/cell/parcels result from the intersection between the parcel layer with the census-tract/cell layer Cadastre analysis focused on urban parcels layers containing usage information, residential as well as other usages (residential, commercial, recreational areas versus industrial or construction works areas)
Census-tract/cell/parcels
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Information sources: - Urban Cadastre layer (Cadastre Unit, Ministry of Finance and Public Administration, Spain) And information processing Alphanumeric data Residential usage parcels are key to the project; hence information available in cadastre regarding their attributes & characteristics was explored in detailed Properties information: age, type of road where a property is located, etc. considered relevant as nongeoreferentiation is basically the result of a mismatching of a record linkage algorithm. Information referring to degree of urban consolidation (urban areas recently constructed, areas with high percentage of unusual types of roads, areas with high percentage of warehouses buildings, etc.) might shed light on the factors that hindered the linking process Properties information aggregated by parcel. i.e: all attributes for the properties inside a parcel were added or averaged, depending on the cases
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Variables appearing highly correlated with non georeferentiation incidence Four regression models are built by non georeferentiation intensity group Population Register of Andalusia (RPA) Georeferenced households Georeferenced population by age Georeferenced population by nationality Georeferenced households and population in disseminated areas Urban Cadastre Real estate/immovable properties by type (industrial, residential ) Real estate/immovable properties by type of road where it is located (avenue, street, square, lane, etc.) Real estate/immovable properties year of construction Urban area and urban area containing residential real estate
2.2. Top-down approach: the spatial disaggregation of non georeferenced households Our proposal Iterative algorithm based on linear regressions modelled by non georeferentiation intensity group 1. Modeling non georeferenced households at census-tract level (administrative division) 2. Projection of non georeferenced households at census tract/cell level (dasymetric mapping) 3. Volume constraint, total non georeferenced households at census-tract level (pycnophylactic constraint) 4. Models re estimation at census tract/cell level and process iteration until convergence criteria is attained
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
2.3. Hybrid results The final step is to add to each cell the geo-referenced population in the bottom-up approach (geocoding of the buildings) and the population assigned in the top-down approach (allocated non geo-referenced households) The end result can be seen in the following maps:
Distribution of the population by cells (250 m) in Andalusia
IECA project. Population grid cells sized 250 x 250m for Andalusia 1. Highlights & main information sources 2. Georeferentiation of the population 2.1. Bottom-up approach: the geocoding of buildings main entrances 2.2. Top-down approach: the spatial disaggregation of non georeferenced households 2.3 Hybrid results 3. The dissemination of the results: visualization of the spatial distribution of the population in Andalusia
3. The dissemination of the results Visualization of the spatial distribution of the population in Andalusia http://www.juntadeandalucia.es/institutodeestadisticaycartografia/distribucionpob/index-en.htm
More information: Iria Enrique iria.enrique@juntadeandalucia.es Serafín Ojeda serafin.ojeda.ext@juntadeandalucia.es