Census-ACS Master Index Files

Similar documents
Redistricting Data Census P.L Summary File data from the U.S. Census Bureau. NCSL Legislative Summit 2011 San Antonio, TX 08/08/2011

How to use Microsoft Access to extract data from the 2010 Census P.L Summary Files

POLIDATA Political Data Analysis DATABASE DEVELOPMENT, ANALYSIS AND PUBLICATION; POLITICAL AND CENSUS DATA; LITIGATION SUPPORT

Using NHGIS: An Introduction

ACS Summary File Technical Documentation

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University

Downloading Census Data from American Factfinder2 for use in ArcGIS

Using PostGIS and QGIS for the Management and Geovisualization of American Community Survey Data

Downloading Census Data from American Factfinder for use in ArcGIS

Using American FactFinder

Access Data with Census APIs

Introduction to Social Explorer

Working with 2000 Census Data in ArcGIS: Using the US Census Bureau Web Site for SF1 and SF3 Data

Exercise 13. Accessing Census 2000 PUMS Data

1. Open the New American FactFinder using this link:

Lab #7: Mapping US Census Data

The New American FactFinder N E W F U N C T I O N A L I T Y B U T S O M E Q U I R K S

Using the SABINS Data Finder

Introduction to IPUMS

Downloading 2010 Census Data


Note: you must explicitly follow these instructions to avoid problems

The NEW American FactFinder Classroom Training

GeoLytics. User Guide. Online Demographic Database

User Manual Mail Merge

Exercise 16. Downloading Raw Census Data

WOMEN'S INTERAGENCY HIV STUDY SECTION 40: GEOCODING PROTOCOL

Georeferencing Tutorials and Recommended Practices

USA Census Tutorial Downloading & Mapping American Factfinder Census Data for use in ArcMap

Package censusapi. August 19, 2018

How to calculate population and jobs within ½ mile radius of site

Quick Guide to American FactFinder

Note: Use Firefox NOT Internet Explorer for this lab. The U.S. Census website and Internet Explorer do not get along well.

UNIT 4. Research Methods in Business

-In windows explorer navigate to your Exercise_4 folder and right-click the DEC_10_SF1_P1.csv file and choose Open With > Notepad.

HEALTH AND RETIREMENT STUDY. Cross-Wave Geographic Information (State) Restricted Data Data Description and Usage

Quantitative Mapping Using Census Data

Google Fusion Tables Tutorial: Combining NATA and US Census Geospatial Data

A Disclosure Avoidance Research Agenda

DRAFT THE RESILIENCE TO DISASTERS AND EMERGENCIES INDEX (REDI) Constantine E. Kontokosta, PhD, PE, AICP, LEED AP NYU CUSP & NYU Poly & NYU Schack

Creating a Microdata Extract

SAS Web Report Studio 3.1

Chapter 7. Joining Maps to Other Datasets in QGIS

Electric Vehicle Infrastructure Location Identification Tool and Visualization Map. User Guide 1 INTRODUCTION

Data Entry, Processing and Management

Evaluating the Effectiveness of Using an Additional Mailing Piece in the American Community Survey 1

Store Mapper User Guide

Steve Graves Forensic Geography Lab: Joining Data from Census to GIS. Background

Len Preston Chief, Labor Market Information New Jersey Department of Labor & Workforce Development

Cornell University Program on. Mission Statement

Survey Questions and Methodology

US Geo-Explorer User s Guide. Web:

CompetitiveAccess Training Guide

October HMDA Tools Instructions

Package qualmap. R topics documented: September 12, Type Package

0.1 Stata Program 50 /********-*********-*********-*********-*********-*********-*********/ 31 /* Obtain Data - Populate Source Folder */

GIS Basics for Urban Studies

Overview of SAS/GIS Software

Welcome to Cole On-line Help system!

Survey Questions and Methodology

Use mail merge to create and print letters and other documents

QGIS LAB SERIES GST 102: Spatial Analysis Lab 2: Introduction to Geospatial Analysis

Getting Started with ArcGIS Background and Introduction

RESOLV EDI CONTROL. User Guide Version 9.2 for HANA PRESENTED BY ACHIEVE IT SOLUTIONS

U.S. Census Bureau Spatial Data Storage and Topology in the Redesigned MAF/TIGER System Jennifer DeVore and Danielle Lessard March 10, 2005

Geodatabases. Dr. Zhang SPRING 2016 GISC /03/2016

Creating an Area-level Extract

Cause/reason (if currently known)

Beyond 20/20. QuickStart Guide. Version 7.0, SP3

How to Implement the One-Time Methodology Mark Tabladillo, Ph.D., MarkTab Consulting, Atlanta, GA Associate Faculty, University of Phoenix

Steps and Strokes. Please go to the Census Bureau home page:

C H R I S. Children s Registry and Information System Data Facilitator Manual. Technical Support Information

Downloading shapefiles and using essential ArcMap tools

Project Zip Code. Version CUNA s Powerful Grassroots Program. User Manual. Copyright 2018, All Rights Reserved

JANUARY Conversion Software User Guide for UNIX. Version 2.0

Smartphone Ownership 2013 Update

Working with International Census Data in GIS

Accessing QWIs: LED Extraction Tool and QWI Explorer

MERIDIANSOUNDINGBOARD.COM TERMS AND CONDITIONS

Introduction to GIS & Mapping: ArcGIS Desktop

Converting census data into GIS data

Mapping Tabular Data

Bronx Bus Stops, New York NY, May 2016

emerge Help Document Table of Contents

emerge Help Document Table of Contents

The Rise of the Connected Viewer

Public Use Microdata Samples

DELIVERABLE D12.6/12.7 IECM Database Extension & User Interface with Tabulator

How to Get Census Data from E-STAT

Des Moines Area Regional Transit Non Rider Survey

Real Estate Sales, New York NY, 2014

The goal of this task is to assemble the input data required to deliver national accessibility reports and

Mapping Tabular Data Display XY points from csv

Title: Extracting Custom Cross-tabs for Travel Demand Modeling Data with the PUMS 2000 DVD-ROM Beyond 20/20 Software

NAACCR Geocoding Tutorial

Bulk download in ILOSTAT:

Public Use Microdata Samples

LAB 1: Introduction to ArcGIS 8

Overview. CRA Wiz and Fair Lending Wiz 6.7 Release Notes

A population grid for Andalusia Year Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013

Transcription:

Census-ACS Master Index Files Documentation and User s Guide VERSION 1.1 March 1, 2013 Created by Michael J. Greenwald, Ph.D., AICP With assistance from Jonathan Brooks, Texas A&M Transportation Institute

Table of Contents Part 1. User Agreement... 1 Part 2. Introduction... 2 Why create master index files?... 2 How do the Master Index Files work?... 2 Where is the original data from?... 3 How do I differentiate directories, files, and variables in this report?... 3 What if I have more questions?... 3 Part 3. Census and ACS Database Storage Structures... 4 Overview of 2010 Decennial Census SF1 Data Structure... 4 Overview of ACS 5 Year Estimate Data Structure... 5 Technical Differences in Record Organization: Census SF1 and ACS... 9 Part 4. Master Index Files: FTP Site Structure, Table Descriptions, and Variable Definitions. 10 FTP Site Structure and Contents... 10 Variable Definitions for Master Index Files... 11 Master Index Files: Variable Definitions for Block Geography Files... 11 Master Index Files: Variable Definitions for Block Group Geography Files... 12 Master Index Files: Variable Definitions for Tract Geography Files... 13 Part 5. Case Study Application and How-To... 14 Poverty in Texas: Using Master Index File to Join ACS Data to Tracts... 14 Task to Complete... 14 Basic Steps to Analysis... 14 Main Problem... 14 Potential Solution: Master Index File for Texas Tracts... 15 Case Study Application of Solution... 15 Conclusions and Observations... 16 Note about Importing Raw ACS Data into ArcGIS... 17 Problem? Question?... 17

Part 1. User Agreement In exchange for providing these data archives in the public domain, the user understands and agrees that: 1. They use these data sets at their own risk. While all reasonable efforts have been made to ensure the validity of these index file data sets, no warranty of any kind is either expressed or implied. User is strongly encouraged to perform their own data validity checks to ensure the index files are returning reasonable results; 2. User s rights are non-exclusive. Use of these archives by other individuals or organizations in no way infringes on the rights of the user. Neither does the user have any cause to interfere with the use of these archives by other individuals or organizations; 3. For-profit redistribution of this agreement, the technical documentation and any of the data archives, whether in full or partial, without express prior written consent is prohibited; 4. Periodic updates to these archives and documentation may be made to these archives to address data errors or expand functionality. Any such updates are understood to happen on an ad hoc basis, without advance notice. An update shall not create or imply liability for errors or omissions identified in any previous versions; 5. They will transmit a copy of this agreement, and this technical documentation, unaltered and in its entirety, to any third parties with whom they choose to share (in whole or in any part) these archives. The most recent version of this user agreement, and technical documentation, can be found at http://tti.tamu.edu/group/transit-mobility/sf1acsindex, or by sending an e-mail to Dr. Michael J. Greenwald (see below) In the event an error or data abnormality is noticed, please notify Dr. Michael J. Greenwald via e-mail (michael.j.greenwald@gmail.com), as soon as possible. This e-mail should contain enough detail for Dr. Greenwald to document and identify the problem. Dr. Greenwald will respond as time constraints allow. Dr. Greenwald reserves the right to change this agreement at any time, for any reason, without prior notice. 1 P a g e

Part 2. Introduction This report documents the process to create master index files capable of linking various U.S. Census Bureau (Census) data products. Systematically linking various datasets together will allow Census data users to assemble information for statistical and geographic analysis. Why create master index files? The Census makes available to the public vast datasets containing population and demographic data, for example: 2010 Decennial Census Summary File 1 (Census SF1) 2006-2010 American Community Survey (ACS) Five Year Estimates 2007-2011 ACS Five Year Estimates The Census also generates and provides geographic files compatible with software packages, examples of TIGER/Line shapefiles are: Census Tracts (Tract) Census Block Group (Block Group) Census Block (Block) The data for each of the six Census products above contain identification fields. In some cases, the ID fields are similar and therefore capable of acting as a link between datasets. However, in many cases the Census data products do not contain a single unifying field capable of a clean, reliable link between large datasets. How do the Master Index Files work? Each index file contains record identifier numbers from the 2010 Census SF1, 2006-2010 ACS, and 2007-2011 ACS and associated FIPS location codes for Tracts and Block Groups. For Block geographies, the file contains the only record identifier numbers from the 2010 Census SF1 state level file (because ACS datasets are not produced for the Block level of geography). The key files act as the relational link between the SF1 and/or ACS datasets, and any dataset which has the FIPS location code as an attribute. Each geography key file (Tract, Block Group and Block) is a comma-delimited text file, compatible with software packages such as Microsoft Excel, Microsoft Access, SAS, SPSS, ArcGIS, and others. There is a key index file for each state, Washington DC, and Puerto Rico. The files are available in compressed folders containing the key files (organized by state postal abbreviation), on the following website: http://tti.tamu.edu/group/transit-mobility/sf1acsindex NOTE: These master index files do NOT contain the actual demographic data from the Census 2010 or the ACS; they are intended as links to those data sources. Refer to Part 3 for advice on how to download the Census 2010 or ACS demographic information for your desired study areas. Refer to Part 4 for the detailed list FTP site contents, and variable definitions. Refer to Part 5 for an example of how to use the master index file to connect census geographies with their related data records. 2 P a g e

Where is the original data from? The primary source datasets used to generate the key index files originate from the Census FTP site: http://www2.census.gov/ The original source data remains unchanged from its original state unless specifically documented in this report. For questions regarding accuracy of the original source data, contact the U.S. Census Bureau using contact information from the following webpage: http://www.census.gov/aboutus/contacts.html How do I differentiate directories, files, and variables in this report? This report lists variables and information in order of appearance in the key file datasets not alphabetically. In addition, the report uses a system of symbols and font characteristics to differentiate types of information: [Directories] contain square brackets around their names. Filenames.* are bold and always contain the file type extension. Variable Names are underlined. What if I have more questions? Contact Dr. Michael J. Greenwald with any questions regarding this report or the index files: michael.j.greenwald@gmail.com 3 P a g e

Part 3. Census and ACS Database Storage Structures Overview of 2010 Decennial Census SF1 Data Structure In the SF1 database environment, the entire list of all possible data tables is broken down and stored as separate subsets, known as segments. In each segment, the subset of data tables contains all the records for every possible unit of geographic detail in that segment. The desired level of geography is obtained by applying limiting criteria to selection queries to restrict the display of results within the segment. Example 1: In the Census SF1 database, Total Population is contained in Table P1. For the segment that contains Table P1, all values at the State, County, Tract, Block Group and Block are stored in the same column of data. To retrieve the population for each Census Block Group in a state, the query would include the SUMLEV data field and limiting criteria SUMLEV=150; for Census Tract, the limiting criteria would be SUMLEV=140; for Census Block, the limiting criteria would be SUMLEV=101. In order to connect the segments together in a coherent manner, and still maintain the flexibility of storing multiple levels of geography in each segment, the Census SF1 database uses an index field known as the LOGRECNO, an abbreviation of the term LOGical RECord NUmber. The same LOGRECNO applies to the same geographic unit across all data segments, including the geographic header information. Example 2: For Census Block Group 1 in Census Tract 1 in Lane County, Oregon, the LOGRECNO value is 119370. To retrieve the Total Population (Table P1, stored in Segment 1), Population by Urban or Rural Designation (Table P2, stored in Segment 2), Hispanic or Latino By Race (Table P8, stored in Segment 3), Median Age by Sex (Table P13, stored in Segment 4) and Population in Households by Age (Table P16, stored in Segment 5), you would build a selection query that tied Segment 1 through Segment 5 on the field LOGRECNO, and set the limiting criteria to LOGRECNO=119370. Furthermore, if you wanted to make sure that you retrieved the components of the FIPS Code (i.e., State, County, Tract and Block Group numbers, stored in Segment Geo_Header_SF1), you would add into the query the data segment that contains those data columns; making sure to connect that new data segment to one (or all) of the Segments 1 through 5. The relationship diagram would look very similar to Figure 1 (a few pages over). State level raw census data archives and Microsoft Access templates can be retrieved at: http://www2.census.gov/census_2010/04-summary_file_1/ For instructions on how to import and connect SF1 data segments into a Microsoft Access database environment, refer How to use Microsoft Access to extract data from the 2010 Census Summary File 1, available online at: http://www2.census.gov/census_2010/04- Summary_File_1/0HowToUseMSAccessWithSummaryFile1.pdf For full details on the structure of the U.S. 2010 Census SF1 dataset, refer to 2010 Census Summary File 1 Technical Documentation prepared by the U.S. Census Bureau, Revised 2012, available online at: http://www.census.gov/prod/cen2010/doc/sf1.pdf. 4 P a g e

Overview of ACS 5 Year Estimate Data Structure The ACS replaces the Summary File 3 data distributed as part of Census 2000 and earlier. Instead of a single decennial estimate, the ACS produces annual reports of annual, 3 year or 5 year rolling averages on socio-demographic information such as household income, disability status, employment and journey to work, just to name a few topics. It should be noted that ACS values are based on samples, and therefore are estimates of the population parameters in each table. This is why each ACS data table provides both the estimate and associated margin of error. The tradeoff one inherently accepts when using the ACS is between geographic, temporal and topical detail. ACS annual estimates are reported for geographies with populations of 65,000 or more; 3 year estimates are reported for geographies with populations of 20,000 or more; 5 year estimates include data for all geographies as small as Block Groups (note: 5 year estimates may be suppressed if confidentiality of respondents cannot be maintained or if there are not enough responses to construct a valid estimate). In addition, some topics covered in the annual or 3-year estimate may not be addressed in the 5 year estimate, to protect respondent confidentiality. For detailed descriptions of ACS data compilation as well as discussions about ACS methodology visit: http://www.census.gov/acs/www/guidance_for_data_users/handbooks/ In order to facilitate retrieving ACS data, the U.S. Census has created an Excel spreadsheet tool which automatically retrieves the estimates and margins of error for any of the desired reporting periods and levels of geography (SummaryFileDataRetrievalTool_V1.0.0.11.xlsm). For convenience, this tool and the support documentation provided by the U.S. Census are included in the AmericanCommunitySurveyResources.zip file on the FTP site with the master index files. Census updates the tool annually to ensure that the latest version of the ACS appears in the list of data retrieval options. For more information about updates to the tool, original archives, or further details visit: http://www.census.gov/acs/www/data_documentation/summary_file/. With respect to table structure, the ACS uses a very similar (though not identical) arrangement to the Census SF 1 file structure. Multiple levels of geography are stored in the same data table, and integrity between the geographic units in each table is maintained by using a LOGRECNO index. Figure 2A provides an example from the ACS 2007-2011 5 Year Estimate for Total Population, for each Tract and Block Group within the State of Oregon. Figure 2B depicts the Geography tab in an Excel workbook that contains the State, County, Tract and Block Group information which when combined constitute the FIPS code for the particular geographic record, similar to the SF1 file structure. 5 P a g e

Figure 1. Census 2010 Summary File 1 Table Relationships 6 P a g e

Figure 2. 2007-2011 ACS 5 Year Estimate, File Indexing Example, Part 1 Logical Record Number index field for ACS 5 Year Estimate Records Common administrative name for geographic unit 7 P a g e

Figure 3. 2007-2011 ACS 5 Year Estimate, File Indexing Example, Part 2 Logical Record Number index field for ACS 5 Year Estimate Records FIPS Code components 8 P a g e

Technical Differences in Record Organization: Census SF1 and ACS For all their similarities in data structure, there is an important difference between the SF1 and ACS data sets. Although the LOGRECNO index field appears in both databases, and they share the same purpose, they are not identical. The main consequence is the inability to use the values in the LOGRECNO field from SF1 to connect to ACS, or vice-versa. Doing so will result in connecting the wrong information to the wrong geographic unit. Example 3: From Figure 2A, we know that in the ACS 2007-2011 5 Year Estimate dataset for the State of Oregon, the LOGRECNO value for Census Tract 9501 in Baker County is 0002733, stored as a text string (with a corresponding numeric value of 2733). If we use that LOGRECNO value to connect to the U.S. Census 2010 SF1 State of Oregon dataset, the returned record is still in Baker County, but it is for Census Block 1170 in Census Tract 9506 (i.e., both wrong location AND wrong geographic scale!). The correct SF1 State of Oregon dataset LOGRECNO corresponding to Census Tract 9501 in Baker County is actually 5058. In addition, for some State datasets, the update between the ACS 2006-2010 and 2007-2011 5 Year Estimates has incorporated revisions which lead to minor changes in the LOGRECNO field. Example 4: Continuing with Example 3, for the ACS 2006-2010 5 Year Estimate dataset for the State of Oregon, the LOGRECNO value for Census Tract 9501 in Baker County is 0002734, stored as a text string (with a corresponding numeric value of 2734). This is an offset of +1 from the ACS 2007-2011 5 Year Estimate. The next section describes the various data files built to address these exact issues. 9 P a g e

Part 4. Master Index Files: FTP Site Structure, Table Descriptions, and Variable Definitions The purpose of the FTP archives is to serve as a one-stop shop for documentation and index files related to 2010 Census SF1, ACS 5 Year Estimates for Tracts and Block Groups. To that end, the site includes copies of data retrieval tools, database templates, and detailed methodology reports from the U.S. Census Bureau. FTP Site Structure and Contents AmericanCommunitySurveyResources.zip A compressed archive containing the methodology documents and user guides for the American Community Survey (ACS), as distributed by the U.S. Census Bureau website. Also contains an Excel spreadsheet data retrieval tool, developed and distributed by the U.S. Census Bureau, for all ACS datasets through the 2007-2011 5 Year Estimates. For first time users of the SF1/ACS consolidated index, DOWNLOADING THIS ARCHIVE IS HIGHLY RECOMMENDED!! USCensus2010SF1Resources.zip A compressed archive containing the methodology documents, database templates, and user guides for the U.S. 2010 Census Summary File 1, as distributed by the U.S. Census Bureau website. For first time users of the SF1/ACS consolidated index, DOWNLOADING THIS ARCHIVE IS HIGHLY RECOMMENDED!! ZZ.zip A compressed archive containing the index files described below (note: ZZ is a placeholder for the two letter state/territory postal abbreviation) ZZConsolidationDatabase.accdb This is the Microsoft Access 2007 database used to clean and assemble 2010 Census SF1, ACS 2006-2010, and ACS 2007-2011 source data into the combined SF1 and ACS master index files documented in this report (note: ZZ is used here as a placeholder for the two letter state/territory postal abbreviation). The file contains the database queries, listed sequentially, so that end users can review the code and suggest improvements. In addition, a user can adapt the queries in the database to develop custom master index files for a region (e.g., Select only those block groups that are in a particular county, and return their SF1 and ACS LOGRECNO index values ). ZZBlockLevelSF1KeyIndex.txt A comma delimited text file containing the LOGRECNO values for 2010 Census SF1, the ACS 2006-2010, and the ACS 2007-2011 datasets, for each 2010 Block within a particular state (note: ZZ is a placeholder for the two letter state/territory postal abbreviation). Since the ACS 5 Year Estimates only provide geographic detail down to the Block Group geography, this file contains only the SF1 dataset LOGRECNO indices. This is the master index file to connect any data indexed using 2010 Block FIPS codes directly to the corresponding SF1 state archive data sources. See field descriptions on following pages for details. ZZBlockGroupLevelCombinedSF1ACSKeyIndex.txt A comma delimited text file containing the LOGRECNO values for 2010 Census SF1, the ACS 2006-2010, and the ACS 2007-2011 datasets, for each 2010 Block Group within a particular state (note: ZZ is a placeholder for the two letter state/territory postal abbreviation). This is the master index file to connect any data indexed using a 2010 Block Group FIPS code directly to corresponding SF1 or ACS 5 Year Estimate data sources by state. See field descriptions on following pages for details. 10 P a g e

ZZTractLevelCombinedSF1ACSKeyIndex.txt A comma delimited text file containing the LOGRECNO values for 2010 Census SF1, the ACS 2006-2010, and the ACS 2007-2011 datasets, for each 2010 Census Tract within a particular state (note: ZZ is used here as a placeholder for the two letter state/territory postal abbreviation). This is the master index file for connecting any file indexed using a 2010 Tract FIPS code directly to the corresponding SF1 or ACS 5 Year Estimate data source by state. See field descriptions on following pages for details. Variable Definitions for Master Index Files This section defines the variable names contained in each of the three categories of master index files. The three categories of master index files are for three levels of geographic detail: one set of files for Blocks, another for Block Groups, and another for Tracts. Variable Name STUSAB Block10Val Block10Txt GeoID STATE COUNTY TRACT BLKGRP BLOCK SF1LogRecNum SF1LogRecTxt Master Index Files: Variable Definitions for Block Geography Files ZZBlockLevelSF1KeyIndex.txt Definition State/U.S. Abbreviation (USPS) 15 digit FIPS code for the Census Block, stored as numeric data type 15 digit FIPS code for the Census Block, stored as text data type Block10Txt with a prefix "US" added. Used for connecting this record to Census TIGER data products State FIPS Code; 2 digits, stored as text data type County FIPS Code; 3 digits, stored as text data type Census Tract FIPS Code; 6 digits, stored as text data type Census Block Group FIPS Code; 1 digit, stored as text data type Census Block FIPS Code; 4 digits (includes BLKGRP value), stored as text data type The LOGRECNO (i.e., Logical Record Number) for this Census Block found in the U.S. 2010 Census Summary File 1 corresponding to this particular State, stored as numeric data type. The LOGRECNO (i.e., Logical Record Number) for this Census Block found in the U.S. 2010 Census Summary File 1 corresponding to this particular State, stored as text data type. 11 P a g e

Variable Name STUSAB BlkGrp10Val BlkGrp10Txt GeoID SF1LogRecNum ACS0610LogRecNum ACS0711LogRecNum SF1LogRecTxt ACS0610LogRecTxt ACS0711LogRecTxt STATE COUNTY TRACT BLKGRP GeoName Master Index Files: Variable Definitions for Block Group Geography Files ZZBlockGroupLevelCombinedSF1ACSKeyIndex.txt Definition State/U.S. Abbreviation (USPS) 12 digit FIPS code for the Census Block Group, stored as numeric data type 12 digit FIPS code for the Census Block, stored as text data type BlkGrp10Txt with a prefix "US" added. Used for connecting this record to Census TIGER data products The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the U.S. 2010 Census Summary File 1 corresponding to this particular State. Stored as numeric data type. The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the American Community Survey 2006-2010 5 Year Estimate dataset corresponding to this particular State. Stored as numeric data type. Note: due to updates and revisions, this number may not be the same as the ACS0711LogRecNum value defined below. The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the American Community Survey 2007-2011 5 Year Estimate dataset corresponding to this particular State. Stored as numeric data type. Note: due to updates and revisions, this number may not be the same as the ACS00610LogRecNum value defined above. The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the U.S. 2010 Census Summary File 1 corresponding to this particular State. Stored as text data type. The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the American Community Survey 2006-2010 5 Year Estimate dataset corresponding to this particular State. Stored as text data type. Note: due to updates and revisions, this number may not be the same as the ACS0711LogRecNum value defined below. The LOGRECNO (i.e., Logical Record Number) for this Census Block Group found in the American Community Survey 2007-2011 5 Year Estimate dataset corresponding to this particular State. Stored as text data type. Note: due to updates and revisions, this number may not be the same as the ACS00610LogRecNum value defined above. State FIPS Code; 2 digits, stored as text County FIPS Code; 3 digits, stored as text Census Tract FIPS Code; 6 digits, stored as text Census Block Group FIPS Code; 1 digit, stored as text Common administrative name for the Census Block Group 12 P a g e

Variable Name STUSAB Tract10Val Tract10Txt GeoID SF1LogRecNum ACS0610LogRecNum ACS0711LogRecNum SF1LogRecTxt ACS0610LogRecTxt ACS0711LogRecTxt STATE COUNTY TRACT Master Index Files: Variable Definitions for Tract Geography Files ZZTractLevelCombinedSF1ACSKeyIndex.txt Definition State/U.S. Abbreviation (USPS) 11 digit FIPS code for the Census Tract, stored as numeric data type 11 digit FIPS code for the Census Tract, stored as text data type Tract10Txt with a prefix "US" added. Used for connecting this record to Census TIGER data products The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the U.S. 2010 Census Summary File 1 corresponding to this particular State. Stored as numeric data type. The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the American Community Survey 2006-2010 5 Year Estimate dataset corresponding to this particular State. Stored as numeric data type. Note: due to updates and revisions, this number may not be the same as the ACS0711LogRecNum value defined below. The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the American Community Survey 2007-2011 5 Year Estimate dataset corresponding to this particular State. Stored as numeric data type. Note: due to updates and revisions, this number may not be the same as the ACS00610LogRecNum value defined above. The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the U.S. 2010 Census Summary File 1 corresponding to this particular State. Stored as text data type. The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the American Community Survey 2006-2010 5 Year Estimate dataset corresponding to this particular State. Stored as text data type. Note: due to updates and revisions, this number may not be the same as the ACS0711LogRecNum value defined below. The LOGRECNO (i.e., Logical Record Number) for this Census Tract found in the American Community Survey 2007-2011 5 Year Estimate dataset corresponding to this particular State. Stored as text data type. Note: due to updates and revisions, this number may not be the same as the ACS00610LogRecNum value defined above. State FIPS Code; 2 digits, stored as text County FIPS Code; 3 digits, stored as text Census Tract FIPS Code; 6 digits, stored as text 13 P a g e

Part 5. Case Study Application and How-To Part 5 describes how the authors tested the use of the master index file for Tracts in Texas. The case study application was a quality control measure used during the development and testing of the master index files. The process of the case study provides insight into how the master index files can be utilized to successfully join TIGERLine files and population/demographic data from the U.S. Census Bureau. Poverty in Texas: Using Master Index File to Join ACS Data to Tracts Task to Complete Quickly evaluate and map distribution of poverty in Texas using ACS 5 Year Estimates for Census Tracts. Basic Steps to Analysis The basic steps necessary to complete the task are: Use ACS Summary File Retrieval tool to download poverty data (2007-2011) for Tracts Use Census TIGERLine FTP site to download 2010 Tracts in Texas Prepare ACS data for import, import into ArcGIS Geodatabase Join ACS data to Tracts Conduct analysis, create maps as necessary Main Problem In their native format the TIGERLine geography files and ACS data do not have data variables necessary for a join in ArcGIS software (or any other software). The two files possess similar fields, but not identical fields. Raw ACS Data Identity Variables See Note about Importing Raw ACS Data into ArcGIS before attempting to import raw ACS data into ArcGIS. Raw TIGERLine Identity Variables 14 P a g e

Potential Solution: Master Index File for Texas Tracts Import the master index file for Tracts in Texas. Use the imported master index file as the link to join the ACS poverty data to TIGERLine Tracts. Raw Master Index File for Tracts in Texas Case Study Application of Solution Two joins are necessary to evaluate if the key index file successfully links TL to ACS raw data: Join #1 o TIGERLine Tracts to master index file o tl_2010_48_tract10.shp to TXTractLevelCombinedSF1ACSKeyIndex.txt Join #2 o Join TIGERLine Tracts to 2007-2011 ACS Tract data o tl_2010_48_tract10.shp to ACS_11_5YR_S1701_with_ann.txt Details of the Join #1 in ArcGIS Target Table: tl_2010_48_tract10 Target Field: GEOID10 Join Table: TXTractLevelCombinedSF1ACSKeyIndex Join Field: Tract10Txt Join Type: Keep all records Data Type: Standalone Table 15 P a g e

Details of the Join #2 in ArcGIS Target Table: tl_2010_48_tract10 Target Field: TXTractLevelCombinedSF1ACSKeyIndex.Tract10Val Join Table: ACS_11_5YR_S1701_with_ann Join Field: Geoid2 Join Type: Keep all records Data Type: Standalone Table Conclusions and Observations The key file successfully acted as the solder between previously problematic Census TIGERLine Geography files and 2007-2011 ACS data. 16 P a g e

Two joins were required to complete the link, as it was necessary to join the master index to the Tracts before joining to the ACS data. This was a simple operation in ArcGIS. Without the master index file to act as the joining tool, a GIS user would have to employ several other steps to overcome the lack of a joining variable. The fact that the master index file removed the necessity to create new information to join two datasets means that a power GIS or database user can use the files as part of a scripted, or automated, analysis process to join disparate datasets saving time and money! Note about Importing Raw ACS Data into ArcGIS ACS data files in their native format are not compatible with ArcGIS import routines. The first three columns in datasets downloaded using the ACS Summary File Retrieval Tool typically contain both. and - values in the column headings. In the case study application, the researcher had to remove those incompatible values from the following three columns using Microsoft Excel prior to importing ACS data into ArcGIS: GEO.id changed to GEOid GEO.id2 changed to GEOid2 GEO.display-label changed to GEOdisplaylabel Problem? Question? The authors found the master index file to be sound. However, should you ever notice an issue with one of the master index files or have any other questions, please share them with Dr. Mike Greenwald: michael.j.greenwald@gmail.com 17 P a g e