ABSTRACT INTRODUCTION

Size: px
Start display at page:

Download "ABSTRACT INTRODUCTION"

Transcription

1 Iterative Proportional Fitting and Population Dynamics using SAS Himanshu Joshi, Houston-Galveston Area Council, Houston, TX Dmitry Messen, Houston-Galveston Area Council, Houston, TX ABSTRACT For doing small area socioeconomic forecast Metropolitan Planning Organizations (MPOs) often need demographic data at individual person level. There no such single data source available and often iterative proportional fitting (IPF) procedure is used to generate synthetic population using census datasets. This paper demonstrates how the Socioeconomic Modeling group at the Houston-Galveston Area Council (H-GAC) uses SAS to create synthetic population data and how this data is also used to forecast population using aging-birthing-migration. INTRODUCTION At H-GAC, Socioeconomic modeling group s main task is to prepare long range growth forecast of population and employment for 8-county region. The primary purpose of the Regional Growth Forecast is to support Travel Demand Modeling which is used in Regional Transportation Planning. H-GAC also uses the forecast for other long range planning purposes. We use UrbanSim modeling package for doing the small area forecast ( UrbanSim is a software-based simulation model for integrated planning and analysis of urban development, incorporating the interactions between land use, transportation and public policy. It is intended for use by Metropolitan Planning Organizations and others needing to interface existing travel models with new land use forecasting and analysis capabilities ( This paper explains how we are using SAS to write our own module for demographic input for the model. UrbanSim is an agent based model and requires detailed input information regarding population, employment and land use. Among the several input data tables, information about current detailed demographic data and future control totals, is required. Data preparation for these two tables is explained in following sections. A. Iterative Proportional Fitting (IPF) IPF procedure is carried out to produce synthetic population. The data sources for creating synthetic population are 2000 census summary tape file 3 (STF3) and census Public Use Microdata Sample (PUMS) data. The PUMS file has 5% sample from long form census records at relatively large geographic areas called PUMA. A PUMA consists of several census tracts or block groups. PUMS data consists of 2 tables, one for households and one for persons. Records from both tables can be related using unique household ID given for each household. While PUMS data consists of entries at individual level, STF3 data consists of population counts by different geography level. Following example shows a simple 2-dimensional IPF procedure: Suppose we have total counts of people by age and sex from STF3 table as shown in table 1 in a block group. And we have corresponding PUMS data as shown in table 2. The goal here is to use IPF procedure to fill in the? signs in table 1 such that the marginal totals are preserved.

2 Male????? 53 Female????? 47 Total Table 1: Hypothetical age by sex block group distribution Male Female Total Table 2: Hypothetical PUMS data for the example block group First step is to calculate the adjustment factors for first variable, say sex. This is done in following manner. Adjustment Factor Male /723 = Female /762 = Total Table 3: Calculate first adjustment factor In the second step, each cell in the table 2 is multiplied by the adjustment factor. The results are shown in table 4. Male 56 * = Female 61 * = Total Table 4: First step adjustments Next step is to calculate the adjustment factors for the second variable, age as shown in table 5.

3 15/ = / = / = / = / = Table 5: Calculate second adjustment factor In the last step of first iteration, the cells in the table 4 are multiplied by respective adjustment factors in table 5. The results are shown in table 6. Sex >75 Total Male Female Total Table 6: Second step adjustments The above procedure of adjusting one dimension at a time is repeated until adjustment factors are close to 1. Although it can become real complex, this procedure can be extended for 3 or more dimensions. This is well documented in TRANSIMS population synthesizer (Hobeika, Antoine 2005). Basically, idea is to fit one dimension at a time until the multi-dimensional matrix is converged. Example code: %do %until (matrix is converged) %do l = 1 %to 2; dimension */ /* For first iteration. Repeat for each data synth; set input data file; frac = sf3 /pums; /* Calculate the adjustment factor */ pumsf1 = frac * pumsf; /* Calculate the cell values */ /* Calculate new marginal totals */ create table syntha as select *, sum(pumsf1) as pumss from synth group by sex; create table synthb as select *, sum(pumsf1) as pumsa from synth group by age; create table synth1 as select x.*,y.pumsa,z.pumsi,z1.pumsr from syntha as x,synthb as y where x.sex = y.sex and x.age = y.age and order by sex,age; data psynthesis; set synth1; pop = int(pumsf1); /* Take only the integer portion of the number */ %

4 % This is an example code for a 2 X 2 matrix. The actual macro is done for 4-dimensional matrix. B. Population Dynamics This module attempts to predict the demographic change over the period of time. Once we have the base year synthetic population ready we can use it to project for future years. This is done using simple probabilistic simulation with one year increment. The data sources for this module are, persons data created using IPF as described above and survival rates, fertility rates and net migration rates. Survival rates, fertility rates and net migration rates were obtained from Texas State Data Center (TSDC). These rates are year, county, age, race and sex specific. Following section illustrates the basic concept of the procedure along with some sample code. At first, year 2000 persons file is tabulated by county, age, race and sex. In the first iteration all people are aged by 1 and a random number, between 0 and 1, is assigned to each person. This variable is called as rs. The survival rates are then joined to this table by county, age, year, race and sex. The rates are then compared with variable rs. If rs is greater than the survival rate then that person is termed as dead. These persons are separated out to a different table. data persons_2001; set persons_2000; age+1; create table persons_2001a as select x.*,y.survival_p from persons_2001 as x left join survival_p as y on x.county = y.county and x.age = y.age and y.year = 2001 and x.race = y.race and x.sex = y.sex; data persons_2001c deaths_2001; set persons_2001b; if rs > survival_p then do; person_status = 'P00'; output deaths_2001; else output persons_2001c; After this, a females table is created for birthing. Since fertility rates are for females of age between 9 and 50, only females between these age categories are selected and a random number rb, between 0 and 1, is assigned to each person. This variable is called as rb. The fertility rates are then joined to this table by county, age, year and race. The rates are then compared with variable rb and then birth event is determined including sex of the baby. A new person id is assigned to the new baby. data females_2001(keep = county person_id family_id age race hisp race); set persons_2001c; if sex = 'S02'; if 9 < age < 50;

5 data females_2001; set females_2001; RB = ranuni(2); create table females_2001a as select x.*,y.birth_event,y.p0_birth,y.p1_birth from females_2001 as x left join birthing_p as y on x.county = y.county and y.year = 2001 and x.race = y.race and x.age = y.age; Once survival and fertility modules are done a new persons table is created adding survived people and new born babies. Persons are then aggregated by county, age, race and sex. To this new base table migration rates are applied by county, age, race and sex. The migration rates are net migration rates. So a negative migration rate denotes out migration and a positive migration rate denotes in migration. A unique person id is applied to the in migrants. A separate table for in and out migrants is created. create table mig1 as select distinct county,age,race,sex,count(person_id) as people from persons_2001 group by county,age,sex,race; create table mig2 as select x.*,y.migrate,round(people*migrate) as mpop from mig1 as x,migrates as y where x.county = y.county and x.race = y.race and x.sex = y.sex and x.age = y.age and calculated mpop ~= 0; proc sort data = persons_2001; by person_id; create table mig3 as select x.*, y.mpop from persons_2001 as x,mig2 as y where x.county = y.county and x.age = y.age and x.race = y.race and x.sex = y.sex order by county,race,age,sex,rand; data imig omig; retain n;set mig3;by county race age sex; if first.county then n = 0; if first.race then n = 0; if first.age then n = 0; if first.sex then n = 0; if mpop > 0 then do; n+1; if n < = mpop then output imig; if mpop < 0 then do; n=n-1; if n < 0 then if n > = mpop then do; output omig; delete; After all three modules are done a final persons file for 2001 is created. This procedure was repeated for desired number of years.

6 CONCLUSION The household synthesis module in UrbanSim was written in Java before and it s now written in python. This was kind of a black box for a person who is not familiar with these programming languages. Population synthesis module is currently not available for UrbanSim. SAS gave us an opportunity to write and test our own modules. REFERENCES Hobeika, Antoine TRANSIMS Fundamentals. Virginia Polytechnic University. UrbanSim website. H-GAC website. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Himanshu Joshi Houston-Galveston Area Council 3555 Timmons Lane, Suite#120 Houston, TX himanshu.joshi@h-gac.com SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies.

Exercise 13. Accessing Census 2000 PUMS Data

Exercise 13. Accessing Census 2000 PUMS Data Exercise 13. Accessing Census 2000 PUMS Data Purpose: The goal of this exercise is to extract some 2000 PUMS data for Asian Indians for PUMAs within California. You may either download the records for

More information

Introduction to Social Explorer

Introduction to Social Explorer Introduction to Social Explorer Janine Billadello, Geospatial Data Lab, Baruch College October 1, 2015 Abstract This tutorial will introduce you to Social Explorer, an online mapping interface that allows

More information

A Disclosure Avoidance Research Agenda

A Disclosure Avoidance Research Agenda A Disclosure Avoidance Research Agenda presented at FCSM research conf. ; Nov. 5, 2013 session E-3; 10:15am: Data Disclosure Issues Paul B. Massell U.S. Census Bureau Center for Disclosure Avoidance Research

More information

PDQ-Notes. Reynolds Farley. PDQ-Note 3 Displaying Your Results in the Expert Query Window

PDQ-Notes. Reynolds Farley. PDQ-Note 3 Displaying Your Results in the Expert Query Window PDQ-Notes Reynolds Farley PDQ-Note 3 Displaying Your Results in the Expert Query Window PDQ-Note 3 Displaying Your Results in the Expert Query Window Most of your options for configuring your query results

More information

A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS

A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS Xin Ye Department of Civil and Environmental Engineering Arizona State University,

More information

Example 1 - Joining datasets by a common variable: Creating a single table using multiple datasets Other features illustrated: Aggregate data multi-variable recode, computational calculation Background:

More information

Basic facts about Dudley

Basic facts about Dudley Basic facts about Dudley This report provides a summary of the latest available information on the demographic and socioeconomic make-up of the 1 Big Local (DRAFT)s in Dudley. It looks at the population

More information

Creating a Microdata Extract

Creating a Microdata Extract Minnesota Population Center Training and Development Creating a Microdata Extract Exercise Objective: Use the IPUMS Terra website to obtain a customized dataset that can be used to answer research questions.

More information

Quick Guide to American FactFinder

Quick Guide to American FactFinder Quick Guide to American FactFinder 1. Search Terms... 2 2. Finding Neighborhoods... 6 3. Downloading the Tables 13 4. Interpreting the Numbers... 18 Introduction The American FactFinder is a useful online

More information

Public Use Microdata Samples

Public Use Microdata Samples Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004 2000 Census Data Tabulations Summary Files 1-4, Equal Employment Opportunity, School District

More information

Comparative Evaluation of Synthetic Dataset Generation Methods

Comparative Evaluation of Synthetic Dataset Generation Methods Comparative Evaluation of Synthetic Dataset Generation Methods Ashish Dandekar, Remmy A. M. Zen, Stéphane Bressan December 12, 2017 1 / 17 Open Data vs Data Privacy Open Data Helps crowdsourcing the research

More information

Using American FactFinder

Using American FactFinder Using American FactFinder John DeWitt Lisa Neidert Project Manager Data Services Social Science Data Analysis Network Population Studies Center What is American FactFinder? http://factfinder.census.gov

More information

Memorandum Methodology Review. Paul Agnello and Danny Reese, FAMPO. David Jackson, Feng Liu, Jingjing Zang, CS. DATE: September 29, 2016

Memorandum Methodology Review. Paul Agnello and Danny Reese, FAMPO. David Jackson, Feng Liu, Jingjing Zang, CS. DATE: September 29, 2016 Memorandum TO: FROM: Paul Agnello and Danny Reese, FAMPO David Jackson, Feng Liu, Jingjing Zang, CS DATE: September 29, 2016 RE: DRAFT GWRC 2045 Socioeconomic Data Development This memorandum provides

More information

Creating Population Tree Charts (Using SAS/GRAPH Software) Robert E. Allison, Jr. and Dr. Moon W. Suh College of Textiles, N. C.

Creating Population Tree Charts (Using SAS/GRAPH Software) Robert E. Allison, Jr. and Dr. Moon W. Suh College of Textiles, N. C. SESUG 1994 Creating Population Tree Charts (Using SAS/GRAPH Software) Robert E. Allison, Jr. and Dr. Moon W. Suh College of Textiles, N. C. State University ABSTRACT This paper describes a SAS program

More information

Super boost data transpose puzzle

Super boost data transpose puzzle Paper 2100-2016 Super boost data transpose puzzle Ahmed Al-Attar, AnA Data Warehousing Consulting LLC, McLean, VA ABSTRACT This paper compares different solutions to a data transpose puzzle presented to

More information

Access Data with Census APIs

Access Data with Census APIs You are here: Census.gov APIs - US Census Bureau Access Data with Census APIs Census Bureau APIs To improve access to data and encourage innovation, the Census Bureau has begun to provide API access to

More information

Creating an Area-level Extract

Creating an Area-level Extract Minnesota Population Center Training and Development Creating an Area-level Extract Exercise Objective: Use the IPUMS Terra website to obtain a customized dataset that can be used to answer research questions.

More information

Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA

Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA ABSTRACT Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA Data set documentation is essential to good

More information

Census-ACS Master Index Files

Census-ACS Master Index Files Census-ACS Master Index Files Documentation and User s Guide VERSION 1.1 March 1, 2013 Created by Michael J. Greenwald, Ph.D., AICP With assistance from Jonathan Brooks, Texas A&M Transportation Institute

More information

Public Use Microdata Samples

Public Use Microdata Samples Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library December 2003 Public Use Microdata Samples Copies of the original questionnaires with identifying information

More information

Title: Extracting Custom Cross-tabs for Travel Demand Modeling Data with the PUMS 2000 DVD-ROM Beyond 20/20 Software

Title: Extracting Custom Cross-tabs for Travel Demand Modeling Data with the PUMS 2000 DVD-ROM Beyond 20/20 Software Title: Extracting Custom Cross-tabs for Travel Demand Modeling Data with the PUMS 2000 DVD-ROM Beyond 20/20 Software Mike Jaffe Mid-Willamette Valley Council of Governments 105 High Street, S.E. Salem

More information

How to Get Census Data from E-STAT

How to Get Census Data from E-STAT How to Get Census Data from E-STAT Connecting to E-STAT 1) From the Library Home Page, http://library.mcmaster.ca, click on the Databases tab 2) Once on the Databases tab, type in estat and hit the Go

More information

Methods for Producing Consistent Control Totals for Benchmarking in Survey Sampling

Methods for Producing Consistent Control Totals for Benchmarking in Survey Sampling Methods for Producing Consistent Control Totals for Benchmarking in Survey Sampling Ismael Flores Cervantes 1 1 Westat, 1600 Research Blvd, Rockville, MD 20850 Abstract Estimates from probability samples

More information

POLIDATA Political Data Analysis DATABASE DEVELOPMENT, ANALYSIS AND PUBLICATION; POLITICAL AND CENSUS DATA; LITIGATION SUPPORT

POLIDATA Political Data Analysis DATABASE DEVELOPMENT, ANALYSIS AND PUBLICATION; POLITICAL AND CENSUS DATA; LITIGATION SUPPORT POLIDATA Political Data Analysis DATABASE DEVELOPMENT, ANALYSIS AND PUBLICATION; POLITICAL AND CENSUS DATA; LITIGATION SUPPORT CLARK BENSEN POLIDATA 1303 HAYWARD ROAD, P. O. BOX 530 CORINTH, VT 05039 Tel:

More information

A population grid for Andalusia Year Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013

A population grid for Andalusia Year Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013 A population grid for Andalusia Year 2013 Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013 IECA project. Population grid cells sized 250 x 250m for Andalusia 1.

More information

How to extract suicide statistics by country from the. WHO Mortality Database Online Tool

How to extract suicide statistics by country from the. WHO Mortality Database Online Tool Instructions for users How to extract suicide statistics by country from the WHO Mortality Database Online Tool This guide explains how to access suicide statistics and make graphs and tables, or export

More information

Speed Dating: Looping Through a Table Using Dates

Speed Dating: Looping Through a Table Using Dates Paper 1645-2014 Speed Dating: Looping Through a Table Using Dates Scott Fawver, Arch Mortgage Insurance Company, Walnut Creek, CA ABSTRACT Have you ever needed to use dates as values to loop through a

More information

Creating a Raster Extract

Creating a Raster Extract Minnesota Population Center Training and Development Creating a Raster Extract Exercise Objective: Use the IPUMS Terra website to obtain customized datasets that can be used to answer research questions.

More information

Quick Reference for the FloridaCHARTS Fetal Death Query

Quick Reference for the FloridaCHARTS Fetal Death Query Quick Reference for the FloridaCHARTS Fetal Death Query 1. Toolbar Functions 2. Reports 3. Frequently Asked Questions This application is set up in sections. To use it, you do not have to follow any particular

More information

USING 2001 CENSUS DATA IN ArcMap 8.2

USING 2001 CENSUS DATA IN ArcMap 8.2 USING 2001 CENSUS DATA IN ArcMap 8.2 This is a guide to mapping 2001 Census data in ArcMap using age and sex profile data for Toronto at the census tract level. Importing boundary files, importing census

More information

Downloading 2010 Census Data

Downloading 2010 Census Data Downloading 2010 Census Data These instructions cover downloading the Census Tract polygons and the separate attribute data. After that, the attribute data will need additional formatting in Excel before

More information

Overview of SAS/GIS Software

Overview of SAS/GIS Software 3 CHAPTER 1 Overview of SAS/GIS Software Introduction to Geographic Information Systems 3 Features of SAS Software 4 Data in SAS/GIS Applications 5 Spatial Data 5 Spatial Data Layers 6 Spatial Data Coverages

More information

Hot-deck Imputation with SAS Arrays and Macros for Large Surveys

Hot-deck Imputation with SAS Arrays and Macros for Large Surveys Hot-deck Imation with SAS Arrays and Macros for Large Surveys John Stiller and Donald R. Dalzell Continuous Measurement Office, Demographic Statistical Methods Division, U.S. Census Bureau ABSTRACT SAS

More information

Data Manipulations Using Arrays and DO Loops Patricia Hall and Jennifer Waller, Medical College of Georgia, Augusta, GA

Data Manipulations Using Arrays and DO Loops Patricia Hall and Jennifer Waller, Medical College of Georgia, Augusta, GA Paper SBC-123 Data Manipulations Using Arrays and DO Loops Patricia Hall and Jennifer Waller, Medical College of Georgia, Augusta, GA ABSTRACT Using public databases for data analysis presents a unique

More information

How to calculate population and jobs within ½ mile radius of site

How to calculate population and jobs within ½ mile radius of site How to calculate population and jobs within ½ mile radius of site Caltrans Project P359, Trip Generation Rates for Transportation Impact Analyses of Smart Growth Land Use Projects SECTION PAGE Population

More information

Synthetic Population Techniques in Activity- Based Research

Synthetic Population Techniques in Activity- Based Research 48 Chapter 3 Synthetic Population Techniques in Activity- Based Research Sungjin Cho Hasselt University, Belgium Tom Bellemans Hasselt University, Belgium Lieve Creemers Hasselt University, Belgium Luk

More information

Paper Time Contour Plots. David J. Corliss, Wayne State University / Physics and Astronomy

Paper Time Contour Plots. David J. Corliss, Wayne State University / Physics and Astronomy ABSTRACT Paper 1311-2014 Time Contour Plots David J. Corliss, Wayne State University / Physics and Astronomy This new SAS tool is two-dimensional color chart for visualizing changes in a population or

More information

Census Transportation Planning Products (CTPP) AASHTO Update Penelope Weinberger, AASHTO,

Census Transportation Planning Products (CTPP) AASHTO Update Penelope Weinberger, AASHTO, August 2010 Census Transportation Planning Products (CTPP) AASHTO Update Penelope Weinberger, AASHTO, Pweinberger@aashto.org CTPP Three-Year ACS Data Products The Census Bureau has delivered the CTPP three-year

More information

Introduction to IPUMS

Introduction to IPUMS Introduction to IPUMS Katie Genadek Minnesota Population Center University of Minnesota kgenadek@umn.edu The IPUMS projects are funded by the National Science Foundation and the National Institutes of

More information

ACS Summary File Technical Documentation

ACS Summary File Technical Documentation ACS Summary File Technical Documentation 2015 ACS 1-year and 2011-2015 ACS 5-year Data Releases American Community Survey Office Issued September 2016 Table of Contents 1 INTRODUCTION... 1 1.1 THE AMERICAN

More information

Massachusetts Institute of Technology Department of Urban Studies and Planning

Massachusetts Institute of Technology Department of Urban Studies and Planning Massachusetts Institute of Technology Department of Urban Studies and Planning 11.204: Planning, Communication & Digital Media Fall 2004 Lab 5: Using Access to Query Multiple Data Sets Help Section Exploring

More information

SAS Visual Analytics 8.2: Getting Started with Reports

SAS Visual Analytics 8.2: Getting Started with Reports SAS Visual Analytics 8.2: Getting Started with Reports Introduction Reporting The SAS Visual Analytics tools give you everything you need to produce and distribute clear and compelling reports. SAS Visual

More information

Age & Stage Structure: Elephant Model

Age & Stage Structure: Elephant Model POPULATION MODELS Age & Stage Structure: Elephant Model Terri Donovan recorded: January, 2010 Today we're going to be building an age-structured model for the elephant population. And this will be the

More information

DELIVERABLE D12.6/12.7 IECM Database Extension & User Interface with Tabulator

DELIVERABLE D12.6/12.7 IECM Database Extension & User Interface with Tabulator Project N : 262608 ACRONYM: Data without Boundaries DELIVERABLE D12.6/12.7 IECM Database Extension & User Interface with Tabulator WORK PACKAGE 12 Implementing Improved Resource Discovery for OS Data REPORTING

More information

ABSTRACT INTRODUCTION PROBLEM: TOO MUCH INFORMATION? math nrt scr. ID School Grade Gender Ethnicity read nrt scr

ABSTRACT INTRODUCTION PROBLEM: TOO MUCH INFORMATION? math nrt scr. ID School Grade Gender Ethnicity read nrt scr ABSTRACT A strategy for understanding your data: Binary Flags and PROC MEANS Glen Masuda, SRI International, Menlo Park, CA Tejaswini Tiruke, SRI International, Menlo Park, CA Many times projects have

More information

The New American FactFinder N E W F U N C T I O N A L I T Y B U T S O M E Q U I R K S

The New American FactFinder N E W F U N C T I O N A L I T Y B U T S O M E Q U I R K S The New American FactFinder N E W F U N C T I O N A L I T Y B U T S O M E Q U I R K S Good Things Have to know less to find data Table manipulation available online Can now map any (single) variable Drawbacks

More information

2045 DEMOGRAPHIC FORECAST SURFACE TRANSPORTATION TECHNICAL COMMITTEE APRIL 28, 2017

2045 DEMOGRAPHIC FORECAST SURFACE TRANSPORTATION TECHNICAL COMMITTEE APRIL 28, 2017 2045 DEMOGRAPHIC FORECAST SURFACE TRANSPORTATION TECHNICAL COMMITTEE APRIL 28, 2017 BACKGROUND The demographic forecasts provide a common base for regional planning and resource allocations. The forecasts

More information

Using NHGIS: An Introduction

Using NHGIS: An Introduction Using NHGIS: An Introduction August 2014 Funding provided by the National Science Foundation and National Institutes of Health. Project support provided by the Minnesota Population Center at the University

More information

Creating Forest Plots Using SAS/GRAPH and the Annotate Facility

Creating Forest Plots Using SAS/GRAPH and the Annotate Facility PharmaSUG2011 Paper TT12 Creating Forest Plots Using SAS/GRAPH and the Annotate Facility Amanda Tweed, Millennium: The Takeda Oncology Company, Cambridge, MA ABSTRACT Forest plots have become common in

More information

Kyeil Kim, Guy Rousseau, Kyung-Hwa Kim & David D Onofrio

Kyeil Kim, Guy Rousseau, Kyung-Hwa Kim & David D Onofrio Kyeil Kim, Guy Rousseau, Kyung-Hwa Kim & David D Onofrio 4 th International Conference on Innovations in Travel Modeling April 30-May 2, 2012 Tampa, Florida Anti-intellectualism nurtured by the false notion

More information

Geographic Accuracy of Cell Phone RDD Sample Selected by Area Code versus Wire Center

Geographic Accuracy of Cell Phone RDD Sample Selected by Area Code versus Wire Center Geographic Accuracy of Cell Phone RDD Sample Selected by versus Xian Tao 1, Benjamin Skalland 1, David Yankey 2, Jenny Jeyarajah 2, Phil Smith 2, Meena Khare 3 1 NORC at the University of Chicago 2 National

More information

Paper SDA-11. Logistic regression will be used for estimation of net error for the 2010 Census as outlined in Griffin (2005).

Paper SDA-11. Logistic regression will be used for estimation of net error for the 2010 Census as outlined in Griffin (2005). Paper SDA-11 Developing a Model for Person Estimation in Puerto Rico for the 2010 Census Coverage Measurement Program Colt S. Viehdorfer, U.S. Census Bureau, Washington, DC This report is released to inform

More information

[spa-temp.inf] Spatial-temporal information

[spa-temp.inf] Spatial-temporal information [spa-temp.inf] Spatial-temporal information VI Table of Contents for Spatial-temporal information I. Spatial-temporal information........................................... VI - 1 A. Cohort-survival method.........................................

More information

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were

More information

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO ABSTRACT The power of SAS programming can at times be greatly improved using PROC SQL statements for formatting and manipulating

More information

Truncate, replicate, sample

Truncate, replicate, sample Truncate, replicate, sample A probabilistic method of integerisation Robin Lovelace & Dimitris Ballas Sheffield Presented at the IMA, May 2012, Dublin Robin Lovelace & Dimitris Ballas (Sheffield) Truncate,

More information

Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use

Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use Matthias Templ, Bernhard Meindl Statistics Austria & Vienna Uni. of Techn. Mai 2015 ODAM 2015, Olomouc, CZ Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use Matthias

More information

Principal Components Analysis with Spatial Data

Principal Components Analysis with Spatial Data Principal Components Analysis with Spatial Data A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,

More information

%ANYTL: A Versatile Table/Listing Macro

%ANYTL: A Versatile Table/Listing Macro Paper AD09-2009 %ANYTL: A Versatile Table/Listing Macro Yang Chen, Forest Research Institute, Jersey City, NJ ABSTRACT Unlike traditional table macros, %ANTL has only 3 macro parameters which correspond

More information

%MAKE_IT_COUNT: An Example Macro for Dynamic Table Programming Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma

%MAKE_IT_COUNT: An Example Macro for Dynamic Table Programming Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma ABSTRACT Today there is more pressure on programmers to deliver summary outputs faster without sacrificing quality. By using just a few programming

More information

Segmented or Overlapping Dual Frame Samples in Telephone Surveys

Segmented or Overlapping Dual Frame Samples in Telephone Surveys Vol. 3, Issue 6, 2010 Segmented or Overlapping Dual Frame Samples in Telephone Surveys John M Boyle *, Faith Lewis, Brian Tefft * Institution: Abt SRBI Institution: Abt SRBI Institution: AAA Foundation

More information

SHRP2 C10: Jacksonville

SHRP2 C10: Jacksonville SHRP2 C10: Jacksonville Partnership to Develop an Integrated Advanced Travel Demand Model and a Fine grained Timesensitive Network Presented by: Stephen Lawe Key Agency Partners: Florida Department of

More information

Microsimulation model user guide Flexible Modelling Framework

Microsimulation model user guide Flexible Modelling Framework National Centre for Research Methods Working Paper 06/13 Microsimulation model user guide Flexible Modelling Framework Kirk Harland, TALISMAN node, University of Leeds Microsimulation model user guide

More information

Time Contour Plots. David J. Corliss Magnify Analytic Solutions, Detroit, MI

Time Contour Plots. David J. Corliss Magnify Analytic Solutions, Detroit, MI Time Contour Plots David J. Corliss Magnify Analytic Solutions, Detroit, MI ABSTRACT This new SAS tool is two-dimensional color chart for visualizing changes in a population or system over time. Data for

More information

Fundamental Data Manipulation Techniques

Fundamental Data Manipulation Techniques The Analysis of Longitudinal Data Introduction This document describes the process of organizing longitudinal data from the HRS for the purposes of statistical analysis. We start by considering three small

More information

(c) What is the result of running the following program? x = 3 f = function (y){y+x} g = function (y){x =10; f(y)} g (7) Solution: The result is 10.

(c) What is the result of running the following program? x = 3 f = function (y){y+x} g = function (y){x =10; f(y)} g (7) Solution: The result is 10. Statistics 506 Exam 2 December 17, 2015 1. (a) Suppose that li is a list containing K arrays, each of which consists of distinct integers that lie between 1 and n. That is, for each k = 1,..., K, li[[k]]

More information

A Practical Introduction to SAS Data Integration Studio

A Practical Introduction to SAS Data Integration Studio ABSTRACT A Practical Introduction to SAS Data Integration Studio Erik Larsen, Independent Consultant, Charleston, SC Frank Ferriola, Financial Risk Group, Cary, NC A useful and often overlooked tool which

More information

Plan Sponsor Reporting

Plan Sponsor Reporting Creating a report Our reporting feature gives you the ability to independently obtain Plan data that s pertinent to your needs. To create a report, select the Reporting Portal menu item, located on the

More information

Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD

Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD ABSTRACT While working on a time series modeling problem, we needed to find the row and column that corresponded

More information

Trip Distribution Models in TransCAD. Murtaza Haider Tel: , ext. 2480

Trip Distribution Models in TransCAD. Murtaza Haider Tel: , ext. 2480 Trip Distribution Models in TransCAD Murtaza Haider murtaza.haider@ryerson.ca Tel: 416.979.5000, ext. 2480 Trip Distribution Procedures Goal: understand how to use TransCAD to: Generate inputs for trip

More information

Using Templates Created by the SAS/STAT Procedures

Using Templates Created by the SAS/STAT Procedures Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity

More information

Exporting Variable Labels as Column Headers in Excel using SAS Chaitanya Chowdagam, MaxisIT Inc., Metuchen, NJ

Exporting Variable Labels as Column Headers in Excel using SAS Chaitanya Chowdagam, MaxisIT Inc., Metuchen, NJ Paper 74924-2011 Exporting Variable Labels as Column Headers in Excel using SAS Chaitanya Chowdagam, MaxisIT Inc., Metuchen, NJ ABSTRACT Excel output is the desired format for most of the ad-hoc reports

More information

Converting census data into GIS data

Converting census data into GIS data Converting census data into GIS data For those interested in analyzing Canadian demographics data, census data can be downloaded directly from ODESI or the Canadian Census Analyser. In order to use the

More information

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA PharmaSUG 2016 - Paper SS12 Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA ABSTRACT SAS Clinical Data Integration simplifies the

More information

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University Geographical Information Systems Institute Center for Geographic Analysis, Harvard University LAB EXERCISE 5: Queries, Joins: Spatial and Non-spatial 1.0 Getting Census data 1. Go to the American Factfinder

More information

USER GUIDE. Homelessness Data Exchange (HDX), Version 2.0 BETA TEST VERSION. June, U.S. Department of Housing and Urban Development

USER GUIDE. Homelessness Data Exchange (HDX), Version 2.0 BETA TEST VERSION. June, U.S. Department of Housing and Urban Development Homelessness Data Exchange (HDX), Version 2.0 USER GUIDE BETA TEST VERSION June, 2018 U.S. Department of Housing and Urban Development 1 Table of Contents Introduction... 3 Log into the site... 3 Reset

More information

Survey Questions and Methodology

Survey Questions and Methodology Survey Questions and Methodology Spring Tracking Survey 2012 Data for March 15 April 3, 2012 Princeton Survey Research Associates International for the Pew Research Center s Internet & American Life Project

More information

NANOS SURVEY NANOS SURVEY

NANOS SURVEY NANOS SURVEY Canadians are three times more likely to say Canada should ban than allow Huawei from participating in the 5G network in Canada National survey released August, 2018 Project 2018-1260A Summary Fifty-four

More information

Setting the Percentage in PROC TABULATE

Setting the Percentage in PROC TABULATE SESUG Paper 193-2017 Setting the Percentage in PROC TABULATE David Franklin, QuintilesIMS, Cambridge, MA ABSTRACT PROC TABULATE is a very powerful procedure which can do statistics and frequency counts

More information

Cell Phone Location Data for Travel Behavior Analysis

Cell Phone Location Data for Travel Behavior Analysis Cell Phone Location Data for Travel Behavior Analysis NCHRP 08-95 presented to Southeast Florida FSUTMS User Group presented by Cambridge Systematics, Inc. Kimon Proussaloglou March 1 15, 2019 The Next

More information

The Dataset and The National ID Number. Panel 3: Government Data Friday, November 20, 2015 By Deborah Rose, PhD Visiting Scholar, FXB Center, Harvard

The Dataset and The National ID Number. Panel 3: Government Data Friday, November 20, 2015 By Deborah Rose, PhD Visiting Scholar, FXB Center, Harvard The Dataset and The National ID Number Panel 3: Government Data Friday, November 20, 2015 By Deborah Rose, PhD Visiting Scholar, FXB Center, Harvard The Dataset and The National ID Number The major topics

More information

Accessing QWIs: LED Extraction Tool and QWI Explorer

Accessing QWIs: LED Extraction Tool and QWI Explorer Accessing QWIs: LED Extraction Tool and QWI Explorer Unlocking the Powerful Labor-force Information of the Quarterly Workforce Indicators Heath Hayward Geographer LEHD Program Center for Economic Studies

More information

PREVENTION WEB CIVIL CITATION PROGRAM REGISTRATION WIZARD. A JJIS User Guide

PREVENTION WEB CIVIL CITATION PROGRAM REGISTRATION WIZARD. A JJIS User Guide PREVENTION WEB CIVIL CITATION PROGRAM REGISTRATION WIZARD A JJIS User Guide Table of Contents Overview... 2 Logging in to Prevention Web... 2 Program Registration Wizard Youth Search... 3 Program Registration

More information

The Power of Combining Data with the PROC SQL

The Power of Combining Data with the PROC SQL ABSTRACT Paper CC-09 The Power of Combining Data with the PROC SQL Stacey Slone, University of Kentucky Markey Cancer Center Combining two data sets which contain a common identifier with a MERGE statement

More information

OVERVIEW OF WINDOWS IN STATA

OVERVIEW OF WINDOWS IN STATA OBJECTIVES OF STATA This course is the series of statistical analysis using Stata. It is designed to acquire basic skill on Stata and produce a technical reports in the statistical views. After completion

More information

The goal of this task is to assemble the input data required to deliver national accessibility reports and

The goal of this task is to assemble the input data required to deliver national accessibility reports and The goal of this task is to assemble the data required to deliver national accessibility reports and blocklevel accessibility datasets to each of the project participants. Subsequent tasks in this project

More information

Using SAS software to shrink the data in your applications

Using SAS software to shrink the data in your applications Paper 991-2016 Using SAS software to shrink the data in your applications Ahmed Al-Attar, AnA Data Warehousing Consulting LLC, McLean, VA ABSTRACT This paper discusses the techniques I used at the Census

More information

A Side of Hash for You To Dig Into

A Side of Hash for You To Dig Into A Side of Hash for You To Dig Into Shan Ali Rasul, Indigo Books & Music Inc, Toronto, Ontario, Canada. ABSTRACT Within the realm of Customer Relationship Management (CRM) there is always a need for segmenting

More information

A Macro that Creates U.S Census Tracts Keyhole Markup Language Files for Google Map Use

A Macro that Creates U.S Census Tracts Keyhole Markup Language Files for Google Map Use Paper 2418-2018 A Macro that Creates U.S Census Tracts Keyhole Markup Language Files for Google Map Use ABSTRACT Ting Sa, Cincinnati Children s Hospital Medical Center This paper introduces a macro that

More information

Smartphone Ownership 2013 Update

Smartphone Ownership 2013 Update www.pewresearch.org JUNE 5, 2013 Smartphone Ownership 2013 Update 56% of American adults now own a smartphone of some kind; Android and iphone owners account for half of the cell phone user population.

More information

Predict Outcomes and Reveal Relationships in Categorical Data

Predict Outcomes and Reveal Relationships in Categorical Data PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,

More information

HONORS ACTIVITY #2 EXPONENTIAL GROWTH & DEVELOPING A MODEL

HONORS ACTIVITY #2 EXPONENTIAL GROWTH & DEVELOPING A MODEL Name HONORS ACTIVITY #2 EXPONENTIAL GROWTH & DEVELOPING A MODEL SECTION I: A SIMPLE MODEL FOR POPULATION GROWTH Goal: This activity introduces the concept of a model using the example of a simple population

More information

ABSTRACT INTRODUCTION WHERE TO START? 1. DATA CHECK FOR CONSISTENCIES

ABSTRACT INTRODUCTION WHERE TO START? 1. DATA CHECK FOR CONSISTENCIES Developing Integrated Summary of Safety Database using CDISC Standards Rajkumar Sharma, Genentech Inc., A member of the Roche Group, South San Francisco, CA ABSTRACT Most individual trials are not powered

More information

MapReduce Design Patterns

MapReduce Design Patterns MapReduce Design Patterns MapReduce Restrictions Any algorithm that needs to be implemented using MapReduce must be expressed in terms of a small number of rigidly defined components that must fit together

More information

DSCI 210: Data Science Excel Take-home Spring 2015 Points: 50

DSCI 210: Data Science Excel Take-home Spring 2015 Points: 50 DSCI 210: Data Science Excel Take-home Spring 2015 Points: 50 Name: Consider the Baby Names dataset on the course webpage. This file contains the unique names of over 600 babies born in Rochester MN over

More information

Improving Internet Connectivity in Rural West Virginia

Improving Internet Connectivity in Rural West Virginia Team Four-year Transforms 2/16/2017 SYST 699 Spring 2017 Improving Internet Connectivity in Rural West Virginia Contents Contents 1 Introduction 2 Background Information 2 Existing Services and Coverage

More information

North Central Texas Council of Governments

North Central Texas Council of Governments Making Data Useful North Central Texas Council of Governments Research and InformationServices March 29, 2012 NCTCOG Voluntary association of, by, and for local governments Assist local governments Planning

More information

Multiple Facts about Multilabel Formats

Multiple Facts about Multilabel Formats Multiple Facts about Multilabel Formats Gwen D. Babcock, ew York State Department of Health, Troy, Y ABSTRACT PROC FORMAT is a powerful procedure which allows the viewing and summarizing of data in various

More information

Cause/reason (if currently known)

Cause/reason (if currently known) The Affirmatively Furthering Fair Housing Data and Mapping Tool (AFFH-T): for AFFH-T Release 4.1 Published: December 20, As part of its efforts to support program participants in conducting their Assessments

More information

A Framework for Data Quality for Synthetic Information. Ragini Gupta

A Framework for Data Quality for Synthetic Information. Ragini Gupta A Framework for Data Quality for Synthetic Information Ragini Gupta Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information