Truncate, replicate, sample

Size: px
Start display at page:

Download "Truncate, replicate, sample"

Transcription

1 Truncate, replicate, sample A probabilistic method of integerisation Robin Lovelace & Dimitris Ballas Sheffield Presented at the IMA, May 2012, Dublin Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 1 / 26

2 Outline 1 Introduction Background Integerisation: what and why? Existing approaches 2 TRS: A probabilistic approach Truncate Replicate Sample 3 Results Speed of calculation Population size Accuracy 4 Conclusion 5 Further work Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 2 / 26

3 Background [2] Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 3 / 26

4 Background Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 4 / 26

5 Spatial microsimulation Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 5 / 26

6 Non-integer weights Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 6 / 26

7 What is integerisation? It s going from this: To this: Table: IPF results ID Weight Table: Integerised results ID Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 7 / 26

8 Why integerise? Makes your analysis easier* Conceptual advantage: e.g. discrete car data Gini index for inequality Mean is available from IPF results The relationships between discrete workers and jobs Individuals useful for agent-based modelling and dynamic microsimulation Best of both worlds? [4] *Subject to debate Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 8 / 26

9 Simple rounding and inclusion thresholds Simple rounding includes individuals who have weights > 0.5 [1] The problem: population totals do not match Solution: add individuals up to a given threshold count Inclusion threshold Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 9 / 26

10 Performance of existing approaches Simulated 4000 Method Rounding Threshold Census Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 10 / 26

11 Performance of existing approaches 2 All microdata (n = 4933) Sampled (n = 2541) Weight Index Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 11 / 26 Index

12 Design criteria for better integerisation Simplicity of simple rounding Correct population size of threshold approach Representative of individuals with higher weights Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 12 / 26

13 Outline 1 Introduction Background Integerisation: what and why? Existing approaches 2 TRS: A probabilistic approach Truncate Replicate Sample 3 Results Speed of calculation Population size Accuracy 4 Conclusion 5 Further work Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 13 / 26

14 Truncate As simple as In R, performed by command trunc() Always leads to a population underestimate (useful) Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 14 / 26

15 Replicate Simply replicate the individuals whose truncated weight > 1 Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 15 / 26

16 Sample The important part Fills up partially empty zones with representative individuals Sample size = zone population population after truncation and replication Probability is determined by weight remainder (e.g ) Simple to code in R: sample(x, size =..., prob =...) Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 16 / 26

17 Integerisation in action All microdata (n = 4933) Sampled (n = 2541) Weight Index Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 17 / 26 Index

18 Results Rounding Threshold TRS 3000 Simulated 2000 Constraint Age/sex Distance Mode NS SEC Census Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 18 / 26

19 Speed of calculation New method is faster than threshold method But integerisation small portion of overall computational time 3 seconds for integerisation vs 5 minutes for IPF Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 19 / 26

20 Population size Table: Differences between Census and simulated populations. Metric Rounding Threshold TRS Mean Standard deviation Max Min Mean oversample -13% 0.3% 0.0% TRS method ensures correct population size Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 20 / 26

21 Table: Accuracy results for integerisation techniques [3]. Method Variables TAE SAE Errors > 5% Zm 2 IPF Age/sex 9 0% 0% 0 Distance % 14% 640 Mode % 6% 593 NS-SEC 0 0% 0% 0 All % 5% 1233 Rounding Age/sex % 81% 5247 Distance % 80% Mode % 82% 8896 NS-SEC % 76% 7758 All % 80% Threshold Age/sex % 49% 1074 Distance % 83% 8890 Mode % 68% 3678 NS-SEC % 56% 2132 All % 63% TRS Age/sex % 28% 309 Distance % 53% 1449 Mode % 50% 1035 NS-SEC % 28% 392 All % 39% 3184 Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 21 / 26

22 Allows intra-zone analysis Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 22 / 26

23 Key references Dimitris Ballas, Graham Clarke, Danny Dorling, Heather Eyre, Bethan Thomas, and David Rossiter. SimBritain: a spatial microsimulation approach to population dynamics. Population, Space and Place, 11(1):13 34, January Stan Openshaw. The modifiable areal unit problem. Geo Books Norwich UK, David Voas and Paul Williamson. Evaluating Goodness-of-Fit Measures for Synthetic Microdata. Geographical and Environmental Modelling, 5(2): , November P Williamson, Mark Birkin, and P H Rees. The estimation of population microdata by using data from small area statistics and samples of anonymised records. Environment and Planning A, 30(5): , Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 23 / 26

24 Integerisation in action Age Constraint Mode Constraint Simulated Variable type Age/sex Distance Mode NS SEC Distance Constraint NS SEC Constraint Simulated Census Census Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 24 / 26

25 Integerisation in action 0.6 Error (proportion of points beyond 5% of census value Constraint Age/sex Distance Mode N. cars NS SEC Iteration Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 25 / 26

26 Contact me! Try the model! Robin Lovelace & Dimitris Ballas (Sheffield) Truncate, replicate, sample Integerisation 26 / 26

Package raker. October 10, 2017

Package raker. October 10, 2017 Title Easy Spatial Microsimulation (Raking) in R Version 0.2.1 Date 2017-10-10 Package raker October 10, 2017 Functions for performing spatial microsimulation ('raking') in R. Depends R (>= 3.4.0) License

More information

Geographical Inequalities, Spatial Scale and Small Area Statistics for England and Wales

Geographical Inequalities, Spatial Scale and Small Area Statistics for England and Wales Geographical Inequalities, Spatial Scale and Small Area Statistics for England and Wales Chris Lloyd Centre for Spatial Demographics Research, University of Liverpool, UK Email: c.d.lloyd@liverpool.ac.uk

More information

Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use

Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use Matthias Templ, Bernhard Meindl Statistics Austria & Vienna Uni. of Techn. Mai 2015 ODAM 2015, Olomouc, CZ Synthetic Data: Modelling and Generating Complex Close-to-Reality Data for Public Use Matthias

More information

Probability Models.S4 Simulating Random Variables

Probability Models.S4 Simulating Random Variables Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Probability Models.S4 Simulating Random Variables In the fashion of the last several sections, we will often create probability

More information

Microsimulation model user guide Flexible Modelling Framework

Microsimulation model user guide Flexible Modelling Framework National Centre for Research Methods Working Paper 06/13 Microsimulation model user guide Flexible Modelling Framework Kirk Harland, TALISMAN node, University of Leeds Microsimulation model user guide

More information

Synthetic Population Techniques in Activity- Based Research

Synthetic Population Techniques in Activity- Based Research 48 Chapter 3 Synthetic Population Techniques in Activity- Based Research Sungjin Cho Hasselt University, Belgium Tom Bellemans Hasselt University, Belgium Lieve Creemers Hasselt University, Belgium Luk

More information

TIBCO StreamBase 10 Distributed Computing and High Availability. November 2017

TIBCO StreamBase 10 Distributed Computing and High Availability. November 2017 TIBCO StreamBase 10 Distributed Computing and High Availability November 2017 Distributed Computing Distributed Computing location transparent objects and method invocation allowing transparent horizontal

More information

Transport Demand Modeling with Limited Data: Transfer and Calibration of a Model from another Geographical Region

Transport Demand Modeling with Limited Data: Transfer and Calibration of a Model from another Geographical Region Transport Demand Modeling with Limited Data: Transfer and Calibration of a Model from another Geographical Region Dominik Ziemke, Kai Nagel, Chandra Bhat heart Conference 2014 11 September 2014 Motivation

More information

Multimedia Storage Servers

Multimedia Storage Servers Multimedia Storage Servers Cyrus Shahabi shahabi@usc.edu Computer Science Department University of Southern California Los Angeles CA, 90089-0781 http://infolab.usc.edu 1 OUTLINE Introduction Continuous

More information

PopChange - An open source, reproducible research project

PopChange - An open source, reproducible research project PopChange - An open source, reproducible research project Nick Bearman 12 1 Centre for Spatial Demographics Research and Department of Geography and Planning, School of Environmental Sciences, University

More information

ABSTRACT INTRODUCTION

ABSTRACT INTRODUCTION Iterative Proportional Fitting and Population Dynamics using SAS Himanshu Joshi, Houston-Galveston Area Council, Houston, TX Dmitry Messen, Houston-Galveston Area Council, Houston, TX ABSTRACT For doing

More information

Statistical and Computational Challenges in Combining Information from Multiple data Sources. T. E. Raghunathan University of Michigan

Statistical and Computational Challenges in Combining Information from Multiple data Sources. T. E. Raghunathan University of Michigan Statistical and Computational Challenges in Combining Information from Multiple data Sources T. E. Raghunathan University of Michigan Opportunities Computational ability and cheap storage has made digitally

More information

Automated Estimation using Enterprise Architect August 2012 Laurence White Abstract.

Automated Estimation using Enterprise Architect August 2012 Laurence White Abstract. Predictive Current Retrospective Automated Estimation using Enterprise Architect Abstract. This paper details an approach for creating automated measures of the scale and complexity of an enhancement,

More information

CS47300: Web Information Search and Management

CS47300: Web Information Search and Management CS47300: Web Information Search and Management Web Search Prof. Chris Clifton 18 October 2017 Some slides courtesy Croft et al. Web Crawler Finds and downloads web pages automatically provides the collection

More information

How to calculate population and jobs within ½ mile radius of site

How to calculate population and jobs within ½ mile radius of site How to calculate population and jobs within ½ mile radius of site Caltrans Project P359, Trip Generation Rates for Transportation Impact Analyses of Smart Growth Land Use Projects SECTION PAGE Population

More information

An Introduction to Dynamic Simulation Modeling

An Introduction to Dynamic Simulation Modeling Esri International User Conference San Diego, CA Technical Workshops ****************** An Introduction to Dynamic Simulation Modeling Kevin M. Johnston Shitij Mehta Outline Model types - Descriptive versus

More information

SHRP2 C10: Jacksonville

SHRP2 C10: Jacksonville SHRP2 C10: Jacksonville Partnership to Develop an Integrated Advanced Travel Demand Model and a Fine grained Timesensitive Network Presented by: Stephen Lawe Key Agency Partners: Florida Department of

More information

T H E S H I F T T O SMARTPHONE DOMINANCE

T H E S H I F T T O SMARTPHONE DOMINANCE T H E S H I F T T O SMARTPHONE DOMINANCE Background To understand mobile migration patterns and which factors will accelerate the shift to a mobile-first for consumers and advertisers W H A T S C O V E

More information

Lecture 7: Decision Trees

Lecture 7: Decision Trees Lecture 7: Decision Trees Instructor: Outline 1 Geometric Perspective of Classification 2 Decision Trees Geometric Perspective of Classification Perspective of Classification Algorithmic Geometric Probabilistic...

More information

Archna Rani [1], Dr. Manu Pratap Singh [2] Research Scholar [1], Dr. B.R. Ambedkar University, Agra [2] India

Archna Rani [1], Dr. Manu Pratap Singh [2] Research Scholar [1], Dr. B.R. Ambedkar University, Agra [2] India Volume 4, Issue 3, March 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Performance Evaluation

More information

Announcements. Data Sources a list of data files and their sources, an example of what I am looking for:

Announcements. Data Sources a list of data files and their sources, an example of what I am looking for: Data Announcements Data Sources a list of data files and their sources, an example of what I am looking for: Source Map of Bangor MEGIS NG911 road file for Bangor MEGIS Tax maps for Bangor City Hall, may

More information

Scaling Distributed Machine Learning with the Parameter Server

Scaling Distributed Machine Learning with the Parameter Server Scaling Distributed Machine Learning with the Parameter Server Mu Li, David G. Andersen, Jun Woo Park, Alexander J. Smola, Amr Ahmed, Vanja Josifovski, James Long, Eugene J. Shekita, and Bor-Yiing Su Presented

More information

Authors: Rupa Krishnan, Harsha V. Madhyastha, Sridhar Srinivasan, Sushant Jain, Arvind Krishnamurthy, Thomas Anderson, Jie Gao

Authors: Rupa Krishnan, Harsha V. Madhyastha, Sridhar Srinivasan, Sushant Jain, Arvind Krishnamurthy, Thomas Anderson, Jie Gao Title: Moving Beyond End-to-End Path Information to Optimize CDN Performance Authors: Rupa Krishnan, Harsha V. Madhyastha, Sridhar Srinivasan, Sushant Jain, Arvind Krishnamurthy, Thomas Anderson, Jie Gao

More information

A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS

A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS A METHODOLOGY TO MATCH DISTRIBUTIONS OF BOTH HOUSEHOLD AND PERSON ATTRIBUTES IN THE GENERATION OF SYNTHETIC POPULATIONS Xin Ye Department of Civil and Environmental Engineering Arizona State University,

More information

Location-based Data Overlay for Intermittently-Connected Networks. Nathanael Thompson, RiccardoCrepaldi, Robin Kravets

Location-based Data Overlay for Intermittently-Connected Networks. Nathanael Thompson, RiccardoCrepaldi, Robin Kravets Location-based Data Overlay for Intermittently-Connected Networks Nathanael Thompson, RiccardoCrepaldi, Robin Kravets The Urban Experience Cracking civil infrastructures I-35 W Mississippi River bridge

More information

SQL 2016 and AGs: what s new? David Barbarin, dbi services

SQL 2016 and AGs: what s new? David Barbarin, dbi services SQL 2016 and AGs: what s new? David Barbarin, dbi services > whoami : David BARBARIN 6.5

More information

The Design of Approximation Algorithms

The Design of Approximation Algorithms The Design of Approximation Algorithms David P. Williamson Cornell University David B. Shmoys Cornell University m Щ0 CAMBRIDGE UNIVERSITY PRESS Contents Preface page ix I An Introduction to the Techniques

More information

The Challenges of Robust Inter-Vehicle Communications

The Challenges of Robust Inter-Vehicle Communications The Challenges of Robust Inter-Vehicle Communications IEEE VTC2005-Fall Marc Torrent-Moreno, Moritz Killat and Hannes Hartenstein DSN Research Group Institute of Telematics University of Karlsruhe Marc

More information

2011 INTERNATIONAL COMPARISON PROGRAM

2011 INTERNATIONAL COMPARISON PROGRAM 2011 INTERNATIONAL COMPARISON PROGRAM 2011 ICP DATA ACCESS AND ARCHIVING POLICY GUIDING PRINCIPLES AND PROCEDURES FOR DATA ACCESS ICP Global Office November 2011 Contents I. PURPOSE... 3 II. CONTEXT...

More information

A population grid for Andalusia Year Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013

A population grid for Andalusia Year Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013 A population grid for Andalusia Year 2013 Institute of Statistics and Cartography of Andalusia (IECA) Sofia (BU), 24 th October 2013 IECA project. Population grid cells sized 250 x 250m for Andalusia 1.

More information

Statistical Timing Analysis Using Bounds and Selective Enumeration

Statistical Timing Analysis Using Bounds and Selective Enumeration IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 22, NO. 9, SEPTEMBER 2003 1243 Statistical Timing Analysis Using Bounds and Selective Enumeration Aseem Agarwal, Student

More information

How to Rezone. A method for redistributing census populations to create bespoke geographies

How to Rezone. A method for redistributing census populations to create bespoke geographies How to Rezone A method for redistributing census populations to create bespoke geographies January 2014 Contents Contents... 2 Introduction... 3 Limitations of this method... 4 Extract the zip file...

More information

DONAR Decentralized Server Selection for Cloud Services

DONAR Decentralized Server Selection for Cloud Services DONAR Decentralized Server Selection for Cloud Services Patrick Wendell, Princeton University Joint work with Joe Wenjie Jiang, Michael J. Freedman, and Jennifer Rexford Outline Server selection background

More information

Performance evaluation. Performance evaluation. CS/COE0447: Computer Organization. It s an everyday process

Performance evaluation. Performance evaluation. CS/COE0447: Computer Organization. It s an everyday process Performance evaluation It s an everyday process CS/COE0447: Computer Organization and Assembly Language Chapter 4 Sangyeun Cho Dept. of Computer Science When you buy food Same quantity, then you look at

More information

ENM316E Simulation. The output obtained by running the simulation model once is also random.

ENM316E Simulation. The output obtained by running the simulation model once is also random. ENM 316E Simulation Lesson 6 Output analysis is the analysis of data generated from simulation study. The purpose of the output analysis To estimate the performance of a system To compare the performance

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Traffic simulation using Repast HPC Report. Yongqiang(Victor) TIAN

Traffic simulation using Repast HPC Report. Yongqiang(Victor) TIAN Traffic simulation using Repast HPC Report Yongqiang(Victor) TIAN Email: yongqtian2-c@my.cityu.edu.hk Jul-Aug 2016 Contents 1 Abstract.................................... 2 2 Introduction..................................

More information

Memory hierarchy review. ECE 154B Dmitri Strukov

Memory hierarchy review. ECE 154B Dmitri Strukov Memory hierarchy review ECE 154B Dmitri Strukov Outline Cache motivation Cache basics Six basic optimizations Virtual memory Cache performance Opteron example Processor-DRAM gap in latency Q1. How to deal

More information

How to deal with large numbers (millions) of entities in a system? IP devices in the internet (0.5 billion) Users in P2P network (millions)

How to deal with large numbers (millions) of entities in a system? IP devices in the internet (0.5 billion) Users in P2P network (millions) Designs for Scale How to deal with large numbers (millions) of entities in a system? IP devices in the internet (0.5 billion) Users in P2P network (millions) More generally: Are there advantages to large

More information

Developed as part of a research contract from EPA Region 9 Partnered with Dr. Paul English, CA DPH EHIB

Developed as part of a research contract from EPA Region 9 Partnered with Dr. Paul English, CA DPH EHIB Air quality hazards defined by CalEPA/ CARB with recommendations for health protective buffer zones to separate these hazards from sensitive populations Developed as part of a research contract from EPA

More information

DDI metadata for IPUMS I samples

DDI metadata for IPUMS I samples DDI metadata for IPUMS I samples Wendy Thomas Workshop Integrating Global Census Microdata : Dublin Ireland, 58th ISI What is DDI DDI is a metadata standard d focused ocusedprimarily on microdata from

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

Mission ImposSERPble:

Mission ImposSERPble: Mission ImposSERPble: Establishing Google Click-Through Rates Behavioral Study by Slingshot SEO, Inc. using client data from January June 2011 What s Inside 1 2 3 4 5 6 7 Executive Summary Main Objectives

More information

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December IOmark- VDI IBM IBM FlashSystem V9000 Test Report: VDI- 151205- a Test Report Date: 5, December 2015 Copyright 2010-2015 Evaluator Group, Inc. All rights reserved. IOmark- VDI, IOmark- VM, VDI- IOmark,

More information

Applications. Oversampled 3D scan data. ~150k triangles ~80k triangles

Applications. Oversampled 3D scan data. ~150k triangles ~80k triangles Mesh Simplification Applications Oversampled 3D scan data ~150k triangles ~80k triangles 2 Applications Overtessellation: E.g. iso-surface extraction 3 Applications Multi-resolution hierarchies for efficient

More information

Episode 5. Scheduling and Traffic Management

Episode 5. Scheduling and Traffic Management Episode 5. Scheduling and Traffic Management Part 2 Baochun Li Department of Electrical and Computer Engineering University of Toronto Outline What is scheduling? Why do we need it? Requirements of a scheduling

More information

Interactive Design and Visualization of Urban Spaces using Geometrical and Behavioral Modeling

Interactive Design and Visualization of Urban Spaces using Geometrical and Behavioral Modeling Interactive Design and Visualization of Urban Spaces using Geometrical and Behavioral Modeling Carlos Vanegas 1,4,5 Daniel Aliaga 1 Bedřich Beneš 2 Paul Waddell 3 1 Computer Science, Purdue University,

More information

Project 2: CPU Scheduling Simulator

Project 2: CPU Scheduling Simulator Project 2: CPU Scheduling Simulator CSCI 442, Spring 2017 Assigned Date: March 2, 2017 Intermediate Deliverable 1 Due: March 10, 2017 @ 11:59pm Intermediate Deliverable 2 Due: March 24, 2017 @ 11:59pm

More information

COT 6936: Topics in Algorithms! Giri Narasimhan. ECS 254A / EC 2443; Phone: x3748

COT 6936: Topics in Algorithms! Giri Narasimhan. ECS 254A / EC 2443; Phone: x3748 COT 6936: Topics in Algorithms! Giri Narasimhan ECS 254A / EC 2443; Phone: x3748 giri@cs.fiu.edu http://www.cs.fiu.edu/~giri/teach/cot6936_s12.html https://moodle.cis.fiu.edu/v2.1/course/view.php?id=174

More information

Expectation Maximization!

Expectation Maximization! Expectation Maximization! adapted from: Doug Downey and Bryan Pardo, Northwestern University and http://www.stanford.edu/class/cs276/handouts/lecture17-clustering.ppt Steps in Clustering Select Features

More information

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017 CPSC 531: System Modeling and Simulation Carey Williamson Department of Computer Science University of Calgary Fall 2017 Recap: Simulation Model Taxonomy 2 Recap: DES Model Development How to develop a

More information

Delay Tolerant Networking. Thomas Plagemann Distributed Multimedia Systems Group Department of Informatics University of Oslo.

Delay Tolerant Networking. Thomas Plagemann Distributed Multimedia Systems Group Department of Informatics University of Oslo. Delay Tolerant Networking Thomas Plagemann Distributed Multimedia Systems Group Department of Informatics University of Oslo Outline Background, motivation, overview Epidemic routing Message ferrying Mobility/density

More information

EDI Introduction. Table of Contents

EDI Introduction. Table of Contents Introduction Table of Contents 1. INTRODUCTION... 2 1.1 GENERIC ORDER FLOW BUSINESS MODEL... 2 1.2 DEVELOPER WORKSHEETS USAGE DEFINITIONS... 8 1.3 CONTENT... 9 1.4 TRADING PARTNER ACCESS INFORMATION...

More information

Properties of Processes

Properties of Processes CPU Scheduling Properties of Processes CPU I/O Burst Cycle Process execution consists of a cycle of CPU execution and I/O wait. CPU burst distribution: CPU Scheduler Selects from among the processes that

More information

A Constrained Delaunay Triangle Mesh Method for Three-Dimensional Unstructured Boundary Point Cloud

A Constrained Delaunay Triangle Mesh Method for Three-Dimensional Unstructured Boundary Point Cloud International Journal of Computer Systems (ISSN: 2394-1065), Volume 03 Issue 02, February, 2016 Available at http://www.ijcsonline.com/ A Constrained Delaunay Triangle Mesh Method for Three-Dimensional

More information

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior Who? Prof. Aris Anagnostopoulos Prof. Luciana S. Buriol Prof. Guido Schäfer What will We Cover? Topics: Network properties

More information

Parallel DBMS. Parallel Database Systems. PDBS vs Distributed DBS. Types of Parallelism. Goals and Metrics Speedup. Types of Parallelism

Parallel DBMS. Parallel Database Systems. PDBS vs Distributed DBS. Types of Parallelism. Goals and Metrics Speedup. Types of Parallelism Parallel DBMS Parallel Database Systems CS5225 Parallel DB 1 Uniprocessor technology has reached its limit Difficult to build machines powerful enough to meet the CPU and I/O demands of DBMS serving large

More information

Domain Name System (DNS)

Domain Name System (DNS) Domain Name System (DNS) Outline Naming Hosts Domain Name Hierarchy Zones DNS Records Name Resolution CS 640 1 Naming Hosts Thus far we have identified hosts using IP addresses and MAC address Hard for

More information

Efficient solutions for the monitoring of the Internet

Efficient solutions for the monitoring of the Internet Efficient solutions for the monitoring of the Internet Chadi BARAKAT INRIA Sophia Antipolis, France Planète research group HDR defense January 22, 2009 Email: Chadi.Barakat@sophia.inria.fr WEB: http://www.inria.fr/planete/chadi

More information

Outline. CS 6776 Evolutionary Computation. Numerical Optimization. Fitness Function. ,x 2. ) = x 2 1. , x , 5.0 x 1.

Outline. CS 6776 Evolutionary Computation. Numerical Optimization. Fitness Function. ,x 2. ) = x 2 1. , x , 5.0 x 1. Outline CS 6776 Evolutionary Computation January 21, 2014 Problem modeling includes representation design and Fitness Function definition. Fitness function: Unconstrained optimization/modeling Constrained

More information

1. Estimation equations for strip transect sampling, using notation consistent with that used to

1. Estimation equations for strip transect sampling, using notation consistent with that used to Web-based Supplementary Materials for Line Transect Methods for Plant Surveys by S.T. Buckland, D.L. Borchers, A. Johnston, P.A. Henrys and T.A. Marques Web Appendix A. Introduction In this on-line appendix,

More information

CS 340 Lec. 4: K-Nearest Neighbors

CS 340 Lec. 4: K-Nearest Neighbors CS 340 Lec. 4: K-Nearest Neighbors AD January 2011 AD () CS 340 Lec. 4: K-Nearest Neighbors January 2011 1 / 23 K-Nearest Neighbors Introduction Choice of Metric Overfitting and Underfitting Selection

More information

Midterm Exam Solutions Amy Murphy 28 February 2001

Midterm Exam Solutions Amy Murphy 28 February 2001 University of Rochester Midterm Exam Solutions Amy Murphy 8 February 00 Computer Systems (CSC/56) Read before beginning: Please write clearly. Illegible answers cannot be graded. Be sure to identify all

More information

simsalud: Design and Implementation of an Open-source Wizard based Spatial Microsimulation Framework

simsalud: Design and Implementation of an Open-source Wizard based Spatial Microsimulation Framework INTERNATIONAL JOURNAL OF MICROSIMULATION (2017) 10(2) 118-143 INTERNATIONAL MICROSIMULATION ASSOCIATION simsalud: Design and Implementation of an Open-source Wizard based Spatial Microsimulation Framework

More information

Job Aid: Setting User Preferences

Job Aid: Setting User Preferences ZEBRA Repair Order Portal Job Aid: Setting User Preferences Updated August 2017 Setting User Preferences Overview The Repair Order Portal allows users to manage the following preferences: Time Zone: If

More information

Evaluation of Moving Object Tracking Techniques for Video Surveillance Applications

Evaluation of Moving Object Tracking Techniques for Video Surveillance Applications International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Evaluation

More information

Chapter Three: Contents

Chapter Three: Contents Volume Three Modules 15 January 2003 i Chapter Three: Contents (Activity Generator 15 January 2003 LA-UR-00-1725 TRANSIMS 3.0) 1. INTRODUCTION...2 1.1 OVERVIEW... 2 1.2 PURPOSE... 2 1.3 ACTIVITY GENERATOR

More information

ME/CS 132: Advanced Robotics: Navigation and Vision

ME/CS 132: Advanced Robotics: Navigation and Vision ME/CS 132: Advanced Robotics: Navigation and Vision Lecture #5: Search Algorithm 1 Yoshiaki Kuwata 4/12/2011 Lecture Overview Introduction Label Correcting Algorithm Core idea Depth-first search Breadth-first

More information

Gradient-Free Boundary Tracking* Zhong Hu Faculty Advisor: Todd Wittman August 2007 UCLA Department of Mathematics

Gradient-Free Boundary Tracking* Zhong Hu Faculty Advisor: Todd Wittman August 2007 UCLA Department of Mathematics Section I. Introduction Gradient-Free Boundary Tracking* Zhong Hu Faculty Advisor: Todd Wittman August 2007 UCLA Department of Mathematics In my paper, my main objective is to track objects in hyperspectral

More information

PureEngage Cloud Release Note. Outbound

PureEngage Cloud Release Note. Outbound PureEngage Cloud Release Note Outbound 5/14/2018 Outbound Note: Not all changes listed below may pertain to your deployment. May 7, 2018 (15.5.0) April 5, 2018 (15.4.0) March 12, 2018 (15.3.0) February

More information

Approximation Basics

Approximation Basics Milestones, Concepts, and Examples Xiaofeng Gao Department of Computer Science and Engineering Shanghai Jiao Tong University, P.R.China Spring 2015 Spring, 2015 Xiaofeng Gao 1/53 Outline History NP Optimization

More information

Midterm Exam Amy Murphy 6 March 2002

Midterm Exam Amy Murphy 6 March 2002 University of Rochester Midterm Exam Amy Murphy 6 March 2002 Computer Systems (CSC2/456) Read before beginning: Please write clearly. Illegible answers cannot be graded. Be sure to identify all of your

More information

V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond David Adams & Dan Beckett. All rights reserved.

V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond David Adams & Dan Beckett. All rights reserved. Summit 97 V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond by David Adams & Dan Beckett 1997 David Adams & Dan Beckett. All rights reserved. Content adapted from Programming 4th Dimension:

More information

Thresholds Determination for Probabilistic Rough Sets with Genetic Algorithms

Thresholds Determination for Probabilistic Rough Sets with Genetic Algorithms Thresholds Determination for Probabilistic Rough Sets with Genetic Algorithms Babar Majeed, Nouman Azam, JingTao Yao Department of Computer Science University of Regina {majeed2b,azam200n,jtyao}@cs.uregina.ca

More information

Computer Architecture

Computer Architecture Computer Architecture Architecture The art and science of designing and constructing buildings A style and method of design and construction Design, the way components fit together Computer Architecture

More information

Lecture 5: Performance Analysis I

Lecture 5: Performance Analysis I CS 6323 : Modeling and Inference Lecture 5: Performance Analysis I Prof. Gregory Provan Department of Computer Science University College Cork Slides: Based on M. Yin (Performability Analysis) Overview

More information

Buffer Aware Routing in Interplanetary Ad Hoc Network

Buffer Aware Routing in Interplanetary Ad Hoc Network Buffer Aware Routing in Interplanetary Ad Hoc Network Kamal Mistry (Wipro Technologies, Bangalore) Sanjay Srivastava (DA-IICT, Gandhinagar) R. B. Lenin (DA-IICT, Gandhinagar) January 8, 2009 Buffer Aware

More information

Enabling Efficient and Accurate Large-Scale Simulations of VANETs for Vehicular Traffic Management

Enabling Efficient and Accurate Large-Scale Simulations of VANETs for Vehicular Traffic Management Enabling Efficient and Accurate Large-Scale Simulations of VANETs for Vehicular Traffic Management 1, Felix Schmidt-Eisenlohr 1, Hannes Hartenstein 1, Christian Rössel 2, Peter Vortisch 2, Silja Assenmacher

More information

Waitlist Reservations Management Quick Reference Guide

Waitlist Reservations Management Quick Reference Guide Waitlist Reservations Management Quick Reference Guide Published Date: November 15 Introduction The purpose of this Quick Reference Guide is to outline the required configuration, set-up and management

More information

Smart Remitter Target - SmaRT A NEW APPROACH TO MONITOR REMITTANCE PRICE TRENDS

Smart Remitter Target - SmaRT A NEW APPROACH TO MONITOR REMITTANCE PRICE TRENDS Smart Remitter Target - SmaRT A NEW APPROACH TO MONITOR REMITTANCE PRICE TRENDS RATIONALE The Global average remains a simple, robust and sufficiently accurate tool to measure the impacts of cost reduction

More information

Exploring Utah's Information Technology Labor Migration. Cory Stahle, Senior Economist, Utah Department of Workforce Services

Exploring Utah's Information Technology Labor Migration. Cory Stahle, Senior Economist, Utah Department of Workforce Services Exploring Utah's Information Technology Labor Migration Cory Stahle, Senior Economist, Utah Department of Workforce Services A Little Warmup Why We Did the Research 3 How much does in-migration of labor

More information

Predict Outcomes and Reveal Relationships in Categorical Data

Predict Outcomes and Reveal Relationships in Categorical Data PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,

More information

The Artifact Subspace Reconstruction method. Christian A Kothe SCCN / INC / UCSD January 2013

The Artifact Subspace Reconstruction method. Christian A Kothe SCCN / INC / UCSD January 2013 The Artifact Subspace Reconstruction method Christian A Kothe SCCN / INC / UCSD January 2013 Artifact Subspace Reconstruction New algorithm to remove non-stationary highvariance signals from EEG Reconstructs

More information

2011 INTERNATIONAL COMPARISON PROGRAM

2011 INTERNATIONAL COMPARISON PROGRAM 2011 INTERNATIONAL COMPARISON PROGRAM 2011 ICP DATA ACCESS AND ARCHIVING POLICY GUIDING PRINCIPLES AND PROCEDURES FOR DATA ACCESS ICP Global Office June 2011 Contents I. PURPOSE... 3 II. CONTEXT... 3 III.

More information

Automatic training example selection for scalable unsupervised record linkage

Automatic training example selection for scalable unsupervised record linkage Automatic training example selection for scalable unsupervised record linkage Peter Christen Department of Computer Science, The Australian National University, Canberra, Australia Contact: peter.christen@anu.edu.au

More information

Example 1 - Joining datasets by a common variable: Creating a single table using multiple datasets Other features illustrated: Aggregate data multi-variable recode, computational calculation Background:

More information

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December IOmark- VM IBM IBM FlashSystem V9000 Test Report: VM- 151205- a Test Report Date: 5, December 2015 Copyright 2010-2015 Evaluator Group, Inc. All rights reserved. IOmark- VM, IOmark- VDI, VDI- IOmark, and

More information

Accuracy versus precision

Accuracy versus precision Accuracy versus precision Accuracy is a consistent error from the true value, but not necessarily a good or precise error Precision is a consistent result within a small error, but not necessarily anywhere

More information

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 11 VECTOR DATA ANALYSIS 11.1 Buffering 11.1.1 Variations in Buffering Box 11.1 Riparian Buffer Width 11.1.2 Applications of Buffering 11.2 Overlay 11.2.1 Feature Type and Overlay 11.2.2 Overlay

More information

Minnesota Hosting Capacity Analysis

Minnesota Hosting Capacity Analysis Minnesota Hosting Capacity Analysis MIPSYCON November 8, 2017 Chris Punt, P.E. Outline What is Hosting Capacity? Background DRIVE Results Visualization Accuracy Timeline Next Steps 2 What is Hosting Capacity?

More information

HAEC-SIM: A Simulation Framework for Highly Adaptive Energy-Efficient Computing Platforms

HAEC-SIM: A Simulation Framework for Highly Adaptive Energy-Efficient Computing Platforms HAEC-SIM: A Simulation Framework for Highly Adaptive Energy-Efficient Computing Platforms SIMUTOOLS 2015 Athens, Greece Mario Bielert, Florina M. Ciorba, Kim Feldhoff, Thomas Ilsche, Wolfgang E. Nagel

More information

Implementation and Evaluation of Mobility Models with OPNET

Implementation and Evaluation of Mobility Models with OPNET Lehrstuhl Netzarchitekturen und Netzdienste Institut für Informatik Technische Universität München Implementation and Evaluation of Mobility Models with OPNET Abschlussvortrag zur Masterarbeit von Thomas

More information

Constrained Skyline Query Processing against Distributed Data Sites

Constrained Skyline Query Processing against Distributed Data Sites Constrained Skyline Query Processing against Distributed Data Divya.G* 1, V.Ranjith Naik *2 1,2 Department of Computer Science Engineering Swarnandhra College of Engg & Tech, Narsapuram-534280, A.P., India.

More information

Probabilistic Double-Distance Algorithm of Search after Static or Moving Target by Autonomous Mobile Agent

Probabilistic Double-Distance Algorithm of Search after Static or Moving Target by Autonomous Mobile Agent 2010 IEEE 26-th Convention of Electrical and Electronics Engineers in Israel Probabilistic Double-Distance Algorithm of Search after Static or Moving Target by Autonomous Mobile Agent Eugene Kagan Dept.

More information

Sampling Distribution Examples Sections 15.4, 15.5

Sampling Distribution Examples Sections 15.4, 15.5 Sampling Distribution Examples Sections 15.4, 15.5 Lecture 27 Robb T. Koether Hampden-Sydney College Wed, Mar 2, 2016 Robb T. Koether (Hampden-Sydney College)Sampling Distribution ExamplesSections 15.4,

More information

Regression Based Cluster Formation for Enhancement of Lifetime of WSN

Regression Based Cluster Formation for Enhancement of Lifetime of WSN Regression Based Cluster Formation for Enhancement of Lifetime of WSN K. Lakshmi Joshitha Assistant Professor Sri Sai Ram Engineering College Chennai, India lakshmijoshitha@yahoo.com A. Gangasri PG Scholar

More information

Computer Science 4500 Operating Systems

Computer Science 4500 Operating Systems Computer Science 4500 Operating Systems Module 6 Process Scheduling Methods Updated: September 25, 2014 2008 Stanley A. Wileman, Jr. Operating Systems Slide 1 1 In This Module Batch and interactive workloads

More information

Operator Certification Program Update. Focus On Change 2018

Operator Certification Program Update. Focus On Change 2018 Operator Certification Program Update Focus On Change 2018 Current License Numbers 4,699 Drinking water treatment plant operators 4,843 Wastewater treatment plant operators 4,465 Water distribution system

More information

Cluster Evaluation and Expectation Maximization! adapted from: Doug Downey and Bryan Pardo, Northwestern University

Cluster Evaluation and Expectation Maximization! adapted from: Doug Downey and Bryan Pardo, Northwestern University Cluster Evaluation and Expectation Maximization! adapted from: Doug Downey and Bryan Pardo, Northwestern University Kinds of Clustering Sequential Fast Cost Optimization Fixed number of clusters Hierarchical

More information

Evolutionary Algorithms. CS Evolutionary Algorithms 1

Evolutionary Algorithms. CS Evolutionary Algorithms 1 Evolutionary Algorithms CS 478 - Evolutionary Algorithms 1 Evolutionary Computation/Algorithms Genetic Algorithms l Simulate natural evolution of structures via selection and reproduction, based on performance

More information