Proximity Analysis of Vulnerable Facilities to Infrastructure Facilities Using Density Estimation: California Facilities

Size: px
Start display at page:

Download "Proximity Analysis of Vulnerable Facilities to Infrastructure Facilities Using Density Estimation: California Facilities"

Transcription

1 Proximity Analysis of Vulnerable Facilities to Infrastructure Facilities Using Estimation: California Facilities DRAFT REPORT, May 6, 2008 Rae Zimmerman, Jeffrey S. Simonoff, Zvia Segal Naphtali, Carlos E. Restrepo and Henry Willis Introduction Public and private decision-makers continue to seek risk-based approaches to allocating funds to secure facilities that are potentially vulnerable to terrorist attacks. An important input into decisions about allocating funds for security is the density of critical infrastructure around facilities that could be potential targets of an attack. A terrorist attack on a vulnerable facility located in the vicinity of a critical infrastructure system could disable that system, thus have cascading impacts on other infrastructure systems, and ultimately on the economy. Objective The objective of this work is to develop a methodology to help decision-makers and policymakers rank potentially vulnerable facilities (i.e., facilities vulnerable to terrorist attack) by developing and applying a measure of their proximity to infrastructure facilities. High infrastructure density around a vulnerable facility may exacerbate overall vulnerability. The work presented in this report uses California as a case area. Developing measures of infrastructure density around facilities considered vulnerable would support decision-makers in allocating scarce financial resources to a number of those facilities and distribute available resources for security purposes. Financial resources are assumed to be distributed at the county level, thus, those counties with the highest number of vulnerable facilities close to infrastructure facilities would receive more resources for security. Hence, the methodology presented in this report can be used as an input to prioritize facilities according to infrastructure density around them. This work was initiated after a discussion that Rae Zimmerman, Carlos E. Restrepo and Henry Willis had about interesting policy research issues and approaches related to resource allocation for security involving infrastructure. Methodology and Approach The approach adopted is to obtain distances between the set of vulnerable facilities provided by CREATE (indicated as red dots in Figure 1) and selected infrastructure facilities from publicly available datasets. Where databases were only available in the form of addresses, the data was geocoded to obtain a consistently defined and analyzable database. The data was then put into a spreadsheet. A statistical analysis was undertaken whose objective was to provide a density measure of the infrastructure around vulnerable facilities. The infrastructure density measure developed is based on an estimate of the probability density of infrastructure distances from each vulnerable facility ( red dots ). 1

2 Data Vulnerable facilities are the list of facilities from CREATE in the state of California (referred to as red dots in this paper). The identity of these facilities has not been revealed by CREATE other than the fact that they belong to categories of important facilities that are at risk for being attacked by terrorists. Infrastructure facilities are the facilities in the NYU-Wagner ICIS infrastructure databases obtained from publicly-available sources and include facilities such as dams, bridges, airports, ports, major highways, etc. Using GIS to Map Facilities and to Estimate Distances As an initial exercise the team tested the methodologies with one set of infrastructure data: the location of dams and their spatial location relative to the vulnerable facilities provided by CREATE. Zvia Segal Naphtali produced some initial GIS maps to examine the data. Figure 1 shows the vulnerable facilities ( red dots ) and dams ( yellow dots ) for the state of California. Figure 1. Vulnerable Facilities and Dams in California As a second step the team decided to narrow down the dams database to include only high hazard dams, as these would be the dams that, if damaged, would result in the largest effects on the surrounding population. Naphtali used ArcGIS to estimate the distance between vulnerable facilities and high hazard dams. Table 1 shows a sample of the resulting data. The columns represent the vulnerable facilities (178 facilities) and each row represents a high hazard dam. Distances are in kilometers. The same procedure was followed to estimate distances between each vulnerable facility and all the Bay Area Rapid Transit (BART) stations. 2

3 Table 1. Distance Estimated Between Vulnerable Facilities and High Hazard Dams in California These distance estimates were then used as inputs into an analysis of infrastructure density around vulnerable facilities that uses smoothing methods. Jeffrey S. Simonoff developed two separate sets of analyses for infrastructure density measures using high hazard dams and Bay Area Rapid Transit (BART) stations. These analyses are presented below. Measures of Infrastructure Using Dams for the Infrastructure Measure The objective of this work is to provide a measure of the potential impact of an attack on a vulnerable facility ( red dots in Figure 1) on other infrastructure ( yellow dots in Figure 1). The data consists of a set of distances from the vulnerable facilities to a high hazard dam (in kilometers). The analytical tool used is an estimate of the probability density of infrastructure distances from the vulnerable facility. This takes as the underlying model that infrastructure is placed randomly relative to the vulnerable facility according to some underlying probability distribution that depends on both the vulnerable facility and the infrastructure. This is not technically true, of course, but as a working model it allows for measures of the density of closeness of infrastructure to vulnerable facilities. Consider a sample { x1, K, x n } of observations from a density f ( x ); in our case the x values are distances from the vulnerable facility, and f () is the underlying probability density function for the distribution of infrastructure distances from the vulnerable facility. The measures used are based on a log-linear local likelihood density estimator (Hjort and Jones, 1996; Loader, 1996). The idea behind this density estimator is to approximate the log-density locally with a straight line. Specifically, the density estimate at any point x is f ( x ) = exp( a 0), where a0 is the constant term of the minimizer of n xi x u x K [ a0 + a1( xi x)] n K exp[ a0 + a1( u x)]. i= 1 h du h ^ ^ ^ 3

4 The standard kernel density estimator corresponds to this estimator with taken to be equal to 0, but the more general version has the advantage of automatically adjusting for the tendency of the kernel estimator to be biased downwards at the boundaries 1. The smoothness of the density estimate is determined by the smoothing parameter h, with too small a value resulting in an undersmoothed estimate that has too many local bumps and dips and too large a value resulting in an oversmoothed estimate that potentially smooths over important local structure in the underlying density. The smoothing parameter can be chosen by eye in a particular case, but it is useful to have an automatic choice (particularly since we are estimating almost 200 densities for each type of infrastructure here). A method that has been shown to be useful for local likelihood density estimation is choosing h based on AIC C, the corrected Akaike Information Criterion (Hurvich, Simonoff, and Tsai, 1998; Simonoff, 1998). Once the density estimate is formed, it can be used to construct different measures of infrastructure density around the vulnerable facility. The simplest measure is which can be interpreted as the estimated probability of infrastructure occurring within one kilometer of the vulnerable facility. Similarly, the measure d ^ I( d) = f( u) du 0 a 1 ^ (0), f is an estimate of the probability of infrastructure occurring within d kilometers of the vulnerable facility (this is the area under the estimated density curve from 0 to d); values of d such as 50 or 100 would seem reasonable to represent distances within which infrastructure could be affected by an attack on the vulnerable facility. Note that the first proposed measure is I (0) ; while technically the probability of infrastructure occurring within one kilometer would be I (1) rather than I (0), these values will be virtually identical to each other. Examples of infrastructure density estimation Four vulnerable facilities were initially examined (numbers 0, 11, 44, and 136 in the database). These were chosen because they reflect different patterns in the infrastructure density, and cover reasonably well the different patterns that seem to be in the data. The smoothing parameters for the density estimates have been chosen here by eye, rather than using an automatic criterion. Vulnerable facility 0 1 See Simonoff (1996, Section 3.4) for more discussion of the advantages of local likelihood estimation over kernel estimation. 4

5 Sensitive facility The distribution is bimodal, with a smaller cluster of infrastructure centered roughly 250 kilometers away, and a larger cluster centered roughly 700 kilometers away. Since Los Angeles and San Francisco are roughly 600 kilometers apart, it is reasonable to think that the two clusters correspond to centers of dam infrastructure in those two areas. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , and I(100)= Vulnerable facility 11 5

6 Sensitive facility The distribution is trimodal, with a tight cluster roughly 50 kilometers away, a larger cluster centered roughly 200 kilometers away, and a smaller cluster centered 600 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , and I(100)= Vulnerable facility 44 6

7 Sensitive facility The distribution is bimodal, with a large cluster roughly 200 kilometers away, and a smaller cluster centered 500 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , and I(100)= Vulnerable facility 136 7

8 Sensitive facility The distribution is unimodal, with infrastructure centered roughly 300 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , and I(100)= The different measures imply different orderings of infrastructure density (and hence, potentially different priority levels for resource allocation) for the four vulnerable facilities (ordered from highest to lowest), but all agree that facilities 11 and 44 are closer to dam infrastructure, while facilities 0 and 136 are farther away: I(0) I(50) I(100)

9 Using BART stations for the Infrastructure Measure This section uses the same approach to measure infrastructure density around a vulnerable facility as the previous section but instead of using dams the infrastructure data refers to Bay Area Rapid Transit District (BART) stations. The situation here is less complex than that relating to dams, since there are only 43 BART stations, and they are concentrated in the San Francisco Bay Area (San Francisco, and Contra Costa and Alameda counties). Four infrastructure density measures are shown below, adding I(150) (the probability of infrastructure within 150 kilometers) to the earlier I(0), I(50), and I(100). Examples of infrastructure intensity estimation Six vulnerable facilities (numbers 0, 9, 11, 15, 28, and 34) are examined. The smoothing parameters for the density estimates have been chosen here by eye, rather than using an automatic criterion. Vulnerable facility 0 9

10 Sensitive facility The distribution is unimodal, centered roughly 690 kilometers away. Obviously this facility is nowhere near the Bay Area. The measures of infrastructure density for this vulnerable facility are all virtually zero, as would be expected: I(0)= 4.6 x 10-22, I(50)= 1.9 x 10-19, I(100)= 5.4 x 10-18, and I(150)= 1.5 x Vulnerable facility 9 10

11 Sensitive facility The distribution is unimodal, with a peak centered 140 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= 3.7 x 10-7, I(50)= , I(100)= , and I(150)= Vulnerable facility 11 11

12 Sensitive facility The distribution is bimodal, with a large cluster roughly 15 kilometers away, and a smaller bump centered 35 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , I(100)= , and I(150)= Vulnerable facility 15 12

13 Sensitive facility The distribution is bimodal, with infrastructure centered roughly 10 and 20 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= , I(50)= , I(100)= , and I(150)= Vulnerable facility 28 13

14 Sensitive facility The distribution is unimodal, with a sharp peak centered 80 kilometers away (and a slight hint of a bump 90 kilometers away). The measures of infrastructure density for this vulnerable facility are I(0)= 3.5 x 10-9, I(50)= , I(100)= , and I(100)= Vulnerable facility 34 14

15 Sensitive facility The distribution is bimodal, with infrastructure centered roughly 245 and 255 kilometers away. The measures of infrastructure density for this vulnerable facility are I(0)= 4.2 x 10-9, I(50)= , I(100)= , and I(150)= The different measures imply different orderings of infrastructure density for the six vulnerable facilities (ordered from highest to lowest), but all agree that facilities 11 and 15 are closest to BART infrastructure, facilities 9, 28, and 34 are a moderate distance away, while facility 0 is very far away: I(0) I(50) I(100) I(150)

16 Future Research Directions The exploratory analyses shown in this report refer to measures of infrastructure density using two sets of infrastructure data separately: high hazard dams and BART stations. The next step will be to extend these analyses to include all vulnerable facilities, and then connect the density estimates to funding priorities at (for example) the county level. Such analyses will also be applied to other publicly available infrastructure databases, such as those for Amtrak stations, airports, ports, closest distance to major highways, and others. We will also explore how the analyses for different types of infrastructure can be combined to provide an overall risk measure for collateral damage to infrastructure from an attack on a vulnerable facility; an important challenge will be to decide whether different infrastructure types should be weighted differently in the analyses. References Hjort, N.L. and Jones, M.C. (1996), Locally parametric density estimation, Annals of Statistics, 24, Hurvich, C.M., Simonoff, J.S., and Tsai, C.-L. (1998), Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion, Journal of the Royal Statistical Society, Ser. B, 60, Loader, C.R. (1996), Local likelihood density estimation, Annals of Statistics, 24, Simonoff, J.S. (1996), Smoothing Methods in Statistics, Springer-Verlag, New York. Simonoff, J.S. (1998), Three sides of smoothing: Categorical data smoothing, nonparametric regression, and density estimation, International Statistical Review, 66,

Santa Clara County. San Francisco City and County. Marin County. Napa County. Solano County. Contra Costa County. San Mateo County

Santa Clara County. San Francisco City and County. Marin County. Napa County. Solano County. Contra Costa County. San Mateo County Marin County Santa Clara County San Francisco City and County Sonoma County Napa County Contra Costa County San Mateo County Solano County Alameda County ABAG History In 1961, Bay Area leaders recognized

More information

Mapping Distance and Density

Mapping Distance and Density Mapping Distance and Density Distance functions allow you to determine the nearest location of something or the least-cost path to a particular destination. Density functions, on the other hand, allow

More information

Chapter 5. Track Geometry Data Analysis

Chapter 5. Track Geometry Data Analysis Chapter Track Geometry Data Analysis This chapter explains how and why the data collected for the track geometry was manipulated. The results of these studies in the time and frequency domain are addressed.

More information

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor On Kernel Density Estimation with Univariate Application BY SILOKO, Israel Uzuazor Department of Mathematics/ICT, Edo University Iyamho, Edo State, Nigeria. A Seminar Presented at Faculty of Science, Edo

More information

Improved smoothing spline regression by combining estimates of dierent smoothness

Improved smoothing spline regression by combining estimates of dierent smoothness Available online at www.sciencedirect.com Statistics & Probability Letters 67 (2004) 133 140 Improved smoothing spline regression by combining estimates of dierent smoothness Thomas C.M. Lee Department

More information

Modelling Infrastructure Systems for Resilience and Sustainability

Modelling Infrastructure Systems for Resilience and Sustainability Modelling Infrastructure Systems for Resilience and Sustainability Sarah Dunn a* Sean WilkinsonError! Bookmark not defined. a Gaihua Fu a Richard Dawson a Abstract: Modern infrastructure systems are vital

More information

Getting Started with GeoQuery

Getting Started with GeoQuery Getting Started with GeoQuery A quick-start guide to the download and use of spatial data for international development geo.aiddata.org GeoQuery Quick Start Handbook v. 1.01, December 2017 WWW.GEOQUERY.ORG

More information

AN INTRODUCTION TO GRID ANALYSIS: HOTSPOT DETECTION

AN INTRODUCTION TO GRID ANALYSIS: HOTSPOT DETECTION AN INTRODUCTION TO GRID ANALYSIS: HOTSPOT DETECTION etraining Introduction Use the Grid Analysis commands of GeoMedia to extract hotspot locations from a feature class containing a collection of incident

More information

Using Hidden Markov Models to analyse time series data

Using Hidden Markov Models to analyse time series data Using Hidden Markov Models to analyse time series data September 9, 2011 Background Want to analyse time series data coming from accelerometer measurements. 19 different datasets corresponding to different

More information

Smoothing parameterselection forsmoothing splines: a simulation study

Smoothing parameterselection forsmoothing splines: a simulation study Computational Statistics & Data Analysis 42 (2003) 139 148 www.elsevier.com/locate/csda Smoothing parameterselection forsmoothing splines: a simulation study Thomas C.M. Lee Department of Statistics, Colorado

More information

California Cybersecurity Integration Center (Cal-CSIC)

California Cybersecurity Integration Center (Cal-CSIC) California Cybersecurity Integration Center (Cal-CSIC) Agenda Mission and Scope Whole of State Government Approach Where is the Cal-CSIC? Cal-CSIC Partners Attaining Cyber Maturity in Parallel Machine

More information

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current

More information

Bayesian analysis of genetic population structure using BAPS: Exercises

Bayesian analysis of genetic population structure using BAPS: Exercises Bayesian analysis of genetic population structure using BAPS: Exercises p S u k S u p u,s S, Jukka Corander Department of Mathematics, Åbo Akademi University, Finland Exercise 1: Clustering of groups of

More information

Multi-Criteria Decision Making 1-AHP

Multi-Criteria Decision Making 1-AHP Multi-Criteria Decision Making 1-AHP Introduction In our complex world system, we are forced to cope with more problems than we have the resources to handle We a framework that enable us to think of complex

More information

FY Bay Area UASI Risk and Grants Management Program Update. November 14, 2013

FY Bay Area UASI Risk and Grants Management Program Update. November 14, 2013 FY 2013-2014 Bay Area UASI Risk and Grants Management Program Update November 14, 2013 Overview FY 2013 Bay Area UASI Risk and Grants Management Program May 2013 December 2013 Data Management Analysis

More information

Assessing the Quality of the Natural Cubic Spline Approximation

Assessing the Quality of the Natural Cubic Spline Approximation Assessing the Quality of the Natural Cubic Spline Approximation AHMET SEZER ANADOLU UNIVERSITY Department of Statisticss Yunus Emre Kampusu Eskisehir TURKEY ahsst12@yahoo.com Abstract: In large samples,

More information

Single-slit diffraction plots by Sohrab Ismail-Beigi, February 5, 2009

Single-slit diffraction plots by Sohrab Ismail-Beigi, February 5, 2009 Single-slit diffraction plots by Sohrab Ismail-Beigi, February 5, 2009 On the following pages are a set of computed single-slit intensities. The slit is long and narrow (i.e not circular but linear). The

More information

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Exploratory data analysis tasks Examine the data, in search of structures

More information

SM05: Risk Analysis: A Comparison in Quantifying Asset Values, Threats, Vulnerabilities and Risk. Doug Haines Haines Security Solutions 9 April 2013

SM05: Risk Analysis: A Comparison in Quantifying Asset Values, Threats, Vulnerabilities and Risk. Doug Haines Haines Security Solutions 9 April 2013 SM05: Risk Analysis: A Comparison in Quantifying Asset Values, Threats, Vulnerabilities and Risk Doug Haines Haines Security Solutions 9 April 2013 The Broad Picture Learning Objectives Know the differences

More information

Nonparametric Estimation of Distribution Function using Bezier Curve

Nonparametric Estimation of Distribution Function using Bezier Curve Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

Dynamic Thresholding for Image Analysis

Dynamic Thresholding for Image Analysis Dynamic Thresholding for Image Analysis Statistical Consulting Report for Edward Chan Clean Energy Research Center University of British Columbia by Libo Lu Department of Statistics University of British

More information

Electric Vehicle Infrastructure Location Identification Tool and Visualization Map. User Guide 1 INTRODUCTION

Electric Vehicle Infrastructure Location Identification Tool and Visualization Map. User Guide 1 INTRODUCTION Electric Vehicle Infrastructure Location Identification Tool and Visualization Map User Guide 1 INTRODUCTION The Infrastructure Location Identification Tool ( Tool ) and Visualization Map ( Map ) represent

More information

CHAOS Chaos Chaos Iterate

CHAOS Chaos Chaos Iterate CHAOS Chaos is a program that explores data analysis. A sequence of points is created which can be analyzed via one of the following five modes: 1. Time Series Mode, which plots a time series graph, that

More information

The City of St. Petersburg (City) initiated

The City of St. Petersburg (City) initiated FWRJ Remote Facility Condition Inspections and Subsequent Integration into the City of St. Petersburg s Asset Management Program Laura Baumberger, Robert Labrie, and Rebecca Overacre The City of St. Petersburg

More information

You will begin by exploring the locations of the long term care facilities in Massachusetts using descriptive statistics.

You will begin by exploring the locations of the long term care facilities in Massachusetts using descriptive statistics. Getting Started 1. Create a folder on the desktop and call it your last name. 2. Copy and paste the data you will need to your folder from the folder specified by the instructor. Exercise 1: Explore the

More information

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of J. KSIAM Vol.3, No.2, 99-106, 1999 SPLINE HAZARD RATE ESTIMATION USING CENSORED DATA Myung Hwan Na Abstract In this paper, the spline hazard rate model to the randomly censored data is introduced. The

More information

TERRORISM LIAISON OFFICER OUTREACH PROGRAM - (TLOOP)

TERRORISM LIAISON OFFICER OUTREACH PROGRAM - (TLOOP) To: Bay Area UASI Approval Authority From: Mike Sena, Director NCRIC/HIDTA Date: January 10, 2019 Re: Item 7: NCRIC Annual Report and Proposed FY19 Allocation Recommendation: Approve $4,454,066 from the

More information

Chapter 17 Creating a New Suit from Old Cloth: Manipulating Vector Mode Cartographic Data

Chapter 17 Creating a New Suit from Old Cloth: Manipulating Vector Mode Cartographic Data Chapter 17 Creating a New Suit from Old Cloth: Manipulating Vector Mode Cartographic Data Imagine for a moment that digital cartographic databases were a perfect analog of the paper map. Once you digitized

More information

A Method for Comparing Multiple Regression Models

A Method for Comparing Multiple Regression Models CSIS Discussion Paper No. 141 A Method for Comparing Multiple Regression Models Yuki Hiruta Yasushi Asami Department of Urban Engineering, the University of Tokyo e-mail: hiruta@ua.t.u-tokyo.ac.jp asami@csis.u-tokyo.ac.jp

More information

Executive summary. Natural disasters including bushfires, floods, storms and cyclones have destructive and devastating consequences for Australia

Executive summary. Natural disasters including bushfires, floods, storms and cyclones have destructive and devastating consequences for Australia Natural disasters including bushfires, floods, storms and cyclones have destructive and devastating consequences for Australia The impacts of these disasters to businesses, properties and people have been

More information

Model Based Symbolic Description for Big Data Analysis

Model Based Symbolic Description for Big Data Analysis Model Based Symbolic Description for Big Data Analysis 1 Model Based Symbolic Description for Big Data Analysis *Carlo Drago, **Carlo Lauro and **Germana Scepi *University of Rome Niccolo Cusano, **University

More information

Hazard Management Cayman Islands

Hazard Management Cayman Islands Hazard Management Cayman Islands Strategic Plan 2012 2016 Executive Summary HMCI strategic plan outlines the agency s outlook in the next five years and illustrates the main strategies as goals that will

More information

Honorable Mayor and Members of the City Council. Memorandum of Understanding: Interstate 80 Integrated Corridor Mobility (I-80 ICM) Project

Honorable Mayor and Members of the City Council. Memorandum of Understanding: Interstate 80 Integrated Corridor Mobility (I-80 ICM) Project Office of the City Manager CONSENT CALENDAR May 1, 2012 To: From: Honorable Mayor and Members of the City Council Christine Daniel, Interim City Manager Submitted by: Andrew Clough, Director, Public Works

More information

City Implements Disaster Recovery Plan to Keep Citizens Connected

City Implements Disaster Recovery Plan to Keep Citizens Connected City Implements Disaster Recovery Plan to Keep Citizens Connected City of Hayward connects public safety and emergency operations via Cisco 802.11 wireless bridging and microwave channels Business Challenge

More information

Introduction to Objective Analysis

Introduction to Objective Analysis Chapter 4 Introduction to Objective Analysis Atmospheric data are routinely collected around the world but observation sites are located rather randomly from a spatial perspective. On the other hand, most

More information

A Statistical Analysis of UK Financial Networks

A Statistical Analysis of UK Financial Networks A Statistical Analysis of UK Financial Networks J. Chu & S. Nadarajah First version: 31 December 2016 Research Report No. 9, 2016, Probability and Statistics Group School of Mathematics, The University

More information

Alternative Fuel Vehicles in State Energy Assurance Planning

Alternative Fuel Vehicles in State Energy Assurance Planning + Alternative Fuel Vehicles in State Energy Assurance Planning July 17, 2014 Webinar hosted by the National Association of State Energy Officials (NASEO), with support from the U.S. Department of Energy

More information

3. Data Structures for Image Analysis L AK S H M O U. E D U

3. Data Structures for Image Analysis L AK S H M O U. E D U 3. Data Structures for Image Analysis L AK S H M AN @ O U. E D U Different formulations Can be advantageous to treat a spatial grid as a: Levelset Matrix Markov chain Topographic map Relational structure

More information

Tutorial 2: Query and Select TRI Spatial Data to Study State-Wide Emissions Quantum GIS

Tutorial 2: Query and Select TRI Spatial Data to Study State-Wide Emissions Quantum GIS Tutorial 2: Query and Select TRI Spatial Data to Study State-Wide Emissions Quantum GIS This tutorial will introduce you to the following: Identifying Attribute Data Sources Toxic Release Inventory (TRI)

More information

Statistical Modeling with Spline Functions Methodology and Theory

Statistical Modeling with Spline Functions Methodology and Theory This is page 1 Printer: Opaque this Statistical Modeling with Spline Functions Methodology and Theory Mark H Hansen University of California at Los Angeles Jianhua Z Huang University of Pennsylvania Charles

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis

More information

Spatial Analysis (Vector) II

Spatial Analysis (Vector) II Spatial Analysis (Vector) II GEOG 300, Lecture 9 Dr. Anthony Jjumba 1 A Spatial Network is a set of geographic locations interconnected in a system by a number of routes is a system of linear features

More information

[Los Alamos National Laboratory release number LA-UR ]

[Los Alamos National Laboratory release number LA-UR ] The RACER (Risk Analysis, Communication, Evaluation, and Reduction) Stakeholder Environmental Data Transparency Project for Los Alamos National Laboratory - 9499 [Los Alamos National Laboratory release

More information

DRAFT THE RESILIENCE TO DISASTERS AND EMERGENCIES INDEX (REDI) Constantine E. Kontokosta, PhD, PE, AICP, LEED AP NYU CUSP & NYU Poly & NYU Schack

DRAFT THE RESILIENCE TO DISASTERS AND EMERGENCIES INDEX (REDI) Constantine E. Kontokosta, PhD, PE, AICP, LEED AP NYU CUSP & NYU Poly & NYU Schack THE RESILIENCE TO DISASTERS AND EMERGENCIES INDEX (REDI) A Unified Index of Resilience Capacity and Survivability Update: March 27, 2014 Constantine E. Kontokosta, PhD, PE, AICP, LEED AP NYU CUSP & NYU

More information

OSLDPS Information Bulletin No. 34 May 8, Job Aids for the Data Analysis Tool and Jurisdiction Prioritization Tool

OSLDPS Information Bulletin No. 34 May 8, Job Aids for the Data Analysis Tool and Jurisdiction Prioritization Tool OSLDPS Information Bulletin No. May 8, 00 TO: FROM: SUBJECT: All State Administrative Agency Heads All State Administrative Agency Points of Contact C.H. Butch Straub, II Director Job Aids for the Data

More information

Geography 281 Map Making with GIS Project Three: Viewing Data Spatially

Geography 281 Map Making with GIS Project Three: Viewing Data Spatially Geography 281 Map Making with GIS Project Three: Viewing Data Spatially This activity introduces three of the most common thematic maps: Choropleth maps Dot density maps Graduated symbol maps You will

More information

Nonparametric Approaches to Regression

Nonparametric Approaches to Regression Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)

More information

6. Object Identification L AK S H M O U. E D U

6. Object Identification L AK S H M O U. E D U 6. Object Identification L AK S H M AN @ O U. E D U Objects Information extracted from spatial grids often need to be associated with objects not just an individual pixel Group of pixels that form a real-world

More information

US Geo-Explorer User s Guide. Web:

US Geo-Explorer User s Guide. Web: US Geo-Explorer User s Guide Web: http://usgeoexplorer.org Updated on October 26, 2016 TABLE OF CONTENTS Introduction... 3 1. System Interface... 5 2. Administrative Unit... 7 2.1 Region Selection... 7

More information

Threat Centric Vulnerability Management

Threat Centric Vulnerability Management Threat Centric Vulnerability Management Solution Brief When it comes to vulnerability management, security leaders continue struggle to identify which of the thousands even millions of vulnerabilities

More information

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer Model Assessment and Selection Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Model Training data Testing data Model Testing error rate Training error

More information

Identifying Critical Infrastructure Through the Use of Hydraulic Modeling to Support Asset Management

Identifying Critical Infrastructure Through the Use of Hydraulic Modeling to Support Asset Management Identifying Critical Infrastructure Through the Use of Hydraulic Modeling to Support Asset Management James P. Cooper, Prof. Engineer, Cert. Operator Acknowledgements Lisa Gresehover Kimberly Six Karem

More information

Viscous Fingers: A topological Visual Analytic Approach

Viscous Fingers: A topological Visual Analytic Approach Viscous Fingers: A topological Visual Analytic Approach Jonas Lukasczyk University of Kaiserslautern Germany Bernd Hamann University of California Davis, U.S.A. Garrett Aldrich Univeristy of California

More information

This guide covers 3 functions you can perform with DataPlace: o Mapping, o Creating Tables, and o Creating Rankings. Registering with DataPlace

This guide covers 3 functions you can perform with DataPlace: o Mapping, o Creating Tables, and o Creating Rankings. Registering with DataPlace Guide for Using DataPlace DataPlace is one-stop source for housing and demographic data about communities, the region, and the nation. The site assembles a variety of data sets from multiple sources, and

More information

MKTG 460 Winter 2019 Solutions #1

MKTG 460 Winter 2019 Solutions #1 MKTG 460 Winter 2019 Solutions #1 Short Answer: Data Analysis 1. What is a data table and how are the data values organized? A data table stores the data values for variables across different observations,

More information

Clustering. Supervised vs. Unsupervised Learning

Clustering. Supervised vs. Unsupervised Learning Clustering Supervised vs. Unsupervised Learning So far we have assumed that the training samples used to design the classifier were labeled by their class membership (supervised learning) We assume now

More information

Redefining and Enhancing K-means Algorithm

Redefining and Enhancing K-means Algorithm Redefining and Enhancing K-means Algorithm Nimrat Kaur Sidhu 1, Rajneet kaur 2 Research Scholar, Department of Computer Science Engineering, SGGSWU, Fatehgarh Sahib, Punjab, India 1 Assistant Professor,

More information

CALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES

CALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES 2. Uluslar arası Raylı Sistemler Mühendisliği Sempozyumu (ISERSE 13), 9-11 Ekim 2013, Karabük, Türkiye CALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES Zübeyde Öztürk

More information

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. 1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed

More information

The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers Detection

The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers Detection Volume-8, Issue-1 February 2018 International Journal of Engineering and Management Research Page Number: 194-200 The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers

More information

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations

More information

Center, Shape, & Spread Center, shape, and spread are all words that describe what a particular graph looks like.

Center, Shape, & Spread Center, shape, and spread are all words that describe what a particular graph looks like. Center, Shape, & Spread Center, shape, and spread are all words that describe what a particular graph looks like. Center When we talk about center, shape, or spread, we are talking about the distribution

More information

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable. 5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table

More information

arxiv: v1 [physics.data-an] 27 Sep 2007

arxiv: v1 [physics.data-an] 27 Sep 2007 Classification of Interest Rate Curves Using Self-Organising Maps arxiv:0709.4401v1 [physics.data-an] 27 Sep 2007 M.Kanevski a,, M.Maignan b, V.Timonin a,1, A.Pozdnoukhov a,1 a Institute of Geomatics and

More information

Halftoning and quasi-monte Carlo

Halftoning and quasi-monte Carlo Halftoning and quasi-monte Carlo Ken Hanson CCS-2, Methods for Advanced Scientific Simulations Los Alamos National Laboratory This presentation available at http://www.lanl.gov/home/kmh/ LA-UR-04-1854

More information

Optimizing Pharmaceutical Production Processes Using Quality by Design Methods

Optimizing Pharmaceutical Production Processes Using Quality by Design Methods Optimizing Pharmaceutical Production Processes Using Quality by Design Methods Bernd Heinen, SAS WHITE PAPER SAS White Paper Table of Contents Abstract.... The situation... Case study and database... Step

More information

Appendix M. Lifeline Sector Coordination

Appendix M. Lifeline Sector Coordination Appendix M. Lifeline Sector Coordination Regional Water Supply Resiliency Project Phase 2 Summary Report July 2018 This page intentionally left blank. Regional Water Supply Resiliency Project Phase 2 Summary

More information

City Mesh Resilient First Responder Communication

City Mesh Resilient First Responder Communication Telecooperation Prof. Dr. Max Mühlhäuser City Mesh Resilient First Responder Communication Kamill Panitzek Dirk Bradler Immanuel Schweizer Max Mühlhäuser Technische Universität Darmstadt This work has

More information

Image Segmentation. Shengnan Wang

Image Segmentation. Shengnan Wang Image Segmentation Shengnan Wang shengnan@cs.wisc.edu Contents I. Introduction to Segmentation II. Mean Shift Theory 1. What is Mean Shift? 2. Density Estimation Methods 3. Deriving the Mean Shift 4. Mean

More information

Can We Reliably Benchmark HTA Organizations? Michael Drummond Centre for Health Economics University of York

Can We Reliably Benchmark HTA Organizations? Michael Drummond Centre for Health Economics University of York Can We Reliably Benchmark HTA Organizations? Michael Drummond Centre for Health Economics University of York Outline of Presentation Some background Methods Results Discussion Some Background In recent

More information

INTERPOLATED GRADIENT FOR DATA MAPS

INTERPOLATED GRADIENT FOR DATA MAPS Technical Disclosure Commons Defensive Publications Series August 22, 2016 INTERPOLATED GRADIENT FOR DATA MAPS David Kogan Follow this and additional works at: http://www.tdcommons.org/dpubs_series Recommended

More information

3 Graphical Displays of Data

3 Graphical Displays of Data 3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked

More information

Unsupervised Learning : Clustering

Unsupervised Learning : Clustering Unsupervised Learning : Clustering Things to be Addressed Traditional Learning Models. Cluster Analysis K-means Clustering Algorithm Drawbacks of traditional clustering algorithms. Clustering as a complex

More information

Reflection and Image Formation by Mirrors

Reflection and Image Formation by Mirrors Purpose Theory a. To study the reflection of light Reflection and Image Formation by Mirrors b. To study the formation and characteristics of images formed by different types of mirrors. When light (wave)

More information

Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces

Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Eric Christiansen Michael Gorbach May 13, 2008 Abstract One of the drawbacks of standard reinforcement learning techniques is that

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 245 Introduction This procedure generates R control charts for variables. The format of the control charts is fully customizable. The data for the subgroups can be in a single column or in multiple

More information

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

SELECTION OF A MULTIVARIATE CALIBRATION METHOD SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper

More information

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant

More information

Performance Measurement, Data and Decision Making: A Matter of Alignment. Mark F. Muriello Assistant Director Tunnels, Bridges & Terminals

Performance Measurement, Data and Decision Making: A Matter of Alignment. Mark F. Muriello Assistant Director Tunnels, Bridges & Terminals Performance Measurement, Data and Decision Making: A Matter of Mark F. Muriello Assistant Director Tunnels, Bridges & Terminals The Port Authority of NY & NJ: Delivering Vital Connections Tunnels and Bridges

More information

Using the DATAMINE Program

Using the DATAMINE Program 6 Using the DATAMINE Program 304 Using the DATAMINE Program This chapter serves as a user s manual for the DATAMINE program, which demonstrates the algorithms presented in this book. Each menu selection

More information

Week 7 Picturing Network. Vahe and Bethany

Week 7 Picturing Network. Vahe and Bethany Week 7 Picturing Network Vahe and Bethany Freeman (2005) - Graphic Techniques for Exploring Social Network Data The two main goals of analyzing social network data are identification of cohesive groups

More information

Representatives of the Global Environment Facility and UNDP, Distinguished Ministers, Ladies and Gentlemen, I would like to thank the United Nations,

Representatives of the Global Environment Facility and UNDP, Distinguished Ministers, Ladies and Gentlemen, I would like to thank the United Nations, High-level Forum of South-South Cooperation on Climate Change - Addressing Climate Change and Achieving Sustainable Development in Developing Countries: The Role of South-South Cooperation November 14;

More information

1. Start ArcMap by going to the Start menu > All Programs > ArcGIS > ArcMap.

1. Start ArcMap by going to the Start menu > All Programs > ArcGIS > ArcMap. Learning ArcGIS: Introduction to ArcMap 10.1 The Map Document Feature Manipulation Navigating ArcMap Map Documents, Layers, and Features Shapes, Location, and Attribute Data Symbology Zoom, Pan and Map

More information

Managed Lane owner decision needed San Mateo County s options Understanding revenues & costs Pros & cons of County s options Proposed next steps

Managed Lane owner decision needed San Mateo County s options Understanding revenues & costs Pros & cons of County s options Proposed next steps San Mateo County Transportation Authority Meeting August 2, 2018 1 OVERVIEW Managed Lane owner decision needed San Mateo County s options Understanding revenues & costs Pros & cons of County s options

More information

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. http://en.wikipedia.org/wiki/local_regression Local regression

More information

JUST THE MATHS UNIT NUMBER STATISTICS 1 (The presentation of data) A.J.Hobson

JUST THE MATHS UNIT NUMBER STATISTICS 1 (The presentation of data) A.J.Hobson JUST THE MATHS UNIT NUMBER 18.1 STATISTICS 1 (The presentation of data) by A.J.Hobson 18.1.1 Introduction 18.1.2 The tabulation of data 18.1.3 The graphical representation of data 18.1.4 Exercises 18.1.5

More information

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015 Instance-based Learning CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2015 Outline Non-parametric approach Unsupervised: Non-parametric density estimation Parzen Windows K-Nearest

More information

NIBS Building Innovation 2014

NIBS Building Innovation 2014 NIBS Building Innovation 2014 Integrated Rapid Visual Screening Process to Assess and Design Safe Schools Mila Kennett Department of Homeland Security Science and Technology Resilient Systems Division

More information

Kernel Density Estimation

Kernel Density Estimation Kernel Density Estimation An Introduction Justus H. Piater, Université de Liège Overview 1. Densities and their Estimation 2. Basic Estimators for Univariate KDE 3. Remarks 4. Methods for Particular Domains

More information

Hierarchical Mixture Models for Nested Data Structures

Hierarchical Mixture Models for Nested Data Structures Hierarchical Mixture Models for Nested Data Structures Jeroen K. Vermunt 1 and Jay Magidson 2 1 Department of Methodology and Statistics, Tilburg University, PO Box 90153, 5000 LE Tilburg, Netherlands

More information

Best Practices for the Governance of ArcGIS Online Mitigating Chaos. Presented by: Laura Kingston, Land Information Coordinator, MTO October 18, 2017

Best Practices for the Governance of ArcGIS Online Mitigating Chaos. Presented by: Laura Kingston, Land Information Coordinator, MTO October 18, 2017 Best Practices for the Governance of ArcGIS Online Mitigating Chaos Presented by: Laura Kingston, Land Information Coordinator, MTO October 18, 2017 Organization Overview Ministry of Transportation Ontario

More information

Recognizing hand-drawn images using shape context

Recognizing hand-drawn images using shape context Recognizing hand-drawn images using shape context Gyozo Gidofalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92037 gyozo@cs.ucsd.edu Abstract The objective

More information

Sustainable Networks: Challenges and Opportunities. Anne Meltzer

Sustainable Networks: Challenges and Opportunities. Anne Meltzer Sustainable Networks: Challenges and Opportunities Anne Meltzer NSF workshop in conjunction with 2008 AAAS meeting on transitioning networks of earthquake monitoring stations into fully sustainable networks

More information

Introduction to GIS & Mapping: ArcGIS Desktop

Introduction to GIS & Mapping: ArcGIS Desktop Introduction to GIS & Mapping: ArcGIS Desktop Your task in this exercise is to determine the best place to build a mixed use facility in Hudson County, NJ. In order to revitalize the community and take

More information

Robust Shape Retrieval Using Maximum Likelihood Theory

Robust Shape Retrieval Using Maximum Likelihood Theory Robust Shape Retrieval Using Maximum Likelihood Theory Naif Alajlan 1, Paul Fieguth 2, and Mohamed Kamel 1 1 PAMI Lab, E & CE Dept., UW, Waterloo, ON, N2L 3G1, Canada. naif, mkamel@pami.uwaterloo.ca 2

More information

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an Soft Threshold Estimation for Varying{coecient Models Artur Klinger, Universitat Munchen ABSTRACT: An alternative penalized likelihood estimator for varying{coecient regression in generalized linear models

More information

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG

More information

PASS Sample Size Software. Randomization Lists

PASS Sample Size Software. Randomization Lists Chapter 880 Introduction This module is used to create a randomization list for assigning subjects to one of up to eight groups or treatments. Six randomization algorithms are available. Four of the algorithms

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information