Cell Count Method on a Network with SANET

Similar documents
R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

CHAPTER 2 DECOMPOSITION OF GRAPHS

X- Chart Using ANOM Approach

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

TN348: Openlab Module - Colocalization

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Support Vector Machines

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

C2 Training: June 8 9, Combining effect sizes across studies. Create a set of independent effect sizes. Introduction to meta-analysis

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

APPLICATION OF A COMPUTATIONALLY EFFICIENT GEOSTATISTICAL APPROACH TO CHARACTERIZING VARIABLY SPACED WATER-TABLE DATA

A New Approach For the Ranking of Fuzzy Sets With Different Heights

Smoothing Spline ANOVA for variable screening

S1 Note. Basis functions.

UC Berkeley Working Papers

Machine Learning. Topic 6: Clustering

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

An Approach in Coloring Semi-Regular Tilings on the Hyperbolic Plane

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Hermite Splines in Lie Groups as Products of Geodesics

An Optimal Algorithm for Prufer Codes *

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Module Management Tool in Software Development Organizations

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Classifier Selection Based on Data Complexity Measures *

On Some Entertaining Applications of the Concept of Set in Computer Science Course

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

User Authentication Based On Behavioral Mouse Dynamics Biometrics

A Five-Point Subdivision Scheme with Two Parameters and a Four-Point Shape-Preserving Scheme

Machine Learning 9. week

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis

The Shortest Path of Touring Lines given in the Plane

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE

Analysis of Malaysian Wind Direction Data Using ORIANA

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Load Balancing for Hex-Cell Interconnection Network

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

Multiblock method for database generation in finite element programs

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation

Machine Learning: Algorithms and Applications

K-means and Hierarchical Clustering

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

MOTION BLUR ESTIMATION AT CORNERS

Intra-Parametric Analysis of a Fuzzy MOLP

Optimization Methods: Integer Programming Integer Linear Programming 1. Module 7 Lecture Notes 1. Integer Linear Programming

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

The Research of Support Vector Machine in Agricultural Data Classification

Load-Balanced Anycast Routing

Line geometry, according to the principles of Grassmann s theory of extensions. By E. Müller in Vienna.

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Design of Structure Optimization with APDL

y and the total sum of

CS 534: Computer Vision Model Fitting

Mathematics 256 a course in differential equations for engineering students

Available online at ScienceDirect. Procedia Environmental Sciences 26 (2015 )

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Programming in Fortran 90 : 2017/2018

Feature Reduction and Selection

Topology Design using LS-TaSC Version 2 and LS-DYNA

Unsupervised Learning

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Data structures and algorithms to support interactive spatial analysis using dynamic Voronoi diagrams. Abstract

Cluster Analysis of Electrical Behavior

UNIT 2 : INEQUALITIES AND CONVEX SETS

Introduction. Leslie Lamports Time, Clocks & the Ordering of Events in a Distributed System. Overview. Introduction Concepts: Time

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

Lecture #15 Lecture Notes

Brave New World Pseudocode Reference

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

Fast Computation of Shortest Path for Visiting Segments in the Plane

Simulation Based Analysis of FAST TCP using OMNET++

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique


The Codesign Challenge

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

Lecture 5: Probability Distributions. Random Variables

Radial Basis Functions

Unsupervised Learning and Clustering

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

AP PHYSICS B 2008 SCORING GUIDELINES

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Module 6: FEM for Plates and Shells Lecture 6: Finite Element Analysis of Shell

USING GRAPHING SKILLS

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE

Transcription:

CSIS Dscusson Paper No.59 Cell Count Method on a Network wth SANET Atsuyuk Okabe* and Shno Shode** Center for Spatal Informaton Scence, Unversty of Tokyo 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-8656, Japan *atsu@ua.t.u-tokyo.ac.jp **shno@ua.t.u-tokyo.ac.jp Aug, 2003 Abstract Ths paper frst proposes a cell count method defned on a network, called the network cell count method, as an extenson of the ordnary cell count method defned on a plane. Second, the paper develops a user-frendly tool for the network cell count method n conjuncton wth a general toolbox for spatal analyss on a network, called SANET. Thrd, the paper shows an actual applcaton of the network cell count method to the dstrbuton of retal stores n Shbuya, Tokyo. Key words: cell count method, network, toolbox, software, SANET, GIS 1

1. Introducton Ths paper has two objectves. The frst objectve of ths paper s to propose a new cell count method, called the network cell count method, as an extenson of the ordnary cell count method (or the quadrat method). The second objectve s to develop a user-frendly tool for the network cell count method, and ncorporate t nto a toolbox called SANET, a general GIS software package for spatal analyss on a network. The cell count method s one of the most classc methods for pont pattern analyss (Baley and Gatrell, 1995; Cresse, 1993; Upton and Fngleton, 1985). Ths cell count method ordnarly makes the followng assumptons. Assumpton : Geographcal space s represented by a plane. Assumpton : The plane s homogeneous,.e., the probablty of a pont beng placed on a unt area s nvarant regardless of the locaton of the unt area on the plane. There may be some actual cases n whch these assumptons hold or approxmately hold. For example, (Matu, 1932) examned the scattered vllage (houses are scattered) on the Tonam plan, whch appears to be a farly homogeneous plane. However, when we examne the dstrbuton of features n an urbanzed area, the above assumptons are hard to accept. For example, consder the dstrbuton of beauty parlors n Shbuya, one of the sub-centers n Tokyo, whch s shown n Fgure 1. As s seen n ths fgure, the beauty parlors are located along streets, and the dstrbuton of beauty parlors seems to be strongly constraned by the street network. Ths suggests us to replace Assumpton wth the followng assumpton. Assumpton : Geographcal space s represented by a network. As a consequence of ths assumpton, we assume a heterogeneous space n the sense that the space s not sotropc. As a matter of fact, drecton s restrcted along a path of a network. However, we assume a homogeneous space n the followng sense (.e. a network verson of Assumpton ). Assumpton : The network s homogeneous n the sense that the probablty of a pont beng placed on a unt lne segment s nvarant regardless of the locaton of the unt lne segment on the network. 2

Obvously ths assumpton s unrealstc to deal wth the dstrbuton of features n an urbanzed area, but ths assumpton s easly relaxed n a network space through the unform network transformaton. Ths transformaton s the topc of the next secton. Fgure 1: Beauty parlors n Shbuya, Tokyo 2. Unform network transformaton A heterogeneous network, N, s represented by a densty functon defned on lnks of the network. To be explct, let D be the densty of the th lnk of the network, and L be the length of the th lnk. Then the heterogeneous network means that D vares from lnk to lnk; the homogeneous network (Assumpton ) s wrtten as D = c for = 1,, n. Note that the locaton of a pont, p, on the th lnk can be dentfed by the path length, t, from an end pont of the th lnk to the pont p along the th lnk. * Let us now consder a new network, N, by replacng the th lnk wth the new th lnk and we correspond a pont p at t on the th lnk to the pont p at s on the new th lnk n such a way that the path length, s, from an end pont of the new th lnk to the pont p along the new th lnk s gven by s = t 0 D dx. 3

Note that the ntegraton s done wth respect to x along the th lnk. Ths transformaton mples that the th lnk of length L s replaced wth the new th lnk of length D L. The new network obtaned from the above transformaton has a nce property. Notcng that the total quantty on the th lnk s gven by L 0 D dx = D L, and that the length of the new th lnk s D L, we obtan the densty of the new th lnk s D L /( D L ) = 1. Ths equaton hold for all lnks. Ths means that the transformed network N* s homogeneous as n Assumpton. We refer to the above transformaton as the unform network transformaton (Okabe, 2002). We can easly deal wth a heterogeneous network by transformng t nto a homogeneous network through the unform network transformaton. 3. Cell count method Let us consder a network, L, consstng of lne segments that are connected. We decompose the network L nto a set of sub-networks (called cells) satsfyng that the length of each sub-network s the same, c, and each sub-network s connected. In practce, however, ths decomposton s mpossble except for very specal cases, because the total length dvded by c s not nteger n general. PC Notcng ths fact, we consder two types of cells: proper cells, L { L1,, L n } IC mproper cells, { L L } L n+ 1,, n+ n * =, and =. The former cells are used for cell countng, but the latter cells are perpheral cells not used for cell countng. We suppose that these cells satsfy the followng condtons. () L s connected n the sense that for any par of ponts on L, there exsts a path between these ponts that s ncluded n L ( = 1,, n + n* ). 4

() Proper cells L,, L have the same sze,.e. 1 n L = c for = 1,, n, where L denotes the length of L. () The sze of the mproper cells L n+ 1,, L n + n* s smaller than that of the proper cells,.e. L < c for = n + 1,, n + n*. (v) The unon of the proper cells and the mproper cells cover the whole network L,.e. PC IC n n+ n* L = L L = [ L ] [ L ]. = 1 = n+ 1 (v) Cells do not ntersect each other except at boundary ponts,.e. L L = 0 for j,, j = 1,, n + n *. j We make fve remarks on ths decomposton. Frst, ths decomposton s not unque. There are many ways of decomposton satsfyng the above condtons. Second, the shape of cells s not the same, because a gven network L s not regular. Ths contrasts to the planar case where the shape s the same (usually squares) for all cells. Thrd, t s very dffcult to control the shape of cells, but the shape should be as compact as possble. Forth, the length of the mproper cells should be as small as possble. Last, mproper cells should not appear nsde a network. In the planar case, mproper cells appear only on the perphery, but n the network case, mproper cells may appear nsde a network. We call such cells hole cells. It s not easy to avod hole cells n decomposng a network, and we shall dscuss ths problem n Secton 5. The dea of the network cell count method s the same as that of the planar cell count 5

PC n method. Suppose that we observe m ponts dstrbuted on the network = L ], L [ = 1 and we want to test the null hypothess that the m ponts are randomly dstrbuted on PC n the network = L ] accordng to the unform dstrbuton, L [ = 1 f 1 L PC ( x) =, x L. PC We can test ths hypothess usng the goodness-of-ft test. To be explct, let observed number of cells that contans exactly j ponts, except for N k. observed number of cells that contan k or more ponts. Let N j be N k s the P j be the probablty that j ponts are placed n a cell under the above null hypothess, whch s gven by the bnomnal dstrbuton or the Posson dstrbuton for a large number of ponts. Thus we can test the null hypothess wth the test statstc k 2 χ = = 1 2 ( np N). np 4. SANET We develop a user-frendly tool for achevng the network cell count method defned above and nclude t n a general toolbox for spatal analyss on a network, called SANET (Okabe, Okunuk and Funamoto, 2002). SANET conssts of two components. The frst component s a software package for computng methods for spatal analyss on a network. We can nterface ths package wth a vewer of GIS through nput and output fles. The computaton n ths software package s ndependent of a vewer of GIS, and so we can use any vewer. The second component s an nterface wth a vewer of GIS, and ths nterface depends on a vewer of GIS. We use ArcVew8.x., and we have developed the nterface that commute data between the network computaton software package and ArcVew8.x. SANET s under development but the frst verson was released n November, 2002. The frst verson provdes the followng seven tools: Tool 1: Constructon of dataset for SANET. Tool 2: Access pont assgnment. 6

Tool 3: Table calculaton. Tool 4: Generaton of the network Vorono dagram (Okabe, Boots, Sughara and Chu, 2000). Tool5: Generaton of random ponts on a network (for Monte Carlo smulatons). Tool 6: Cross K-functon method (Okabe and Yamada, 2002). Tool 7: K-functon method (Okabe and Yamada, 2002; Yamada and Thll, 2002). SANET s open to non-proft users wthout charge and t can be downloaded from the SANET ste: http://okabe.t.u-tokyo.ac.jp/okabelab/atsu/sanet/sanet-ndex.html Ths paper adds the cell count method to SANET as the eghth tool. The cell count method uses Tool 1 for convertng the network data format n the network computaton software package to data format of ArcVew8.x. It also uses Tool 2 for assgnng the representatve ponts of polygon-lke features, such as beauty parlors, to the nearest ponts on a network (whch may be regarded as entrances of the facltes). By ths assgnment, we can use the cell count method that assumes that ponts are dstrbuted over the network. 5. Computatonal method for the cell count method The computatonal method s farly complex and techncal and so we only outlne the algorthm n ths secton. In the frst step, we choose a node of a network around the center of the network. We call the node the root node. In the second step, we construct the extended shortest-path tree rooted at the root node. The extended shortest-path tree s obtaned from the ordnary shortest-path tree n the followng manner. The ordnary shortest-path tree does not cover the whole network. We call the lnks that are not ncluded n the ordnary shortest-path tree collson lnks. On each collson lnk, we can fnd such a pont, c, that the shortest-path dstance from the pont to the root node through one end node of the collson lnk s the same as the shortest-path dstance from the pont c to the root node through the other end node of the collson node. We call the pont c a collson pont. We cut the network at all collson ponts and add nodes on both cut ends. We call the resultng tree the extended shortest-path tree. Once we obtan the shortest-path tree, we can easly obtan the shortest-path from 7

any pont on the network to the root node. In the thrd step, we construct a tree rooted at the root node whose total length s a gven dstance. We next construct a tree rooted at each end node of the tree whose total length s a gven dstance. If we cannot construct such a tree, we leave t. We contnue ths procedure untl when we cannot construct trees. Then we obtan a set of trees that are ncluded n the network and the length of each tree s the same. We may use the resultng trees as cells. However, these cells are lkely to produce the holes cells n the network. The last step s to put out these hole cells to the perphery of the network. Ths part s too lengthy to explan, and so t s omt here. 6. An Applcaton Havng establshed the network cell count method and ts software package, let us show an actual example. The study regon s part of Shbuya, one of the sub-centers n Tokyo, and the street network s shown by colored lnes n Fgure 2. Fgure 2: Retal stores assgned to the street network n Shbuya, Tokyo (cells are ndcated by dfferent colors) Frst we decompose the street network nto cells. In Fgure 2, the cells are ndcated 8

by dfferent colors. The yellow lnes are mproper cells. Second, we assgn retal stores to the nearest ponts on the network (whch may be regarded as the entrances of the stores). The ponts n Fgure 2 show the assgned retal stores on the street network. Thrd, the tool counts the number of retal stores n each cell and shows the numbers of cells havng ponts as n Fgure 3. Fgure 3: The number of cells havng ponts Last we apply the goodness-of-ft test to the data shown n Fgure 4. The test rejects the null hypothess, mplyng that the retal stores are non-randomly dstrbuted n Shbuya. 7. Concludng remarks In ths paper we have proposed the network cell count method as an extenson of the ordnary cell count method defned on a plane and developed a tool for the network cell count method. Compared wth the ordnary cell count method, the proposed network cell count method has a few advantages. Frst, the network space s more natural to deal wth the dstrbuton of features n an urbanzed area than a homogeneous plane. Second, t s easy to treat heterogenety wth the unform network transformaton. For example, t s 9

easy to treat the dstrbuton of stores n relaton to the dstrbuton of populaton. One mght consder that the results obtaned from the ordnary cell count method are not so dfferent from those obtaned from the network cell count method. We compared these results n a few actual examples and found that ths dfference was sgnfcant. We wll further examne ths comparson usng many cases and we wll report the results n another occason. Acknowledgements We express our thanks to Yasush Asam, Yuko Sadahro and Ke-ch Okunuk for ther comments on earler drafts. Ths study was part of the outcomes of the Grant-n-ad project "Spatal Informaton Scence for Human and Socal Scences" supported by the Japanese Mnstry of Educaton and Scence. References Baley, T. and Gatrell, A. (1995) Interactve Spatal Analyss, London: Longman. Cresse, N. (1993) Statstcs for Spatal Data, New York: Wley. Matu, I. (1932) Statstcal study of the dstrbuton of scattered vllages n two regons of the Tonam plan, Toyama prefecture, Japanese Journal of Geology and Geography, 9, 251-266. Okabe, A. (2002) Unform network transformaton for spatal analyss on a heterogeneous network, CSIS Dscusson Paper #51. Okabe, A., Okunuk, K. and Funamoto, S. (2002) SANET: A Toolbox for Spatal Analyss on a Network, Center for Spatal Informaton Scence, Unversty of Tokyo. Okabe, A., Boots, B., Sughara, K. and Chu, S. N. (2000) Spatal Tessellatons: Concepts and Applcatons of Vorono Dagrams, (2 nd edton), Chchester: John Wley. Okabe, A. and Yamada, I. (2001) The K-functon method on a network and ts computatonal mplementaton, Geographcal Analyss, Vol.33, No.3, pp.271-290. Upton, G. and Fngleton, B. (1985) Spatal Data Analyss by Example, Chchester: Wley Yamada, I. and Thll, J-C. (2002) An Emprcal Comparson of Planar and Network K-functon Analyses, Abstract, AAG Annual Meetng, Los Angeles, CA. 10