Thoughts on Representing Spatial Objects. William A. Huber Quantitative Decisions Rosemont, PA

Similar documents
Lecture 6: GIS Spatial Analysis. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

DATA MODELS IN GIS. Prachi Misra Sahoo I.A.S.R.I., New Delhi

layers in a raster model

Introduction to Geographic Information Science. Some Updates. Last Lecture 4/6/2017. Geography 4103 / Raster Data and Tesselations.

SPATIAL DATA MODELS Introduction to GIS Winter 2015

Understanding Geospatial Data Models

Geometric Rectification of Remote Sensing Images

Surface Creation & Analysis with 3D Analyst

Spatial Data Models. Raster uses individual cells in a matrix, or grid, format to represent real world entities

Representing Geography

Lab 12: Sampling and Interpolation

Surface Analysis with 3D Analyst

Computer Database Structure for Managing Data :-

Lab 12: Sampling and Interpolation

Creating Surfaces. Steve Kopp Steve Lynch

RASTER ANALYSIS GIS Analysis Winter 2016

Geol 588. GIS for Geoscientists II. Zonal functions. Feb 22, Zonal statistics. Interpolation. Zonal statistics Sp. Analyst Tools - Zonal.

RASTER ANALYSIS GIS Analysis Fall 2013

Maps as Numbers: Data Models

LECTURE 2 SPATIAL DATA MODELS

Esri International User Conference. San Diego, California. Technical Workshops. July Creating Surfaces. Steve Kopp and Steve Lynch

Creating raster DEMs and DSMs from large lidar point collections. Summary. Coming up with a plan. Using the Point To Raster geoprocessing tool

Geographic Information Systems (GIS) Spatial Analyst [10] Dr. Mohammad N. Almasri. [10] Spring 2018 GIS Dr. Mohammad N. Almasri Spatial Analyst

Getting Started with Spatial Analyst. Steve Kopp Elizabeth Graham

17/07/2013 RASTER DATA STRUCTURE GIS LECTURE 4 GIS DATA MODELS AND STRUCTURES RASTER DATA MODEL& STRUCTURE TIN- TRIANGULAR IRREGULAR NETWORK

Geographic Information Fundamentals Overview

L7 Raster Algorithms

Lesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5

Dijkstra's Algorithm

Introduction to Computer Vision. Human Eye Sampling

Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Getting Started with Spatial Analyst. Steve Kopp Elizabeth Graham

Raster Data. James Frew ESM 263 Winter

GEOGRAPHIC INFORMATION SYSTEMS Lecture 02: Feature Types and Data Models

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS

Introduction to Geographic Information Systems Dr. Arun K Saraf Department of Earth Sciences Indian Institute of Technology, Roorkee

Data Representation in Visualisation

DETERMINATION OF THE OPTIMUM PATH ON THE EARTH S SURFACE. (extended abstract) Abstract

CPSC 695. Methods for interpolation and analysis of continuing surfaces in GIS Dr. M. Gavrilova

Topic 5: Raster and Vector Data Models

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

Remote Sensing in an

Review of Cartographic Data Types and Data Models

Lecture overview. Visualisatie BMT. Fundamental algorithms. Visualization pipeline. Structural classification - 1. Structural classification - 2

What can we represent as a Surface?

Chapter 3. Sukhwinder Singh

Geometric Correction

ENGRG Introduction to GIS

Polygonal representation of 3D urban terrain point-cloud data

EAT 233/3 GEOGRAPHIC INFORMATION SYSTEM (GIS)

Georeferencing & Spatial Adjustment

Algorithms for GIS csci3225

Surface Analysis. Data for Surface Analysis. What are Surfaces 4/22/2010

Scientific Visualization Example exam questions with commented answers

University of West Hungary, Faculty of Geoinformatics. Béla Márkus. Spatial Analysis 5. module SAN5. 3D analysis

DIGITAL TERRAIN MODELS

Lecture 9. Raster Data Analysis. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

UNIT-2 IMAGE REPRESENTATION IMAGE REPRESENTATION IMAGE SENSORS IMAGE SENSORS- FLEX CIRCUIT ASSEMBLY

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Scientific Visualization. CSC 7443: Scientific Information Visualization

DIGITAL TERRAIN MODELLING. Endre Katona University of Szeged Department of Informatics

Slide 1. Advanced Cartography in ArcGIS. Robert Jensen Edie Punt. Technical Workshops

Brightness and geometric transformations

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into

Purpose: To explore the raster grid and vector map element concepts in GIS.

Georeferencing & Spatial Adjustment 2/13/2018

Cell based GIS. Introduction to rasters

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

Generating a New Shapefile

Maps as Numbers: Data Models

The Problem. Georeferencing & Spatial Adjustment. Nature Of The Problem: For Example: Georeferencing & Spatial Adjustment 9/20/2016

The Problem. Georeferencing & Spatial Adjustment. Nature of the problem: For Example: Georeferencing & Spatial Adjustment 2/4/2014

9. Three Dimensional Object Representations

High-Precision Positioning Unit 2.2 Student Exercise: Calculating Topographic Change

Analytical and Computer Cartography Winter Lecture 9: Geometric Map Transformations

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS

Sung-Eui Yoon ( 윤성의 )

GIS Tools - Geometry. A GIS stores data as different layers of information Different feature types are stored in individual files.

Spatial Interpolation & Geostatistics

3 Data Representation. Data Representation. Department of Computer Science and Engineering 3-1

03 Vector Graphics. Multimedia Systems. 2D and 3D Graphics, Transformations

Geographic Information Systems. using QGIS

IMPROVING THE ACCURACY OF DIGITAL TERRAIN MODELS

Overview. Spectral Processing of Point- Sampled Geometry. Introduction. Introduction. Fourier Transform. Fourier Transform

Key Learning for Grade 3

Introduction to Flash - Creating a Motion Tween

Collision Detection. These slides are mainly from Ming Lin s course notes at UNC Chapel Hill

Fractals: a way to represent natural objects

DESIGN NETWORK ANALYSIS

Shape fitting and non convex data analysis

Intersection of an Oriented Box and a Cone

Nearest Neighbor Predictors

Spatial Analysis (Vector) II

Scalar Visualization

AMCS / CS 247 Scientific Visualization Lecture 4: Data Representation, Pt. 1. Markus Hadwiger, KAUST

Spatial Interpolation - Geostatistics 4/3/2018

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

CSG obj. oper3. obj1 obj2 obj3. obj5. obj4

RASTER ANALYSIS S H A W N L. P E N M A N E A R T H D A T A A N A LY S I S C E N T E R U N I V E R S I T Y O F N E W M E X I C O

Solids as point set. Solid models. Solid representation schemes (cont d) Solid representation schemes. Solid representation schemes (cont d)

Transcription:

Thoughts on Representing Spatial Objects William A. Huber Quantitative Decisions Rosemont, PA

Overview 1. Some Ways to Structure Space 2. What to Put into a Grid 3. Objects and Fields 4. Hybrid Structures

Part 1 Some Ways to Structure Space

A Square Grid

A Rectangular Grid

A hexagonal grid

The Cell Centers Form a Regular Array

An Irregular Rectangular Grid Finite element models e.g., for groundwater flow routinely use such grids as a simple way to adapt their resolution to the rate of change of input parameters (e.g., hydraulic gradient).

A Confocal Conic Grid I once used these for quantum mechanical calculations. See W.A. Huber and C. Bottcher, J. Phys. B 13, L399 (1980)

A Grid with Free Boundaries Application: Spatial Simulated Annealing. Row 22: (Left, right) = {(9, 13); (14, 17); (25, 39)} 9 13 25 39 14 17

A Quadtree

Application: Minimize Average Nearest Neighbor Distance Colors are soil types. Challenge: Find 10 points that minimize the average distance to the nearest. (If you like, weight the average according to soil type. More importantly: re-express the distance as a semivariance. The solution tells us where to sample to obtain the most information when interpolating from the sample.)

A Solution Method These locations are found via spatial simulated annealing (SSA). During this randomized algorithm, the points bounce around. As they bounce, their Thiessen polygons (restricted to this convoluted study region) change. Because several hundred thousand bounces are needed, it is imperative to update the Thiessen polygon boundaries as quickly as possible.

Illustration of SSA Objective: Situate 20 points to minimize average distance within the gray square to the nearest point; i.e., minimize y P x Square min{ y - x } dx where P is the set of 20 points. Constraint: The points must lie within the large square but not within the central black square. Solution: Adjust the points randomly, accepting new positions with controlled probabilities.

SSA Steps Have to be Fast! Naïve algorithm for one SSA step: Move a point. Check that it satisfies the constraints. Recompute the distance grid. Estimate the integral with a global average. Improved algorithm: Move a point anywhere. Recompute only the part of the grid affected by the move. Update the integral based only on the recomputed cells, adding a penalty for a point located in a forbidden area. The improved algorithm is made possible by using a free-boundary representation of the Voronoi cells (Thiessen polygons) around each point.

Ideas Spatial structure and spatial content are different. Structure is a container for the content. Good structures: Allow rapid identification of immediate neighbors. Are homogeneous enough to permit efficient computation of local properties such as slopes. As we move down the ladder from highly regular, static structures to less regular, adaptive, or dynamic structures, Calculations become more difficult and slower. Storage can go down. Accuracy (for a given amount of storage) can go up.

Application: Air Deposition Modeling (TNO, 2001)

Kernels Spread All Sources Over the Land Surface

How a Kernel is Represented Data consist of numeric values provided at all nodes of this irregularly-spaced radial grid. This value is interpolated logarithmically or cubically in the radial direction and linearly in the angular direction.

A Hybrid, Adaptive Data Structure Makes Optimization Possible The large grid is computed within the region occupied by very low kernel values. This value is estimated by the approximate value in the center of the large grid cell.

Consequences of Adaptive Data Structures Typically, the expensive computation (log-linear interpolation) has to occur only within 0.1% of the kernel s support region. Typically, the large grid cells contain 100-200 model grid points. Thus, values at 100-200 points are computed with one calculation, reducing computation in that region by 98 to 99%. The resulting computation time can be 1/50 of the original (fully accurate) computation time without substantial loss of accuracy.

Accuracy Assessment Modeling the model The reference

In our case, accuracy is spatially variable: it is best in the areas that drive the management decisions.

Part 2 What to Put into a Grid

ESRI Puts in Indexes and Floats

For ESRI- Watchers: a Short List of What Spatial Analyst Needs

What Else? Vector fields Fields on a sphere And tensor fields, etc. (These are hard to picture but are important for analysis of movement on a surface. E.g., a (1,1)-tensor describes the actual motion at a location ensuing from an attempted motion there.)

Why Do We Care? Parallel transport of a vector field on a surface produces a holonomy group reflecting its curvature. We need this kind of calculation when doing global GIS! Applications include meteorology, climate analysis, and plate tectonics. This vector, when carried around the path PQR, ends up as the red vector. The amount of change is proportional to the total curvature of the ellipsoid within the triangle PQR. Tensors exhibit holonomy too.

General Objects We should be able to associate any kind of objects with points on a surface, curved or flat. E.g., spectra, time series, photographs, videos, sound, touch, arbitrary text, etc. This becomes interesting when the object changes in continuous ways with change in position on the surface. E.g., a video shows a continuity of scene as one travels around the earth. Continuity implies the ability to interpolate and suggests spatial correlation.

Part 3 Objects and Fields Objects refer to things in the world whilst a field refers to a single valued function of location in two-dimensional space. --Robert Haining, Spatial Data Analysis, Theory and Practice (2003), p. 44. However, we can represent things in the world by means of their indicator functions; i.e., as special kinds of fields. Using an object or a field is a choice about how best to represent data for a particular need, whether it be storage, cartography, or analysis.

A Mathematical Picture Object: Domain Space Field: Space Range The following ideas generalize easily to 3D and time-varying GIS.

Objects Are Represented as Simplicial Complexes Object: Domain Space Multipoint Polyline Polygon; surface Solid 0 R 1 R 2 R 3 R

Fields Are Represented by Gridded Data Field: Space Range {Numbers}, {Tensors}, {Whatever }

In All Cases, the Start of the Arrow is Discretized 0 R 1 R 2 R 3 R Space Range

Indicator Fields Represent Objects Blue indicates presence. Gray indicates absence. 1-0, true-false, present-absent: all are forms of binary indicators.

Discretizing the Range Creates Contours f : Space Range 1 f Space Levels (a discrete subset) Contours effectively represent realvalued fields as collections of objects.

Part 4 Hybrid Structures

Why Not Put the Two Together? Domain Space Range We can simultaneously represent objects and fields. The trick is to connect the two representations in useful ways. What do we gain?

Here are 20,000 blocks with people in them. Census Blocks

This subset of the road network contains over 100,000 features represented by a million total edges and vertices. Streets

Comparison of Two Geometries Travel time Euclidean distance (converted to travel time by using the average road speed)

Extending Travel Times Off-Network We create a discrete grid to represent off-network points. The grid actually is a network where each cell is connected to some of its neighbors (usually eight of them). The cost to travel left-right or up-down is fixed. The cost to travel diagonally is 2 times greater. Barrier cells are disconnected from all neighbors: one can neither leave them nor arrive at them. The network travel times are rasterized into the grid to establish initial conditions. The cost of this operation is proportional to the total length of the network and inversely proportional to the grid cell size. The computational cost and RAM requirements for finding travel times on the grid are inversely proportional to the square of the grid cell size, so there is a limit to the resolution we can use. In practice, the resolution is about 25 to 100 meters, or up to one minute in travel time at a walking pace.

Extending Travel Times: Illustration Compute travel times at every street node. Interpolate times along edges and rasterize them onto this grid. Mark the barrier cells. Extend travel times off-grid at a walking speed. These cells are 50 m across. The speed is 2 miles/hr. Notice the effect of the barrier in the lower left corner: the time at one cell is 4.47 minutes, although it is diagonally opposite (70 meters from) a cell just 1.24 minutes from the site.

Evaluation of this Hybrid Data Format Advantages: The network representation lets us apply Dijkstra s algorithm to get accurate on-network travel times. The grid representation handles off-network travel times well and copes with barriers easily. Once a grid is computed, obtaining a travel time to any location incurs a computational cost of O(1). This setup is ideal when a large number of travel times from a small set of given locations is needed. Analyzing travel-time differences among competing suppliers is easy. Disadvantages: Grids can occupy a lot of RAM. For example, to cover a 200 km region at 50 m resolution requires a 4000 x 4000 array of floats, occupying 64 MB.

Dreaming Why not let the system store spatial entities both as objects and fields? We can automate geoprocessing without needing to know the data structure: the system can choose the appropriate storage method and resolution. We can improve resampling, visualization, and analysis by using relevant parts of a (redundant) data structure. The casual user does not need to know or understand how spatial entities are represented. Accuracy can be arbitrarily high.

Conclusions Musing about the fundamentals of our profession how space is represented and how spatial objects are stored suggests creative ways to enhance the capabilities of GIS and the performance of spatial algorithms. Overlapping nontraditional grid structures Grids with dynamic, free boundaries Raster-vector hybrids Quantitative Decisions Rosemont, Pennsylvania