Representing Geography

Similar documents
Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Longley Chapter 3. Representations

Maps as Numbers: Data Models

Contents of Lecture. Surface (Terrain) Data Models. Terrain Surface Representation. Sampling in Surface Model DEM

Understanding Geospatial Data Models

Topic 5: Raster and Vector Data Models

layers in a raster model

Spatial Data Models. Raster uses individual cells in a matrix, or grid, format to represent real world entities

GIS Data Models. 4/9/ GIS Data Models

17/07/2013 RASTER DATA STRUCTURE GIS LECTURE 4 GIS DATA MODELS AND STRUCTURES RASTER DATA MODEL& STRUCTURE TIN- TRIANGULAR IRREGULAR NETWORK

Maps as Numbers: Data Models

Lecture 2: GIS Data Sources, Data Types and Representation. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

M. Andrea Rodríguez-Tastets. I Semester 2008

Lecture 6: GIS Spatial Analysis. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

SPATIAL DATA MODELS Introduction to GIS Winter 2015

DATA MODELS IN GIS. Prachi Misra Sahoo I.A.S.R.I., New Delhi

Statistical surfaces and interpolation. This is lecture ten

Geographic Surfaces. David Tenenbaum EEOS 383 UMass Boston

LECTURE 2 SPATIAL DATA MODELS

Lecture 06. Raster and Vector Data Models. Part (1) Common Data Models. Raster. Vector. Points. Points. ( x,y ) Area. Area Line.

Chapter 3: Maps as Numbers

GIS Data Collection. This chapter reviews the main methods of GIS data capture and transfer and introduces key practical management issues.

The GIS Spatial Data Model

Algorithms for GIS csci3225

Computer Database Structure for Managing Data :-

Raster Data. James Frew ESM 263 Winter

L7 Raster Algorithms

Representing the Real World

Data Models and Data processing in GIS

GEOGRAPHIC INFORMATION SYSTEMS Lecture 25: 3D Analyst

Algorithms for GIS. Spatial data: Models and representation (part I) Laura Toma. Bowdoin College

GIS Workshop Spring 2016

4/7/2009. Model: Abstraction of reality following formal rules e.g. Euclidean space for physical space

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

GIS in agriculture scale farm level - used in agricultural applications - managing crop yields, monitoring crop rotation techniques, and estimate

N.J.P.L.S. An Introduction to LiDAR Concepts and Applications

Maps as Numbers. Maps as Numbers. Chapter 3: Maps as Numbers 14SND Getting Started with GIS Chapter 3

Creating raster DEMs and DSMs from large lidar point collections. Summary. Coming up with a plan. Using the Point To Raster geoprocessing tool

Import, view, edit, convert, and digitize triangulated irregular networks

Raster GIS. Raster GIS 11/1/2015. The early years of GIS involved much debate on raster versus vector - advantages and disadvantages

GEOGRAPHIC INFORMATION SYSTEMS Lecture 02: Feature Types and Data Models

Introduction to Geographic Information Science. Some Updates. Last Lecture 4/6/2017. Geography 4103 / Raster Data and Tesselations.

Purpose: To explore the raster grid and vector map element concepts in GIS.

Tips for a Good Meshing Experience

Notes: Notes: Notes: Notes:

Attribute Accuracy. Quantitative accuracy refers to the level of bias in estimating the values assigned such as estimated values of ph in a soil map.

Analytical and Computer Cartography Winter Lecture 9: Geometric Map Transformations

Review of Cartographic Data Types and Data Models

Vector Data. James Frew ESM 263 Winter

v TUFLOW-2D Hydrodynamics SMS Tutorials Time minutes Prerequisites Overview Tutorial

EAT 233/3 GEOGRAPHIC INFORMATION SYSTEM (GIS)

I. An Intro to ArcMap Version 9.3 and 10. 1) Arc Map is basically a build your own Google map

GIS DATA MODELS AND SPATIAL DATA STRUCTURE

Spatial Analysis and Modeling (GIST 4302/5302) Database Fundaments. Database. Review: Bits and Bytes

Creating Surfaces. Steve Kopp Steve Lynch

Welcome to NR402 GIS Applications in Natural Resources. This course consists of 9 lessons, including Power point presentations, demonstrations,

Accuracy, Support, and Interoperability. Michael F. Goodchild University of California Santa Barbara

Introduction to GIS 2011

The Raster Data Model

Introducing ArcScan for ArcGIS

INTRODUCTION TO GIS WORKSHOP EXERCISE

MODULE 1 BASIC LIDAR TECHNIQUES

GIS LAB 8. Raster Data Applications Watershed Delineation

Lecture 4: Digital Elevation Models

Raster Data Models 9/18/2018

Georeferencing. Georeferencing: = linking a layer or dataset with spatial coordinates. Registration: = lining up layers with each other

Rasters are: The Raster Data Model. Cell location specified by: Why squares? Raster Data Models 9/25/2014. GEO327G/386G, UT Austin 1

The Raster Data Model

Thoughts on Representing Spatial Objects. William A. Huber Quantitative Decisions Rosemont, PA

LAB #7 Creating TIN and 3D scenes (ArcScene) GISC, UNIVERSITY OF CALIFORNIA BERKELEY

The 3D Analyst extension extends ArcGIS to support surface modeling and 3- dimensional visualization. 3D Shape Files

DIGITAL TERRAIN MODELLING. Endre Katona University of Szeged Department of Informatics

AUTOMATIC EXTRACTION OF TERRAIN SKELETON LINES FROM DIGITAL ELEVATION MODELS

Map Library ArcView Version 1 02/20/03 Page 1 of 12. ArcView GIS

Geographic Information Systems Notes For OC 3902/ OC 2902 Dr. James R. Clynch, 2002

Main concepts of ILWIS 3.0

Bharath Setturu Research scholar, EWRG, Center for Ecological Sciences, IISc, Bangalore & EWRG-CES IIIT-HYDERABAD

Data Assembly, Part II. GIS Cyberinfrastructure Module Day 4

WMS 10.1 Tutorial Hydraulics and Floodplain Modeling Simplified Dam Break Learn how to run a dam break simulation and delineate its floodplain

FUZZY DIJKSTRA ALGORITHM FOR SHORTEST PATH PROBLEM IN GEOGRAPHIC INFORMATION SYSTEMS

GIS Basics for Urban Studies

Image Services for Elevation Data

v SMS 11.1 Tutorial Scatter Data - TINs Requirements Scatter Module Map Module Time minutes Prerequisites None Objectives

Lecture 21 - Chapter 8 (Raster Analysis, part2)

Geographic Information Systems (GIS) Spatial Analyst [10] Dr. Mohammad N. Almasri. [10] Spring 2018 GIS Dr. Mohammad N. Almasri Spatial Analyst

8 Geographers Tools: Automated Mapping. Digitizing a Map IMPORTANT 2/19/19. v Tues., Feb. 26, 2019.

WMS 8.4 Tutorial Hydraulics and Floodplain Modeling Simplified Dam Break Learn how to run a dam break simulation and delineate its floodplain

I. Project Title Light Detection and Ranging (LIDAR) Processing

EEOS 381 -Spatial Databases and GIS Applications

Objectives This tutorial will introduce how to prepare and run a basic ADH model using the SMS interface.

Spatial Analysis and Modeling (GIST 4302/5302) Representation of Spatial Data. Representation of Spatial Data Models 3/5/18

Surface Creation & Analysis with 3D Analyst

What can we represent as a Surface?

8 Geographers Tools: Automated Mapping. Digitizing a Map 2/19/19 IMPORTANT. Revising a Digitized Map. The Digitized Map. vtues., Feb. 26, 2019.

v Importing Rasters SMS 11.2 Tutorial Requirements Raster Module Map Module Mesh Module Time minutes Prerequisites Overview Tutorial

Bharath Setturu Research scholar, EWRG, Center for Ecological Sciences, IISc, Bangalore & IIIT-HYDERABAD

Alaska Department of Transportation Roads to Resources Project LiDAR & Imagery Quality Assurance Report Juneau Access South Corridor

GIS Workbook #1. GIS Basics and the ArcGIS Environment. Helen Goodchild

Generating a New Shapefile

Overview. Image Geometric Correction. LA502 Special Studies Remote Sensing. Why Geometric Correction?

Transcription:

Data models and axioms Chapters 3 and 7 Representing Geography

Road map Representing the real world Conceptual models: objects vs fields Implementation models: vector vs raster Vector topological model Surfaces and networks

The world is infinitely complex The contents of a spatial database represent a limited view of reality the spatial database is representation of a model of reality Ontological and epistemological issues The user sees the real world through the medium of the database Representing the real world in a GIS

A database may include What kinds of objects exist in the real world? digital versions of real objects e.g., houses, roads, trees If I tell you that in the figure on the right there are digital versions of fictitious (i.e., invented) objects two cities (Vancouver and Burnaby), what exists that you cannot see? e.g., political boundaries, census tracts, forest stand boundaries Representing the real world

Some features are discrete, clearly defined entities (e.g., houses, districts) discrete Some representation things you can is no hold, problem are clearly delimited. Other features Other things exist everywhere are known through and vary measurements, continuously (e.g., temperature, ph) are infinite in scope. variation needs to be approximated using discrete representations Representing the real world

Reality GIS representation? Forest??? Non- Forest (Fuzzy logic) Sometimes, the distinction between discrete and continuous is not very clear

Computers are good at storing discrete spatial data, but bad at storing continuous spatial data (since everything in a computer is ultimately represented as a binary [0,1] number) (also, fractals) Representing the real world

GIScientists have been developing a conceptual model of space, independent of the way the data is physically represented in the computer: Object or entity view empty space littered with objects (points, lines or areas) Field view value is defined for every location Objects versus Fields Are these mutually exclusive?

Vancouver / Burnaby Census Tracts 3-D view of Prague Examples of objects

Elevation (a DEM) Groundwater nitrate concentration Examples of fields

Implementing, in a computer, valid representations of fields and objects requires a set of rules. These rules will depend on the particular (physical) data model selected. These rules convert (conceptual) geographic models into discrete (physical, computer-based) representations. Data model implementation

The two dominant representations: raster or grid data model vector data model What are the two dominant models? GIS data models

Divides the study area into square cells (a grid) Register the corners of the grid to the Earth Represent discrete objects as collections of one or more cells Represent fields by assigning attribute values (NOIR) to cells More commonly used to represent fields than discrete objects The raster data model

Legend Mixed conifer Douglas fir Oak savannah Grassland Raster representation: each color represents a different value of a (e.g.) nominal-scale field denoting land cover class. The raster data model

Pixel size The size of the cell or picture element, defining the level of spatial detail All variation within a pixel is lost (mixed pixel problem) Assignment schemes The value of a cell may be an average over the cell, a total within the cell, the commonest value in the cell, based on a hierarchy (ensure that rare things get recorded), or a boolean value (present [1] or not [0]) It may also be the value found at the cell s central point (centroid). (There are more rules.) The raster data model

Regardless of the assignment rule used, ambiguity will always be present. Sub-pixel Boundary pixel Intergrade Linear sub-pixel Mixed pixel problem

Saving space: Raster compaction methods Run-length encoding BBBBWWWB: 4B3W1B Block encoding Wavelet, JPEG, TIFF, etc. Quadtrees: Technical Box 8.1 in text 1 2 3 4 The raster data model

The raster data model is a member of a larger group of field models known as tessellation data models. Grid or raster Hexagonal TIN Quadtree All completely cover the space, and therefore can be used to represent fields (but not all are raster-based models) Tessellation data models

TIN Hexagons Spatial ecology Quadtree Cells Tesselation models

Real world objects are represented as points, lines and areas Points identify locations (0 D) Lines connect points (1 D) Areas (polygons) consist of connected line segments (2 D) The vector data model

points lines areas Node y Vertex The vector data model x

Objects are defined by their (x,y) coordinates in a coordinate system (spherical: lat,long or planar: x,y) Precision of coordinates virtually infinite (only machinedependent) (resolution) but: accuracy most often limited! A E.g., Distance A to B is given as 1458.394958397 m Data was digitized off of a 1:20,000 scale map. Is that distance meaningful? (Sig Dig) B The vector data model

Resolution: raster & vector

Precision: the smallest difference between adjacent positions that was recorded and stored (e.g., were distances measured to the nearest m or mm?). (Also applies to attributes?) Accuracy: the freedom from error (what is the truth?) Uncertainty: now the more commonly-used term, as it represents the overall ambiguity (map scale) Which demonstrates accuracy? Which demonstrates precision? Precision versus Accuracy

Conceptual models Objects (e.g. spot hts) Fields (e.g. Elevation) Vector (e.g. Contours, TIN) Raster (e.g. DEM) Implementation models

Road map Representing the real world Conceptual models: objects vs fields Implementation models: vector vs raster Vector topological model Surfaces and networks

Levels of GIS data model abstraction

C B B A B C A Points Lines Polygons Object representations

What Raster are grid some of Fishnet Polygons the ways Regular we point can grid store in ArcGIS 200 240 260 fields in a GIS? 180 130 200 270 170 220 250 Irregular points TIN Contour lines Common field representations

There are three main vector data structures: Simple (or spaghetti) data structure No intelligence, duplication of data (within a layer) (KML) Point dictionary No intelligence, no duplication of data (within a layer) Topological structure Intelligence, no duplication of data (within a layer) Vector data structures

Each point, line, or polygon is stored as a record in a file that consists of that entity s ID and a list of coordinates that define geometry. For Points: 1 2 ID Coordinates 1 2,3 2 5,5 Spaghetti Vector Data Model

Each point, line, or polygon is stored as a record in a file that consists of that entity s ID and a list of coordinates that define geometry. For Lines: ID Coordinates 1 (0,1), (3,4), (5,6) 2 (3,1), (5,2), (4,3) 1 2 Spaghetti Vector Data Model

Each point, line, or polygon is stored as a record in a file that consists of that entity s ID and a list of coordinates that define geometry. For Polygons: 1 2 ID Coordinates 1 (2,4), (4,3), (3,6), (2,4) 2 (3,1), (5,2), (4,3), (3,2), (3,1) KML Spaghetti Vector Data Model

Advantages simple efficient for display and plotting Disadvantages inefficient for most types of spatial analysis, especially for generalization Spaghetti Vector Data Model

Representing the real world Conceptual models: objects vs fields Implementation models: vector vs raster Vector topological model Surfaces and Networks Road map

Records coordinates (x,y) of spatial features and encodes spatial relations (i.e., which arcs are connected to a node, which polygons lie on either side of an arc, which arcs make up a polygon) Also called arc-node data model arc = line node = end-point of a line, or a point where two or more lines connect Arcs Topological data model

Planar Enforcement: No two individual features can overlap. There are no holes or íslands that are not themselves features. Every feature is represented as a record in the attribute table. 4 records - X How many topologicallycorrect areas? 6 records - Y Topological data model

1 2 3 Spaghetti: can encode as 2 or 3 polygons (and have 2 or 3 records in the attribute table) Topologic: must be encoded as 3 polygons (and have 3 records in the attribute table) Topological data model vs. Spaghetti

6 5 4 3 2 1 O = outside polygon B 1 A II I 4 5 III C 2 3 IV 1 2 3 4 5 6 6 Line From To Left Right 1 I III O A 2 I IV B O 3 III IV O C 4 I II A B 5 II III A C 6 II IV C B Node X Y I 1 4 II 4 4 III 6 4 IV 4 1 Poly Lines A 1,4,5 B 2,4,6 C 3,5,6 Topological data model

Can quickly answer these questions: which roads connect to the central square? which roads do I take to get from here to the hospital? what are the fertility rates in the neighboring districts? Topology: adjacency, connectivity, overlap, intersect Topological data model

Advantages stores spatial relations explicitly spatial analysis can be done without accessing coordinates Very useful for data validation (if topological rules are established) Disadvantages more complex data structure Topology and map projections topology needs to be re-established after each update Preferred system for high-end systems Topological data model ESRI s help files on topology

Road map Representing the real world Conceptual models: objects vs fields Implementation models: vector vs raster Vector topological model Surfaces and Networks

Two additional spatial entities should be introduced since they extend the basic properties of points, lines, areas and raster cells. For some groups / agencies, these are the most important routines / data structures in a GIS. These are: Surfaces Networks Two special spatial entities

As mentioned before, surfaces are examples of field data There are both raster and vector approaches to storing / presenting surface data Surfaces

Surface presentations 3D? 2.5D [Box 3.4] ArcGlobe

Remote sensing Radar (grid) Lidar (dense points) Stereo aerial or satellite imagery (grid) Photogrammetry (softcopy) (points, lines) Orthophotos Field collection Surveying (leveling, total stations) (points) GPS (points, lines) Existing data paper maps contours digitization (lines, points) Surfaces data collection Chapter 9

Grid data store as a raster Points, lines can store as is, but generally would want to create a more intelligent data structure. TINs (Triangulated Irregular Networks) were developed to intelligently store vector surface data Surfaces data storage

Quality TINs are built using VIPS (very important points, such as peaks, pits) and break lines (cliffs, roadways, rivers, lake shores). Triangulated Irregular Network (TIN)

FACES Threading a contour A topological model that stores elevations at the nodes, and associates slope with the edges and faces. TIN

TINs can, most often, represent a complex surface with much less data than a DEM, since in a DEM an elevation must be known for every cell in the DEM. However, deriving drainage networks, shortest paths, etc., are much easier to compute using DEMs. The information associated with every cell in a high resolution DEM can, in fact, be very low if there is little topographic relief. (Think spatial autocorrelation [e.g., a lake]). Surfaces TINs vs DEMs

Converting raster data structures (DEMs) to vector data is both easy (creating contours) and complex (creating quality TINs) Converting vector data structures (TINs) to other vector structures (contours) and to raster structures (DEMs) is easy Surfaces data conversion

Networks are another special case of a vector topological data structure Can be looped (e.g. bus routes) or radial (e.g., rivers) Special attributes are associated with Arc Node -- Arc relations turn tables (can you turn left/right, go straight); Stop, no stop Arcs Directionality (one way, two way, speed limit) Networks

Used for route analysis Couriers Emergency vehicles Garbage collection Vehicle navigation systems Used for hydrological analysis Flow of water through a river system Networks

Networks Planar enforcement?

An extended topological database model Networks

Network analysis Vector or raster

How real can we be? Thinking of space (O/F) Putting space into a computer (R/V) Structuring space (Topology) Adding complexity (Surfaces and Networks) Looking back