CPSC 695. Data Quality Issues M. L. Gavrilova

Similar documents
4.0 DIGITIZATION, EDITING AND STRUCTURING OF MAP DATA

Introduction :- Storage of GIS Database :- What is tiling?

GIS in agriculture scale farm level - used in agricultural applications - managing crop yields, monitoring crop rotation techniques, and estimate

Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Line Generalisation Algorithms Specific Theory

CPSC 695. Methods for interpolation and analysis of continuing surfaces in GIS Dr. M. Gavrilova

Module 1: Basics of Solids Modeling with SolidWorks

Vector-Based GIS Data Processing. Chapter 6

Lecture 7 Digitizing. Dr. Zhang Spring, 2017

Introducing ArcScan for ArcGIS

PDHonline Course L154G (5 PDH) Data in GIS. Instructor: Steve Ramroop, Ph.D. PDH Online PDH Center

GEOSPATIAL ENGINEERING COMPETENCIES. Geographic Information Science

VALLIAMMAI ENGINEERING COLLEGE

GIS Data Models. 4/9/ GIS Data Models

GIS Data Collection. This chapter reviews the main methods of GIS data capture and transfer and introduces key practical management issues.

Object modeling and geodatabases. GEOG 419: Advanced GIS

Exploring GIS Data. I) GIS Data Models-Definitions II) Database Management System III) Data Source & Collection IV) Data Quality

Topic 5: Raster and Vector Data Models

Terms and definitions * keep definitions of processes and terms that may be useful for tests, assignments

Analytical and Computer Cartography Winter Lecture 9: Geometric Map Transformations

SPATIAL DATA MODELS Introduction to GIS Winter 2015

DATA MODELS IN GIS. Prachi Misra Sahoo I.A.S.R.I., New Delhi

Data Models and Data processing in GIS

A CONSISTENCY MAINTENANCE OF SHARED BOUNDARY AFTER POLYGON GENERALIZATION

Lesson 5: Surface Check Tools

L1 - Introduction. Contents. Introduction of CAD/CAM system Components of CAD/CAM systems Basic concepts of graphics programming

Chapter 12 Solid Modeling. Disadvantages of wireframe representations

GISCI GEOSPATIAL CORE TECHNICAL KNOWLEDGE EXAM CANDIDATE MANUAL AUGUST 2017

3DCITY. Spatial Mapping and Holographic Tools for 3D Data Acquisition and Visualization of Underground Infrastructure Networks

THE TOOLS OF AUTOMATED GENERALIZATION AND BUILDING GENERALIZATION IN AN ArcGIS ENVIRONMENT

LECTURE 2 SPATIAL DATA MODELS

Advances in geographic information systems and remote sensing for fisheries and aquaculture

Geometric Modeling. Introduction

Lab 1: Exploring data format

Lecture 06. Raster and Vector Data Models. Part (1) Common Data Models. Raster. Vector. Points. Points. ( x,y ) Area. Area Line.

TOPOLOGICAL CONSTRAINTS, ACTIONS AND REFLEXES FOR GENERALIZATION BY OPTIMIZATION

TIPS4Math Grades 4 to 6 Overview Grade 4 Grade 5 Grade 6 Collect, Organize, and Display Primary Data (4+ days)

Scientific Visualization Example exam questions with commented answers

Creating an Environmental Database to Facilitate Groundwater Contamination Analysis. Jimmy Bowden Daniel B. Stephens and Associates

Geodatabase over Taita Hills, Kenya

Extraction of the valley bottom Toolset name : SPATIAL COMPONENTS

Lesson 2: Wireframe Creation

CAD package for electromagnetic and thermal analysis using finite element. CAD geometry import in flux : principle and good practices

Research Collection. Approaches to data modelling in GIS. Conference Paper. ETH Library. Author(s): Mitášová, Irena; Niederöst, Jana; Hájek, Milan

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

The GIS Spatial Data Model

Introduction to Solid Modeling Parametric Modeling. Mechanical Engineering Dept.

Overview.! Manual Digitizing! Heads-up Digitizing! Common Errors! Summary! Heads-up Digitizing Tutorial

Chapter 1. Introduction

by CHARLES TUCmR, TNRIS SYSTEMS CENTRAL

Feature Enhancements by Release

PARAMETRIC MODELING FOR MECHANICAL COMPONENTS 1

Lecture 6: GIS Spatial Analysis. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight

Technology for Cadastral Applications. John R. Hacker, Jr. Marketing Manager Geospatial Applications

v Working with Rasters SMS 12.1 Tutorial Requirements Raster Module Map Module Mesh Module Time minutes Prerequisites Overview Tutorial

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. LAB EXERCISE 1: Basic Mapping in ArcMap

layers in a raster model

17/07/2013 RASTER DATA STRUCTURE GIS LECTURE 4 GIS DATA MODELS AND STRUCTURES RASTER DATA MODEL& STRUCTURE TIN- TRIANGULAR IRREGULAR NETWORK

Introduction to Computer Graphics

DISAGGREGATION METHODS FOR GEOREFERENCING INHABITANTS WITH UNKNOWN CZECH REPUBLIC PLACE OF RESIDENCE : THE CASE STUDY OF POPULATION CENSUS 2011 IN THE

Gmsh GUI Tutorial I: How to Create a Simple 2D Model?

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability

ArcGIS Pro Editing. Jennifer Cadkin & Phil Sanchez

Import, view, edit, convert, and digitize triangulated irregular networks

Model data extraction. Mass property data. Mass property data. Mass property data. Integral Processes. Units and volume

Software Form Control

NAGI METHODS OF MAP QUALITY EVALUATION

Multidimensional (spatial) Data and Modelling (2)

RECOMMENDATION ITU-R P DIGITAL TOPOGRAPHIC DATABASES FOR PROPAGATION STUDIES. (Question ITU-R 202/3)

InfoVis: a semiotic perspective

GIS DATA MODELS AND SPATIAL DATA STRUCTURE

Everyday Mathematics

ANNEX V RASTER TO VECTOR CONVERSION MANUAL

v Overview SMS Tutorials Prerequisites Requirements Time Objectives

v Map Module Operations SMS Tutorials Prerequisites Requirements Time Objectives

SEOUL NATIONAL UNIVERSITY

1 Introduction. 1.1 Raster-to-vector conversion

Computer Database Structure for Managing Data :-

Strategy. Using Strategy 1

Lecture 4: GIS Data Input Methods and Techniques. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

Simulation-Supported Decision Making. Gene Allen Director, Collaborative Development MSC Software Corporation

Number- Algebra. Problem solving Statistics Investigations

Attribute Accuracy. Quantitative accuracy refers to the level of bias in estimating the values assigned such as estimated values of ph in a soil map.

Maths Standards File Year 4. KPI: Number and place value Counts in multiples of 6,7,9,25 and 1000.

Contents of Lecture. Surface (Terrain) Data Models. Terrain Surface Representation. Sampling in Surface Model DEM

Geometric Modeling Lecture Series. Prof. G. Wang Department of Mechanical and Industrial Engineering University of Manitoba

v Modeling Orange County Unit Hydrograph GIS Learn how to define a unit hydrograph model for Orange County (California) from GIS data

VECTOR ANALYSIS: QUERIES, MEASUREMENTS & TRANSFORMATIONS

Scalar Visualization

Making ArcGIS Work for You. Elizabeth Cook USDA-NRCS GIS Specialist Columbia, MO

GEOGRAPHIC INFORMATION SYSTEMS Lecture 02: Feature Types and Data Models

Maths. Formative Assessment/key piece of work prior to end of unit: Term Autumn 1

Introduction. Contour Lines Generation

COMPUTER AIDED ENGINEERING. Part-1

True Advancements for Longitudinal Weld Pipe Inspection in PA

Lab.4 & Assignment 2. Lab4. Conversion of Hardcopy Map to ArcGIS Map

Automated schematic map production using simulated annealing and gradient descent approaches

IMPORTANCE OF TOPOLOGY IN A GIS PROJECT OF MONITORING THE SOILS IN AGRICULTURAL LAND

Mid-term exam. GIS and Forest Engineering Applications. Week 5. FE 257. GIS and Forest Engineering Applications. Week 5

Transcription:

CPSC 695 Data Quality Issues M. L. Gavrilova 1

Decisions Decisions 2

Topics Data quality issues Factors affecting data quality Types of GIS errors Methods to deal with errors Estimating degree of errors 3

Development of computer methods for handling spatial data Computer cartography Computer cartography (digital mapping) is the generation, storage and editing of maps using a computer. Spatial statistics Initially, developments were in areas such as measures of spatial distribution, three-dimensional analysis, network analysis and modeling techniques. CAD Computer-aided design (CAD) is used to enter and manipulate graphics data, whereas GIS is used to store and analise spatial data. Traditional users of CAD used CAD for the production, maintenance and analysis of design graphics. 4

Causes of errors in spatial data Measurement errors: accuracy (ex. altitude measurement or soil samples, usually related to instruments) Computational errors: precision (ex. to what decimal point the data is represented?) Human error: error in using instruments, selecting scale, location of samples Data model representation errors Errors in derived data 5

What type of error? 6

Factors affecting reliability of GIS data Age of data collected at different times Arial coverage Map scale and resolution Density of observations Data formats and exchanges in formats Accessibility 7

Data quality issues: errors during data collections 8

Data quality issues: Sources of error in GIS Errors arising from our understanding and modeling of reality The different ways in which people perceive reality can have effects on how they model the world using GIS. Errors in source data for GIS Survey data can contain errors due to mistakes made by people operating equipment, or due to technical problems with equipment. Remotely sensed, and aerial photography data could have spatial errors if they were referenced wrongly, and mistakes in classification and interpretation would create attribute errors. 9

Data quality issues: Sources of error in GIS Perceptions or reality mental mapping Your mental map of your home town or local area is your personal representation. You could commit this to paper as a sketch map if required. Figure in next slide shows two sketch map describing the location of Northallerton in the UK. They would be flawed as cartographic products, but nevertheless are a valid model of reality. Keates (1982) considers these maps to be personal, fragmentary, incomplete and frequently erroneous. In all cases, distance and shape would be distorted in inconsistent ways and the final product would be influenced by the personality and experience of the cartographer. 10

Two mental maps of Northallerton, UK 11

Operational errors introduced during manual digitizing Psychological errors: Difficulties in perceiving the true centre of the line being digitized and inability to move the cursor cross-hairs accurately along it. Physiological errors: It result from involuntary muscle spasms that give rise to random displacement. Line thickness: The thickness of lines on a map is determined by the cartographic generation employed. Method of digitizing: Point mode and stream mode 12

Topological errors in vector GIS (a) Effects of tolerance on topological cleaning (b) Topological ambiguities in raster to vector conversion 13

Vector to raster classification error 14

Rasterization errors Vector to raster conversion can cause an interesting assortment of errors in the resulting data. For example Topological errors Loss of small polygons Effects of grid orientation Variations in grid origin and datum Topological error in vector GIS: (a) loss of connectivity and creation of false connectivity (b) loss of information 15

Errors in data processing and analysis GIS operations that can introduce errors include the classification of data, aggregation or disaggregation of area data and the integration of data using overlay technique. Where a certain level of spatial resolution or a certain set of polygon boundaries are required, data sets that are not mapped with these may need to be aggregated or disaggregated to the required level. 16

Attribute error due to processing Attribute error result from positional error (such as the missing hole feature in map A that is present as an island in map B). If one of the two maps that overlaid contains an error, then a classification error will result in the composite map (polygon BA). 17

GIS project design and management errors 18

Project management and human error Measuring error Typos/drawing errors Incorrect implementation error Planning/coordination error Incorrect use of devices error Erroneous methodology error Other human errors 19

Geometry related errors Rounding errors Processing errors Geometric coordinate transformation Map scanning, geometric approximations Vector to fine raster errors 20

Numerical modeling - propagating errors In numerical modeling and simulation errors propagate due to the multiplication of numerical error and initial measurement error. This presents additional challenges. 21

How errors can be controlled? Estimating degree of an error is an interesting area of GIS and computational science. 22

Methods to deal with errors Initial data: control quality of measurement, develop standards, prevent human error. Data models: select correct data models based on experience or model appropriateness, reduce errors during conversion from one to another. 23

Finding and modeling errors in GIS Checking for errors Probably the simplest means of checking for data errors is by visual inspection. Various statistical methods can be employed to help pinpoint potential errors. Estimating degree of an error helps in controlling and correcting errors 24

Estimating degree of an error - map digitizing errors Map digitizing errors in digitizing a line or a geometric shape are estimated by studying the number of segments for curve approximation and map properties (distorted, rotated map). Perkal s concept there is an epsilon error band around a digitized line within which the data is considered to be correct. 25

Estimating degree of an error raster to vector graphics Statistical approaches for error estimation based on probabilities of error Estimating the size of the grid and its influence on data approximation (Switzer s method) Double-conversion method for vector to raster conversion (Bregt s method): convert data twice, compare differences 26

Estimating degree of an error while processing data Errors associated with overlaying data and other spatial operations on data arise from inexact representation of original data and processing errors. Measurement of agreement between an original and overlaid polygons can be taken (McAlpine and Cook) 27

Estimating degree of progressing errors Two approaches to the modeling of errors in spatial data: epsilon modeling and Monte Carlo simulation. Epsilon modeling is based on a well known method of line generation (epsilon width). A Monte Carlo simulation approach has also been used to model the effects of error propagation in GIS overlays. 28

Methods to deal with progressing errors Conversion errors develop more reliable algorithms Numerical or progressing errors: maintain consistence of data or result utilize exact computation methods 29