Geodatabase over Taita Hills, Kenya

Similar documents
Class #2. Data Models: maps as models of reality, geographical and attribute measurement & vector and raster (and other) data structures

Spatial Data Models. Raster uses individual cells in a matrix, or grid, format to represent real world entities

Topic 5: Raster and Vector Data Models

GIS Data Models. 4/9/ GIS Data Models

GIS DATA MODELS AND SPATIAL DATA STRUCTURE

9. GIS Data Collection

SPATIAL DATA MODELS Introduction to GIS Winter 2015

Introduction to Geographic Information Systems Dr. Arun K Saraf Department of Earth Sciences Indian Institute of Technology, Roorkee

LSGI 521: Principles of GIS. Lecture 5: Spatial Data Management in GIS. Dr. Bo Wu

LECTURE 2 SPATIAL DATA MODELS

SVY2001: Databases for GIS

Lecture 2: GIS Data Sources, Data Types and Representation. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

A GIS-BASED SUPPORT SYSTEM FOR DECLARATION AND VERIFICATION 1. INTRODUCTION IAEA-SM-367/13/02

Chapter 3: Maps as Numbers

Spatial data and QGIS

The GIS Spatial Data Model

GIS Workbook #1. GIS Basics and the ArcGIS Environment. Helen Goodchild

CHAPTER 10 Creating and maintaining geographic databases Overview Learning objectives Keywords and concepts Overview Chapter summary

Advances in geographic information systems and remote sensing for fisheries and aquaculture

Introduction to Geographic Information Science. Some Updates. Last Lecture 4/6/2017. Geography 4103 / Raster Data and Tesselations.

Lecture 06. Raster and Vector Data Models. Part (1) Common Data Models. Raster. Vector. Points. Points. ( x,y ) Area. Area Line.

DIGITAL IMAGE ANALYSIS. Image Classification: Object-based Classification

Lecture 4. Image Georeferencing, Accuracy and Precision, File Geodatabase

layers in a raster model

FOUR ADVANCES IN HANDLING UNCERTAINTIES IN SPATIAL DATA AND ANALYSIS

PART ONE - CONCEPT OF GIS 3

Introducing ArcScan for ArcGIS

Data Models and Data processing in GIS

Overview. Image Geometric Correction. LA502 Special Studies Remote Sensing. Why Geometric Correction?

Computer Database Structure for Managing Data :-

Map Library ArcView Version 1 02/20/03 Page 1 of 12. ArcView GIS

GOVERNMENT GAZETTE REPUBLIC OF NAMIBIA

Understanding Geospatial Data Models

Introduction to Geodatabase and Spatial Management in ArcGIS. Craig Gillgrass Esri

Representing Geography

Longley Chapter 3. Representations

Feature Enhancements by Release

DATA MODELS IN GIS. Prachi Misra Sahoo I.A.S.R.I., New Delhi

INTRODUCTION TO GIS WORKSHOP EXERCISE

Remote Sensing in an

Chapter 8 Visualization and Optimization

NRM435 Spring 2017 Accuracy Assessment of GIS Data

How to georeference images in ArcMap (Arcinfo, ArcReader, ArcView)

Welcome to NR402 GIS Applications in Natural Resources. This course consists of 9 lessons, including Power point presentations, demonstrations,

Technical Specifications

Main concepts of ILWIS 3.0

Introduction to GIS. Geographic Information Systems SOCR-377 9/24/2015. R. Khosla Fall Semester The real world. What in the world is GIS?

Introduction :- Storage of GIS Database :- What is tiling?

Geographic Information Fundamentals Overview

GIS data input. In the early days of GIS 1980s and early 1990s. We used canned datasets for teaching (from USA)

Addressing Geospatial Big Data Management and Distribution Challenges ERDAS APOLLO & ECW

RASTER ANALYSIS GIS Analysis Fall 2013

Exploring GIS Data. I) GIS Data Models-Definitions II) Database Management System III) Data Source & Collection IV) Data Quality

automatic digitization. In the context of ever increasing population worldwide and thereby

Google Earth Engine. Introduction to satellite data analysis in cloud-based environment. Written by: Petr Lukeš

Imagery and Raster Data in ArcGIS. Abhilash and Abhijit

Graphic Display of Vector Object

RASTER ANALYSIS GIS Analysis Winter 2016

4.0 DIGITIZATION, EDITING AND STRUCTURING OF MAP DATA

GIS in agriculture scale farm level - used in agricultural applications - managing crop yields, monitoring crop rotation techniques, and estimate

Terms and definitions * keep definitions of processes and terms that may be useful for tests, assignments

More than a drafting task. Integration for efficiency

Representing the Real World

The Feature Analyst Extension for ERDAS IMAGINE

MRR (Multi Resolution Raster) Revolutionizing Raster

GIS Data Collection. This chapter reviews the main methods of GIS data capture and transfer and introduces key practical management issues.

Geographic Information Systems. using QGIS

17/07/2013 RASTER DATA STRUCTURE GIS LECTURE 4 GIS DATA MODELS AND STRUCTURES RASTER DATA MODEL& STRUCTURE TIN- TRIANGULAR IRREGULAR NETWORK

8 Geographers Tools: Automated Mapping. Digitizing a Map IMPORTANT 2/19/19. v Tues., Feb. 26, 2019.

Prof. Jose L. Flores, MS, PS Dept. of Civil Engineering & Surveying

Geometric Rectification of Remote Sensing Images

GPS/GIS Activities Summary

How to Create a Tile Package

Georeferencing Imagery in ArcGIS 10.3.x

L7 Raster Algorithms

Attribute Accuracy. Quantitative accuracy refers to the level of bias in estimating the values assigned such as estimated values of ph in a soil map.

First Exam Thurs., Sept 28

Lecture 8. Database Management and Queries

Outline. 14. Query, Measurement, and Transformation. Spatial analysis. Definitions. The Snow map. What is spatial analysis?

BAT Quick Guide 1 / 20

DETERMINATION OF THE OPTIMUM PATH ON THE EARTH S SURFACE. (extended abstract) Abstract

8 Geographers Tools: Automated Mapping. Digitizing a Map 2/19/19 IMPORTANT. Revising a Digitized Map. The Digitized Map. vtues., Feb. 26, 2019.

CREATION OF THE SYSTEM OF DISTRIBUTED SHAREABLE GEOGRAPHICAL DATA BASES FOR CARRYING OUT COMPLEX RESEARCHES OF TERRITORIES

INCREASING CLASSIFICATION QUALITY BY USING FUZZY LOGIC

Lecture 6: GIS Spatial Analysis. GE 118: INTRODUCTION TO GIS Engr. Meriam M. Santillan Caraga State University

Review of Cartographic Data Types and Data Models

FREE TUTORING. Digitizing a Map. 8 Geographers Tools: Automated Mapping. The Digitized Map. Revising a Digitized Map 9/28/2018. Next class: First Exam

Lab 1: Exploring data format

Neighbourhood Operations Specific Theory

Contents of Lecture. Surface (Terrain) Data Models. Terrain Surface Representation. Sampling in Surface Model DEM

Alaska Department of Transportation Roads to Resources Project LiDAR & Imagery Quality Assurance Report Juneau Access South Corridor

EAT 233/3 GEOGRAPHIC INFORMATION SYSTEM (GIS)

NAGI METHODS OF MAP QUALITY EVALUATION

Image georeferencing is the process of developing a model to transform from pixel coordinates into GIS coordinates such as meters on the ground.

Module 10 Data-action models

PDHonline Course L154G (5 PDH) Data in GIS. Instructor: Steve Ramroop, Ph.D. PDH Online PDH Center

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. LAB EXERCISE 1: Basic Mapping in ArcMap

APPENDIX E2. Vernal Pool Watershed Mapping

GIS OPERATION MANUAL

ArcMap - EXPLORING THE DATABASE Part I. SPATIAL DATA FORMATS Part II

Transcription:

Geodatabase over Taita Hills, Kenya Anna Broberg & Antero Keskinen Abstract This article introduces the basics of geographical information systems (GIS) and explains how the Taita Hills project can benefit from them. GIS are used to store and manipulate geographical data. In a GIS data is stored in databases either in raster or vector format. Databases are widely used because of their flexibility in managing data. The Taita Hills database contains at this stage only a limited amount of different themes as roads, administrative area boundaries etc., but will be extended as the need arises and as more data is collected and produced. What is a geographical information system? Geographical information systems (hereafter GIS) are computer-based systems that are used to store and manipulate geographic information. A GIS is designed for the collection, storage and analysis of objects and phenomena where geographic location is an important characteristic or critical to the analysis (Aronoff 1995:1). Most information is somehow spatial and thus it is important to be able to place it in geographical context. The revolution of information technology has enabled the building of these computer systems handling geographical information. GIS technology has provided relatively instant access to the information via computer terminals. These enable locational data to be displayed and also updated directly on the screen. When geographical data is in digital form, it becomes possible to display and analyze the data in ways that are often much quicker and more effective than has been possible using manual techniques (Jones 1999:6). According to several sources (Aronoff 1995:39; Jones 1999:7; Kraak & Ormeling 1996:9-10) a computer system must provide a few sets of capabilities to handle georeferenced data in order to be a real GI-system. Through data input georeferenced existing data sets as digital maps or aerial photographs and satellite images are added to the system. Data can also be added through digitizing maps and documents. The management of the added data is usually done using the database approach. The core of a GIS is however manipulation and analysis of data. Examples of these capabilities could be data processing; interpreting and classifying the surveyed data; searching by location, class or attribute and modelling real-life processes or characteristics of a dataset. While analysing and modelling the data it can be turned into information. This means that data is no longer just values in the dataset, but they are converted into information that tells us something more than just the raw-data. Through data output (usually computer display or printed maps) the information is passed to the user or wider audience. In this communication process the visualisation of the information is a very important factor. GI-systems are indeed widely used in order to create maps, either normal paper maps or digital maps e.g. for Internet use. Data output is also significant for purposes of exploring data. Producing several visualisations of some dataset can tell a lot of new things about it. Why use GIS in the Taita project? The advantages gained while using GIS instead of traditional maps are remarkable. GIS data is maintained in digital format. As such the data is in a form more physically compact than that of paper maps, tabulations or other conventional types (Aronoff 1995:43). A countless amount of data can be stored for each location, whereas in conventional maps scale always limits the amount of information that can be presented.

Also the ability to manipulate the spatial data and corresponding attribute information and to integrate different types of data in a single analysis are unmatched by any manual methods (Aronoff 1995:43). With normal maps only visual analyses overlaying several maps offers similar capabilities. Visualisation-wise GIS provides the ability to keep storage of the data and its visualisation apart. This enables the production of different kinds of maps from the same data for diverse uses or user-groups. In the Taita project there are many varied assignments containing tasks relating to remote sensing, geoinformatics or producing maps. Here a common geographical database can be of great help. It will act as reference data for several datasets produced during the project like classifications of the satellite imagery and aerial photographs. Thus it must be accurately georeferenced and put into a coordinate system so that further datasets can be included in it. Also, using the ready-made datasets we have made, visualisations and maps can be easily produced for dissimilar purposes during the project. Making the database: technical solutions and contents Modelling a database A spatial database can be thought of as an integrated set of geographic information on a particular subject and area (Longley et al. 2002:226). A database consists of a digital representation of Earth s surface or near surface, in order to serve some problem-solving or scientific purposes (Longley et al. 2002:18)). A database can be built for one major project such as the Taita project, or it can be continuously maintained in order to serve daily service actions and transactions. Depending on the purpose, a database can vary in size from a megabyte to several petabytes (Longley et al. 2002:18). The geodatabase over Taita Hills is still quite small but will be extended and maintained during the period of the project when more information is acquired. A database is not only a collection of information about things; it is also, crucially, a representation of their relationships to each other. The objective in collecting and maintaining information in a database is to relate facts and situations that were previously separate (Aronoff 1995:151). In a database all the data are handled in the same context, which enables making far more comprehensive analysis and manipulations of data. In general, the database approach to storing geographic data offers a number of advantages over traditional file-based datasets. For instance, data collected and better organised at a single location reduces redundancy and duplication, and decreases maintenance costs. Furthermore, applications become data independent so that they can evolve separately over time and use the same data. Moreover, a database remains constant, enabling transferring user knowledge between applications more easily (Date 1995; Longley et al. 2002:226). The database over Taita Hills is an important method of storing, sharing and utilizing the data collected about the area during the project. While modelling a database three sequent and increasingly abstract stages can be distinguished: conceptual, logical and physical modelling (see Longley et al. 2002:233; Jones 1999:163). At the level of conceptual modelling object types (or classes as roads, forests etc) and their relationships are defined and proper geographic representation for these is selected (Longley et al. 1999:234). Then a logical model for the database, or database type and structure is chosen. Of these the relational structure is most widely applied. It is based on a simple and powerful idea whereby information is represented by reasonably simple tables, which are subjected to various relational operators (Jones 1999:175). With the aid of functions such as primary keys and relational joins the relational structure often operates very effectively. Thus, the information from various sources may be linked together and analysed in the same context enabling a number of operations, which would be unreasonable without the relational structure. Most simply relational database can be made using e.g. ArcView shapefiles. The organization of its data files can be described in terms of records, fields and keys. Each entity (for example each polygon representing an administrative area) has its own row, or record, in a shapefile. A record is divided into fields, each of which contains an item of data (e.g. area name, area number). Some or one of the fields is termed as

key, a record with which the record is retrieved from the data file. In this case the key could be an area number. Fields that are not designated as key fields and containing whatever kind of data connected to the entity are termed attribute fields (Aronoff 1995:155). The final stage in designing a database is physical modelling, or defining how the data will be loaded into the database. It is the definition of the actual physical database schema that will hold the database data values. There are several definition languages such as SQL, which are used to build the physical model of databases. It is at this level that data items are defined in terms of parameters such as the number of bytes of storage they occupy and the address at which they are to be found in physical memory (Jones 1999:164). Choosing between raster and vector data Choosing the types of geographical representation is a critical database design task. There are two spatial data types, raster and vector, which are used to represent aspects of the real world (Figure 1). The raster data model is associated to field type continuous phenomena, such as elevation, and is based on an array of cells each having its own value. The raster data model has the great virtue of simplicity but it can produce very large files. The accuracy with which a raster layer can represent spatial data is related to the size of the pixel. The smaller the pixel size the greater the accuracy. However, larger grids need more space for storage and data handling. As for the vector data model, it can be divided into three different forms of representing objects: point, line or polygon. It is usually associated with representing phenomena that are discreet objects by nature. In this method all objects are created using points, whose coordinates are stored, and more complex combinations of these. Vector modelling has a precise nature of representing discreet objects with the storage efficiency and availability of functional tools for various operations (Longley et al. 2002:189). Contents of the Taita database The information of the database over Taita Hills is mainly based on the maps, satellite images and aerial photographs acquired from different sources. The maps over the area are in scale 1:50 000 from the year 1991. The SPOT XS satellite image from 1987 is presented in Figure 2. The maps have been converted to digital format in the scanning process. Furthermore, in order to make the maps spatially functional they have been registered to the Transverse Mercator projection used in the area, and also used on the original paper maps. Digital aerial photography was acquired in spring 2003. The photography produced a large amount of separate pictures, which will be mosaicked in order to get a wider perspective of the area. From the pictures also a digital elevation model can be generated using terrain distortion. All these processed data will be used to extend and update the database. Classifications will be made from the pictures, which makes it possible to get more accurate information over the area for features such as settlement, new roads, forests, cultivated ground etc. The database over Taita Hills will include data both in raster and vector formats. The vector data model is used to represent discreet objects such as roads, rivers and administrative area boundaries. Furthermore, railways, lakes, water points and airstrips are built using the vector data model. The construction of the vector shapefiles is carried out using ArcView software. The created files are also suitable for use in other software, such as MapInfo after converting them. The raster data model will be used at the later stage in order to include image files made with software packages for remotely sensed images or aerial photographs such as Erdas Imagine. We digitised several feature classes from the scanned maps. Roads (digitised as lines) were then classified into five different categories and the numbers expressing the classes were saved as an attribute. This enables the creation of visualisations based on the attributes (Figure 3).

In: Pellikka, P., J. Ylhäisi & B. Clark (eds.) Taita Hills and Kenya, 2004 seminar, reports and journal of a field excursion to Kenya. Feature in real life Feature in vector format Feature in raster format Figure 1. Comparison of vector and raster data models. Figure 2. SPOT XS satellite image from January 1987 showing the Taita Hills on the left and Sagala Hills on the right.

Figure 3. A simple visualisation showing the different road classes. Figure 4. Every sub-location has its own number and relates to one of the divisions. Later it might be useful to build topology for the road network. The topologically structured format is designed to encode geographic information in a form better suited for spatial analysis and other geographic studies (Aronoff 1995:113). Vector-based models enable the determination of the spatial relationships between the components. Spatial relationships of connectivity and adjacency are examples of topological relationships (Jones 1999:32). This means that for example analyses about moving around the road network could be made.

Administrative area boundaries were digitised as polygons; first creating the highest-level boundaries and then splitting these to smaller units using lower level boundaries. For many purposes, meanings are standardised in the form of classifications consisting of a set of categories. Often categories are grouped or classified into successively higher levels that enable phenomena to be referred to at different levels of abstraction (Jones 1999:18). We used this hierarchical structure with administrative regions, which were classified at three levels. Thus, the administrative regions may be examined in various levels of accuracy depending on the scale preferred (Figure 4). Moreover, Tsavo national park, airstrips and some lakes are found in the database as polygon shapefiles. All the shapefiles were registered in the right projection. In this phase of the project only some point features (waterholes) were included in the database. More themes can be added as the need arises. Also we found that it is not use digitising very many features, because many of the things of interest are more easily and accurately gained from the classifications carried out using Erdas Imagine. As a conclusion we would like to emphasize that this stage of database construction was carried out only in favour to create a reference dataset to advantage further research. As information is added into the base, also the possibilities for analyses increase. References Aronoff, S. (1995). Geographic information systems: a management perspective. 4 th printing. 294 p. WDL Publications, Ottawa. Date, C.J. (1995). Introduction to Database Systems. 6 th edition. 839 p. Addison-Wesley, Reading, Massachusetts. Longley, P.A., M.F. Goodchild, D.J. Maguire & D.W. Rhind (2002). 3 nd edition. 454p. Geographic information systems and science. John Wiley & Sons, Chichester, England. Jones, C. (1999). Geographical Information Systems and Computer Cartography. 3 rd printing. 319 p. Addison Wesley Longman Limited, Singapore. Kraak, M.-J. & F. Ormeling (1996). Cartography: visualization of spatial data. 222 p. Longman, Harlow.