Using Free and Open Source GIS to Automatically Create Standards- Based Spatial Metadata

Similar documents
Gower Street, London, WC1E 6BT UK Tel. +44 (0) Fax +44 (0) PB Wageningen, The Netherlands

Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia - First Investigations

Setting up the Ihlet Social Cadastre

Statistical Viewer. Coping with the Complexity

ewater SDI for water resource management

Introduction to GeoServer

Relation between Geospatial information projects related to GBIF

Managing Translations for Blaise Questionnaires

World Premium Points of Interest Getting Started Guide

WFD Art. V groundwater body data gap analysis

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

Welcome to Verizon Business Internet Global Utilization Statistics

World Premium Points of Interest Getting Started Guide

NFMS web portals Focus on monitoring functions

Common European Submission Portal

User Manual for the delivery of a new national Natura 2000 database to the Commission. Version 1.1

Supplier Invoice Submission Guide. English

World Premium Points of Interest Getting Started Guide

Items exceeding one or more of the maximum weight and dimensions of a flat. For maximum dimensions please see the service user guide.

From data source to data view: A practical guide to uploading spatial data sets into MapX

ARCHITECTURE OF SPATIAL DATA INFRASTRUCTURE (SDI) (DRAFT)

DanubeGIS User Manual Document number: Version: 1 Date: 11-Nov-2016

Introduction to the Dimensionally Extended 9 Intersection Model (DE-9IM) in PostgreSQL/PostGIS Tutorial

Mississippi Public Schools 2015

EarthLookCZ as Czech way to GMES

Configuring a Microstrategy Resource in Metadata Manager 9.5.0

County of Los Angeles. Chief Information Office Preferred Technologies for Geographic Information Systems (GIS) Version 2 May 2015

PROCESSING ZOOPLA HISTORIC DATA

WORKSHOP ON ALL WEEE FLOWS 14/02/17 Alberto Canni Ferrari ERP Italy Country General Manager

Croatian Motorways Road Database Management System

The European Commission s science and knowledge service. Joint Research Centre

Sberbank Online User Guide

The power of PostgreSQL exposed with automatically generated API endpoints. Sylvain Verly Coderbunker 2016Postgres 中国用户大会 Postgres Conference China 20

GUIDE TO ONLINE APPLICATION FOR SPACE Individual and Group Applications 15 May 2013

INSPIRE roadmap and architecture: lessons learned INSPIRE 2017

ISESS An Integrated Decision-Support Information System on the Impact of Extreme Natural Hazards on Critical Infrastructure

International Telecommunication Union

Introduction to INSPIRE. Network Services

Professional, quality services

Verwaltung und Qualitätssicherung von BIM-Modellen via IFCWebServer.org Data Server

Putting the Pieces together: Combining Open Source Components to create a WebGIS.

GeoNode Intro & Demo

CUSTOMER GUIDE Interoute One Bridge Outlook Plugin Meeting Invite Example Guide

EUROPEAN READY-MIXED CONCRETE INDUSTRY STATISTICS YEAR

Country-specific notes on Waste Electrical and Electronic Equipment (WEEE)

Schema transformations in mass data harmonization process. Yohann Ly 8/9 Octobre 2013 Marne la Vallée

The Butterfly Effect. A proposal for distribution and management for butterfly data programs. Dave Waetjen SESYNC Butterfly Workshop May 10, 2012

DELIVERABLE D12.6/12.7 IECM Database Extension & User Interface with Tabulator

CRE investment weakens in Q as investors struggle to find product in prime markets

Technology Platform. Spectrum. Version 10.0 SP1. Spectrum Spatial Getting Started Guide

5 Extract the information of location from the geometry column of PostgreSQL table

Key Terms. Attribute join Target table Join table Spatial join

MOBILE AND WEB GIS APPLICATION FOR MAPPING AND MONITORING LAND FEATURES

INSPIRE: The ESRI Vision. Tina Hahn, GIS Consultant, ESRI(UK) Miguel Paredes, GIS Consultant, ESRI(UK)

RadiantBlue Technologies, Inc. Page 1

Web Map Servers. Mark de Blois. Septembre 2016

HUMBOLDT Application Scenario: Protected Areas

This document is a preview generated by EVS

International Packets

InCLUDE Data Exchange. Julia Harrell, GISP GIS Coordinator, NC DENR

Networking Session - A trusted cloud ecosystem How to help SMEs innovate in the Cloud

This document is a preview generated by EVS

PostgreSQL/PostGIS: Introduction

Real Estate Sales, New York NY, 2014

International Roaming Critical Information Summaries JULY 2017

pgrouting extends the postgis/postgresql geospatial database to provide geospatial routing and other network analysis functionality.

Open source technologies and INSPIRE network services in Slovak environmental agency

ArcGIS Server: publishing geospatial data to the web using the EEA infrastructure

THE BEST OF ICT WITH A HUMAN TOUCH

ESHI Diploma THE EUROPEAN SPECIALISATION IN TRANSPLANTATION IMMUNOLOGY, HISTOCOMPATIBILITY & IMMUNOGENETICS

Data Model and Management

Spectrum Spatial Map Uploader Guide

Quickly design labels and signs with Brady Workstation Apps. 30 day free trial for every app!

Training courses. Course Overview Details Audience Duration. Applying GIS

Mapping, Visualization, and Digitization of the Geo-Referenced Information: A case study on Road Network Development in Near Real Time

This document is a preview generated by EVS

This document is a preview generated by EVS

EU funded research is keeping up trust in digital society

Networking European Digital Repositories

Part 2: Build your own Map Tour with ArcGIS Online

European Cybersecurity PPP European Cyber Security Organisation - ECSO November 2016

MapInfo Professional Evolution!

Setting Up and Deploying Geospatial Data Shares Using the QGIS Server Application

Spotfire Extension for OpenSpirit User s Guide

file:///c:/users/c_harmak/appdata/local/temp/arc8f36/tmp308d.tmp.htm

Basic Tasks in ArcGIS 10.3.x

Learn Well Technocraft

Customers want to transform their datacenter 80% 28% global IT budgets spent on maintenance. time spent on administrative tasks

Fast 3D EMC/EMI Scan with Detectus Scanning System and Tektronix Real Time Spectrum Analyzers CASE STUDY

Corporate R&D The Case of ICT Companies

List of nationally authorised medicinal products

Insights for Clubs

u D i g W a l k t h r o u g h 2 E d i t w i t h u D i g a n d W F S - T

Observation trends: Expectations from European Comission regarding data exchange and interoperability

INSPIRE Download Services of the CNIG (Spain) Emilio López Romero

D4.4 Procedures GIS Update System report. FORCIP+: Forest Roads for Civil Protection

Analysis of the Interoperability Possibilities of Implemented Governmental e-services EU15

BoR (10) 13. BEREC report on Alternative Retail Voice and SMS Roaming Tariffs and Retail Data Roaming Tariffs

Tutorial 2: Query and Select TRI Spatial Data to Study State-Wide Emissions Quantum GIS

ICT Legal Consulting on GDPR: the possible value of certification in data protection compliance and accountability

Transcription:

Using Free and Open Source GIS to Automatically Create Standards- Based Spatial Metadata Claire Ellul University College London

Overview The Problem with Metadata Automation Results Further Work

The Problem with Metadata Metadata Is data about data Gives you important information such as When the data was created Who by For what purpose When it was updated How to obtain the data

The Problem with Metadata However.... Metadata is boring!

The Problem with Metadata and Metadata is: Complex and time consuming to create Requires expertise about the data Requires expertise about how to create useful metadata How much detail should be included? Who are the end users of the metadata? Requires MAINTENANCE when data changes!

Metadata Automation Many elements of standards-based metadata may potentially be automatically created including: Keywords Dataset language Metadata language FOSS tools provide a great environment for this!

Metadata Element Resource Title Resource Abstract Resource Type Resource language Keyword(s) Bounding Box Metadata language Last Revision Date Automation Potential Created manually. If not inserted by the user, default value is the dataset name (i.e. the PostGIS table name). Created manually. Can default to dataset. Automatically populated by PostGIS Can be automated using language detection algorithms This could be implemented by concatenating all text fields of the dataset and picking the top 10 repeating words while eliminating common words. Can be automatically identified from the spatial coordinates in the dataset This can be detected by applying a language detection algorithm to the metadata Automatically update the metadata when the data changes

Metadata Element Metadata Date Responsible Party Metadata Contact Resource Identifier Metadata Geometry Automation Potential Automatically defaults to the date the metadata was created/updated Can be populated automatically depending on the login (user id) for PostGIS Can be populated automatically depending on the login (user id) for PostGIS Can be automatically generated using the metadata record ID PostgreSQL identifiers Automatically created as a spatial geometry in PostGIS

Quantum GIS - Data Import: 1. Shapefile to PostGIS 2. Metadata automatically created (Other import tools to be added) Any GIS - Data Editing: 1. Users can search metadata spatially or via text (SQL) 2. Metadata automatically updated if data edited Future: Web Services WFS, WMS, Catalog Based on the stored Data and Metadata PostGIS 2.0 Database Spatial Datasets Metadata Tight Coupling (Triggers)

Triggers in PostGIS CREATE OR REPLACE FUNCTION public.add_boundingbox() RETURNS trigger AS $boundingbox$ -- this trigger function calculates the bounding box (Xmin, Xmax, Ymin, Ymax) of a new dataset added to the database and inserts it in the metadata table Declare Begin End; table_name text; --variable that holds the name of the table (i.e. dataset) the_coord real; -- used to store the long/lat values curs1 refcursor; -- used to hold SQL query results. the trigger code goes here $boundingbox$ LANGUAGE plpgsql VOLATILE; -- VOLATILE indicates that the function value can change

Triggers in PostGIS Open curs1 FOR EXECUTE CLOSE curs1; 'SELECT ST_XMax(ST_Extent(ST_Transform(the_geom,4326))) as the_coord FROM ' table_name; FETCH curs1 into the_coord; EXECUTE 'UPDATE metadata SET bb_eastbound_long = ' the_coord ' WHERE dataset_name = ' quote_literal(table_name);

Triggers in PostGIS A series of INSERT triggers are run every time a new metadata record is created: CREATE TRIGGER add_boundingbox AFTER INSERT ON public.metadata FOR EACH ROW EXECUTE PROCEDURE public.add_boundingbox(); A series of metadata update triggers are run every time a dataset is modified, e.g.: CREATE TRIGGER roaddata_bb_update AFTER INSERT OR UPDATE OR DELETE on roaddata FOR EACH ROW EXECUTE PROCEDURE update_bounding_box_roaddata(); ** NB: A new version of this trigger function is created automatically when a new spatial dataset is inserted into the database.

Add Bounding Box (PL/pgSQL Trigger) Select the minimum Longitude, transforming into WGS84 if required Repeat for minimum latitude, maximum longitude and latitude Insert the values into the metadata table columns Create a trigger to run this process every time the dataset is edited

Identifying Keywords Identify Text Fields in the Data Set (PL/pgSQL Trigger) Split any text into single words using the space character as delimiter Create a single column list of all the words using the SQL UNION query Use an SQL GROUP BY query to identify the 10 most frequently used words

Testing the System Metadata was created for Open Street Map datasets: Points of interest, administrative boundaries, road network and location data And for 10 European countries UK, Austria, Greece, Malta, Italy, Spain, Belgium, Netherlands, Portugal, Sweden

Points of Interest Keywords - Keywords were in English and included: - Public, Services, Tourism, Tree, Automotive Location Data Keywords Keywords were predominantly in English and included locality, hamlet, village Also included place names: Aachen, Birmingham, Munchen, Trento

Administrative Areas Keywords For keywords, the datasets yielded numbers such as 8, 6, 9, 10 in some cases Road Network Keywords Keywords were predominantly in English and included track, footway, cycleway Also included the words for street in other languages: via (Italian) calle (Spanish), strasse (Austria)

Summary Results Using FOSS (in particular PostGIS) means that the resulting data and metadata can be accessed from other GIS packages Metadata is automatically updated when data is modified No matter which software is used to edit the data Using a central database means that the data and metadata can be published via OGC services such as WFS and Catalog Service for Web

Further Work Testing with additional, more appropriate, single language datasets from different sources Extending the system to allow metadata to be created automatically for ANY spatial data in a PostGIS database, no matter how it is loaded Publishing the data and metadata via tools such as GeoServer

Further Work Identify and resolve any issues related to performance i.e. the time taken to create the metadata each time the data is modified. Improve handling of non-latin character-sets Thinking about deployment how to ensure that the approach can be used by users not having spatial database expertise

Any Questions? c.ellul@ucl.ac.uk

Metadata Element Resource Title Resource Abstract Resource Type Resource language Keyword(s) Bounding Box Date of publication Date of last revision Automation Potential Created manually. If not inserted by the user, default value is the dataset name. Created manually. Can default to dataset Can be automated using language detection algorithms This could be implemented by concatenating all text fields of the dataset and picking the top 10 repeating words while eliminating common words. Can be automatically identified from the spatial coordinates in the dataset Can default to the date that data was uploaded to the system, with updates when the data is edited. Manual verification required by the end user. Default to the date the data was uploaded to the system. Update automatically any time data edited

Metadata Element Date of creation Limitations on public access and conditions of use (2 elements) Responsible party name, email and role (3 elements) Metadata contact name, email and date (3 elements) Metadata language Automation Potential Default to the date the data was uploaded. Manual verification required by the end user Given the academic context, a default value can be assigned, perhaps taking the most open value or perhaps on a per project basis. Based on user groups (identified from the user s login details and a corresponding lookup table). This can be derived from the database login of the person uploading the dataset or creating the new dataset. This can be detected by applying a language detection algorithm to the metadata

The Problem with Metadata Metadata is important for academic research: The EU FP7 project SECOA is developing models of coastal conflicts in countries including Italy, India, the UK, Portugal, Israel, Vietnam, Sweden and Belgium Data underpins these models

The Problem with Metadata Metadata is important for academic research: However, comparable data is not always available so alternative data is sometimes substituted In the Italian case, the total Employees in industrial sector in Rome is not available. The number of employees was not available so the number of local units in the industrial sector has been used Without this metadata, the SECOA team could be comparing employees with industrial units!