ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)


ITARC 2010, Stockholm, 100420
Olle Olsson, World Wide Web Consortium (W3C), Swedish Institute of Computer Science (SICS)

Contents
- Trends in information / data: critical factors... growing importance
- Needs: highlighting areas of needs
- Technology: enabling technology
- Applications: examples


Trends: Information/Data Volumes
- Growing volumes
- Price/performance: networking, devices, storage, etc.
- Structured vs unstructured data!
- Challenge: capacity and capability to handle huge data volumes
Illustration source: IDC/EMC Digital Universe (2009)

Trends: Information/Data Volumes
- Total volume vs number of units
- Increased number of small data chunks
- Challenge: keeping track of many small-size data chunks
Illustration source: IDC/EMC Digital Universe (2009)

Trends: Information/Data Sources
- New types of devices arriving at an accelerating pace
- Sources of information / data
- Challenge: tracking data and dynamic sources
Illustration source: IDC/EMC Digital Universe (2009)

Trends: Information/Data Sources
- Data-generating device examples
- Wireless sensor networks: multipurpose...
- Mobile phones: GPS, temperature,...
Illustration source: Frost&Sullivan (2009)

Trends: Information/Data Communication
- Internet paradigm evolving: from transmission-centric to content-centric
- Focus on data
- Content distribution (cf. the web!)
- Challenge: how to package and describe data to profit from envisioned Internet functionality
Illustration source: UC San Diego (2009)


Emerging needs: stream processing
- Processing data on-the-fly
- Data/information generated continuously: streams of data
- Data stream processing
- Drivers, examples: sensor networks; Internet-of-Things; messaging, blogging, micro-blogging
- Challenge: how to process data efficiently/effectively
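As a minimal sketch of processing data on-the-fly, the Python generator below consumes readings one at a time and emits a moving average without ever holding the full stream in memory. The function name, the window size and the sample temperature readings are assumptions made for illustration; they do not come from the talk.

```python
from collections import deque


def moving_average(readings, window=3):
    """Yield the average of the last `window` readings as each one arrives."""
    recent = deque(maxlen=window)  # old readings fall out automatically
    for value in readings:
        recent.append(value)
        yield sum(recent) / len(recent)


# Hypothetical sensor stream; any iterator (socket, queue, file) would do.
sensor_stream = iter([20.0, 22.0, 24.0, 26.0])
averages = list(moving_average(sensor_stream, window=2))
```

Because the generator keeps only the current window, memory use stays constant however long the stream runs, which is the property that matters for continuously generated data.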

Emerging needs: the web data space
- Web context as picture of business context
- Drivers: interconnected sources of data; business interdependencies
- Data I need vs. data I have & data you have
- Growing volumes of available data (cf. public sector information)
- Data evolution
- Cost-efficiency!
- Challenge: how to increase automation, data interoperability, adaptive processing


Technology: Semantic Web
- Web technology
  - Formats (XML, WS-*, MathML, RDF, SVG,...)
  - Protocols (HTTP, SOAP,...)
  - Processing (XForms, DOM, POWDER, Pipeline,...)
- Semantic Web technologies
  - Rich representation: RDF, RDFS, OWL, SKOS,...
  - Processing support: OWL, RIF, SPARQL,...

The semantics in the Semantic Web
- Semantic web is about what? About meaning and automation: meaning-based automation
- Meaning, the pragmatic approach in the Semantic Web: a program knows what it can do with data; self-describing data
- No magic... instead, well-founded engineering

Semantic web building blocks (illustration not reproduced)

RDF: Resource Description Framework
- Basic data model: a triple
- Triple (s, p, o): s is the subject, p the predicate, o the object
- Conceptually: s is related by p to o
- RDF is a general model for such triples
- Machine-readable formats such as RDF/XML, Turtle, N3, RXR
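The (s, p, o) model can be sketched in a few lines of Python. The `Triple` type, the `to_ntriples_line` helper and the `http://example.org/` namespace are hypothetical names introduced here for illustration. The serialisation is a deliberately simplified N-Triples-like form: it treats anything starting with "http" as an IRI and everything else as a plain literal, and it does no escaping.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject s is related by predicate p to object o."""
    s: str
    p: str
    o: str


def to_ntriples_line(t: Triple) -> str:
    """Render a triple in a simplified N-Triples-like line (no escaping)."""
    def term(x: str) -> str:
        # Assumption: "http..." means IRI, anything else is a plain literal.
        return f"<{x}>" if x.startswith("http") else f'"{x}"'
    return f"{term(t.s)} {term(t.p)} {term(t.o)} ."


EX = "http://example.org/"  # hypothetical namespace
statement = Triple(EX + "car1", EX + "weight", "2449000")
line = to_ntriples_line(statement)
```

A production system would use an RDF library for parsing and serialisation; the point here is only that a triple is a plain three-field record.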

RDF Example
- Triples:
    dbpedia:resource/volkswagen_phaeton  dbpedia:ontology/weight  2449000
    dbpedia:resource/volkswagen_phaeton  dbpedia:ontology/manufacturer  dbpedia:resource/volkswagen
- A set of triples forms a graph: the RDF graph
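To make the step from a set of triples to a graph concrete, the sketch below loads the two triples from the slide and indexes them as labelled edges. The helper name `outgoing_edges` and the dictionary layout are assumptions chosen for illustration, not a standard API.

```python
from collections import defaultdict

# The two statements from the slide, as (subject, predicate, object) tuples.
triples = {
    ("dbpedia:resource/volkswagen_phaeton",
     "dbpedia:ontology/weight",
     "2449000"),
    ("dbpedia:resource/volkswagen_phaeton",
     "dbpedia:ontology/manufacturer",
     "dbpedia:resource/volkswagen"),
}


def outgoing_edges(triples):
    """Index triples as subject -> predicate -> set of objects."""
    graph = defaultdict(dict)
    for s, p, o in triples:
        graph[s].setdefault(p, set()).add(o)
    return graph


graph = outgoing_edges(triples)
# Every subject and object is a node; every triple is a labelled edge.
nodes = {s for s, _, _ in triples} | {o for _, _, o in triples}
```

Here the Phaeton node has two outgoing edges, one to the literal weight and one to the manufacturer resource, which is exactly the small graph drawn on the slide.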

OWL: Web Ontology Language
- Define ontologies (conceptual model,...) for data
- Built on top of RDF
- Basic components: instance (entity), class (type), property (relationship)
- An ontology enables: checking consistency of the instance graph; inferring implicit statements about the instance graph
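Inferring implicit statements can be illustrated with the simplest case: propagating class membership along subclass links. This is only a small RDFS-level fragment of what an OWL reasoner does, and the `ex:` class and instance names are hypothetical.

```python
def infer_types(triples):
    """Return the input plus all rdf:type statements implied by rdfs:subClassOf."""
    inferred = set(triples)
    changed = True
    while changed:  # repeat until no new statement can be derived
        changed = False
        subclass = {(s, o) for s, p, o in inferred if p == "rdfs:subClassOf"}
        typed = {(s, o) for s, p, o in inferred if p == "rdf:type"}
        for instance, cls in typed:
            for sub, sup in subclass:
                new = (instance, "rdf:type", sup)
                if cls == sub and new not in inferred:
                    inferred.add(new)
                    changed = True
    return inferred


# Hypothetical ontology and instance data.
asserted = {
    ("ex:LuxuryCar", "rdfs:subClassOf", "ex:Car"),
    ("ex:Car", "rdfs:subClassOf", "ex:Vehicle"),
    ("ex:phaeton", "rdf:type", "ex:LuxuryCar"),
}
implicit = infer_types(asserted) - asserted
```

Only the fact that `ex:phaeton` is a luxury car was stated; that it is also a car and a vehicle follows from the ontology.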

SKOS: Simple Knowledge Organisation System
- Define simple ontologies (conceptual model,...)
- Targeting traditional modelling approaches: taxonomies, classification schemes, thesauri,...
- Built on top of RDF
- Compatible with modelling standards: NISO Z39.19-2005; ISO 5964:1985
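A taxonomy or thesaurus expressed in SKOS is largely a set of concepts connected by skos:broader links. The sketch below walks those links to collect every ancestor of a concept. The concept names and the plain-dictionary representation are assumptions made for illustration.

```python
def broader_transitive(concept, broader):
    """Collect all ancestors of `concept` by following skos:broader links."""
    seen = []
    stack = list(broader.get(concept, []))
    while stack:
        current = stack.pop()
        if current not in seen:  # guards against cycles and repeats
            seen.append(current)
            stack.extend(broader.get(current, []))
    return seen


# Hypothetical concept scheme: concept -> list of directly broader concepts.
broader = {
    "ex:LuxuryCars": ["ex:Cars"],
    "ex:Cars": ["ex:Vehicles"],
}
ancestors = broader_transitive("ex:LuxuryCars", broader)
```

A search application could use such a walk to expand a query for a narrow term to the broader categories that contain it.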

SPARQL: RDF Query Language
- Retrieve RDF data from RDF data graphs
- RDF graph as answer to query
- Syntax inspired by SQL
- Example:

    PREFIX foaf: <http://xmlns.com/foaf/0.1/>
    PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
    PREFIX dbo: <http://dbpedia.org/ontology/>
    SELECT ?manufacturer ?name ?car
    WHERE {
      ?car skos:subject <http://dbpedia.org/resource/category:luxury_vehicles> .
      ?car foaf:name ?name .
      ?car dbo:manufacturer ?man .
      ?man foaf:name ?manufacturer
    }
    ORDER BY ?manufacturer ?name

- In progress, functionality for: update; subqueries; negation; etc.
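What the WHERE clause of such a query does can be sketched as pattern matching over triples: each pattern either binds its variables to values or rejects the triple, and patterns sharing a variable are joined on it. The code below is a toy matcher, not a SPARQL engine; the function names and the `ex:` sample data are hypothetical, and it covers neither prefixes, ORDER BY nor any other SPARQL feature.

```python
def match_pattern(pattern, triple, binding):
    """Extend `binding` so that `pattern` equals `triple`, or return None."""
    new = dict(binding)
    for pat_term, value in zip(pattern, triple):
        if pat_term.startswith("?"):          # variable: bind or compare
            if new.setdefault(pat_term, value) != value:
                return None
        elif pat_term != value:               # constant: must match exactly
            return None
    return new


def select(patterns, triples):
    """Return all variable bindings that satisfy every pattern."""
    bindings = [{}]
    for pattern in patterns:
        next_bindings = []
        for binding in bindings:
            for triple in triples:
                extended = match_pattern(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings


# Hypothetical data mirroring the shape of the query on the slide.
data = [
    ("ex:phaeton", "foaf:name", "Phaeton"),
    ("ex:phaeton", "dbo:manufacturer", "ex:vw"),
    ("ex:vw", "foaf:name", "Volkswagen"),
]
patterns = [
    ("?car", "dbo:manufacturer", "?man"),
    ("?car", "foaf:name", "?name"),
    ("?man", "foaf:name", "?manufacturer"),
]
rows = select(patterns, data)
```

The shared variables `?car` and `?man` are what join the three patterns, in the same way the slide's query links a car to its manufacturer's name.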

Triple store
- Database system for triples (RDF graphs): store, manage, query
- Implementation approaches:
  - Native triple store: RDF from the ground up
  - Triplified RDB: basic storage in relational form
- Interface: SPARQL
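The store / manage / query role of a triple store can be sketched with an in-memory class that keeps two indexes so lookups do not have to scan every triple. `TinyTripleStore` and its methods are hypothetical names for this illustration. A real triple store adds persistence, more index permutations and a SPARQL interface.

```python
from collections import defaultdict


class TinyTripleStore:
    """Minimal in-memory triple store with subject and predicate indexes."""

    def __init__(self):
        self._triples = set()
        self._by_subject = defaultdict(set)
        self._by_predicate = defaultdict(set)

    def add(self, s, p, o):
        triple = (s, p, o)
        self._triples.add(triple)
        self._by_subject[s].add(triple)
        self._by_predicate[p].add(triple)

    def find(self, s=None, p=None, o=None):
        """Return sorted triples matching the given terms; None is a wildcard."""
        if s is not None:
            candidates = self._by_subject.get(s, set())
        elif p is not None:
            candidates = self._by_predicate.get(p, set())
        else:
            candidates = self._triples
        return sorted(
            t for t in candidates
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)
        )


store = TinyTripleStore()
store.add("ex:phaeton", "foaf:name", "Phaeton")
store.add("ex:phaeton", "dbo:manufacturer", "ex:vw")
store.add("ex:vw", "foaf:name", "Volkswagen")
names = store.find(p="foaf:name")
```

This corresponds to the native approach on the slide. A triplified RDB would instead keep the same three columns in a relational table and translate lookups to SQL.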

Integration via SPARQL (illustration not reproduced)

SemWeb technology implementations
- Free / open source: Franz, Google, HP, Mozilla,...
- Commercial: IBM, Ontoprise, OpenLink, Oracle, Talis,...
- Still somewhat fragmented supplier space. And the winner will be...?
- But core standards are established!


Applications of SemWeb
- Aspects:
  - Technology: availability; quality; cost;...
  - Methodology: scope; depth;...
  - Application areas: selection; scope;...
  - Technological environment: interoperability;...
  - Needs evolution: stable, isolated; dynamic, open;...
- Ambition: small, component-oriented, add-on,...; large, all-embracing;...

SemWeb technologies vs the Web
- Semantic web technologies: support critical web requirements; not necessarily used on the web; can be an internal encapsulated component in some application
- Web requirements: positive effects on interoperability; maintainability; evolution;...
- SemWeb acceptance analogous to XML, e.g.: used in all contexts; standardised; uniform tool support; enables interoperability

Limited-scope application: Adobe XMP (Extensible Metadata Platform)
- Copyright, Creator, Date, Location,...
- Aim: share metadata across applications, file formats, and devices
- Metadata added by tools, e.g. Photoshop
- Formatted as RDF/XML
- Advantage: supports vendor-independent management of metadata

Limited-scope application: Dublin Core (DC)
- Document metadata (initial target: library metadata)
- Dublin Core Metadata Element Set: Title, Creator, Date, Publisher, Language,...
- Aim: make existing DC metadata accessible as RDF; simple change of format (syntax); semantics unchanged
- Advantage: using standard technologies simplifies data integration
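The "simple change of format" can be sketched as a mapping from a flat Dublin Core record to triples, one per field, with each field name turned into a property in the Dublin Core element namespace. The record, the document URI and the function name are hypothetical; the namespace `http://purl.org/dc/elements/1.1/` is the published one for the element set.

```python
DC = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace


def dc_record_to_triples(doc_uri, record):
    """Turn a flat field -> value record into (subject, predicate, object) triples."""
    return [(doc_uri, DC + field.lower(), value) for field, value in record.items()]


# Hypothetical library record and document URI.
record = {
    "Title": "An Example Report",
    "Creator": "A. Author",
    "Language": "en",
}
dc_triples = dc_record_to_triples("http://example.org/doc/1", record)
```

The values are carried over untouched, which is the sense in which the semantics stay unchanged: only the syntax moves to RDF.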

Limited-scope application: Microsoft Interactive Media Manager (IMM)
- Aim: extension to SharePoint for the media sector
- Metadata framework using RDF and OWL
- Support workflow for media assets
- Advantage: metadata sharing between systems; simplifies data integration
Illustration source: Microsoft

Larger application: Renault Automotive Repair
- Aim: generate context-sensitive diagnosis/repair information
- Fetch general information from several sources
- Provide user with relevant information
- Source information
- Metamodel in OWL
- Advantage: sharable modeling; unified view over distributed data

Larger application: NASA Expertise Finder - POPS
- Aim: find expertise for task at hand; 70,000 experts in organisation and contractor workforce
- Solution: search tool against several databases; RDF-based integrating framework
- Advantage: consistent data model; loosely coupled infrastructure; share and reuse information

Larger application: BBC Music
- Aim: provide rich information about broadcast music; not only broadcast data
- Solution: offer users fresh data, enable navigation to non-BBC sites; RDF-based; uses other RDF sources
- Advantage: minimizes own data management; when new sources appear, easy to extend

Linked Open Data Initiative
- Web of data: many open datasets on the web; interoperable when accessible as RDF
- Examples: Wikipedia (text) ==> DBpedia (RDF); scientific data sets (experimental data); public sector information (geodata, census data, statistics,...)
- Different aims and coverage, but semantically interrelated. Increasingly so over time!

Linked Open Data Initiative
- Experimental initiative: explore opportunities; identify technology strengths/weaknesses; synthesize best-practice guidance
- Stakeholders: data owners; data re-users
- Specific case, profit from experience: open public data; PSI directive
- How to make data easier to reuse?

Linked Open Data Initiative
- Linked Open Data (LOD) objective:
  - expose open datasets via RDF
  - set RDF links among the data items from different datasets; a typical example is to set an owl:sameAs between two items in different datasets that refer to the same thing
  - set up query endpoints (usually SPARQL)
- Altogether billions of triples, millions of links
- The seed for a general Web of Data
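One consequence of such identity links is that identifiers from different datasets fall into groups that all denote the same thing. The sketch below groups them with a small union-find structure. The dataset prefixes and item names are invented for illustration, and real deployments often treat identity links with more caution than full merging.

```python
def same_as_clusters(links):
    """Group identifiers connected by identity links into clusters."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    for a, b in links:
        parent[find(a)] = find(b)  # union the two groups

    clusters = {}
    for item in list(parent):
        clusters.setdefault(find(item), set()).add(item)
    return list(clusters.values())


# Hypothetical identity links between items in three datasets.
links = [
    ("datasetA:stockholm", "datasetB:city42"),
    ("datasetB:city42", "datasetC:sthlm"),
    ("datasetA:volkswagen", "datasetC:vw"),
]
clusters = same_as_clusters(links)
```

Once identifiers are clustered, statements made about any member of a cluster can be combined, which is what lets separately published datasets behave as one web of data.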

The LOD cloud: snapshots from March 2008, September 2008, March 2009 and July 2009 (diagrams not reproduced)

Applications using the cloud emerge
- Bookmarking systems, exploration of social graphs, financial reporting
- LOD nodes (e.g. DBpedia) provide a set of referenceable URIs for many things
- Worth looking at the proceedings of the latest workshop, for example April 2009 at WWW2009: http://events.linkeddata.org/ldow2009

Semantic Sensor Web
- Sensor networks deliver data
- Semantic Sensor Web (OGC: Sensor Web Enablement): deliver semantically represented data; control sensing & delivery via semantic specifications; discovery, access, tasking, alerts
- Towards a semantic Internet-of-Things
Illustration source: OGC (2007)

Reflections and Conclusions

Semantic techniques
- Toolset: basic standardised technologies; proven through use; emerging extensions and additions (needs driven)
- Applications: many products and applications in use
- Competence: awareness and experience growing
- Drivers: more demanding business needs; public sector initiatives

Growth of data / information
- Increased volumes: Moore's law helps!
- Increased diversity (more types of data): semantic techniques definitely appropriate
- Increased complexity (more interconnections): semantic techniques are enabling
- Increased rate of change (the tough challenge!)

Thank you for your attention