Linked Data Demystified: Practical Efforts to Transform CONTENTdm Metadata for the Linked Data Cloud

Similar documents
CONTENTDM METADATA INTO LINKED DATA

Not Just for Geeks: A practical approach to linked data for digital collections managers

Running head: LINKED OPEN DATA IMPLEMENTATION REPORT 1

Managing the Transition from Large-Scale Oral History Research to Digital Archive: The Digital Librarian s Perspective

Using link resolver reports for collection management

Digital Preservation Efforts at UNLV Libraries

Semantic e-science. Bibliographic Cloud

Why You Should Care About Linked Data and Open Data Linked Open Data (LOD) in Libraries

Guidelines for Developing Digital Cultural Collections

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

Future Trends of ILS

Digital Public Space: Publishing Datasets

The Emerging Data Lake IT Strategy

Contribution of OCLC, LC and IFLA

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015

MarkLogic 8 Overview of Key Features COPYRIGHT 2014 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

Linked Data Evolving the Web into a Global Data Space

> Semantic Web Use Cases and Case Studies

NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by

Ivan Herman. F2F Meeting of the W3C Business Group on Oil, Gas, and Chemicals Houston, February 13, 2012

Multi-agent and Semantic Web Systems: Linked Open Data

Linked Data in Archives

I know I m preaching to the choir when I say librarians are really good at creating quality data.

From Online Community Data to RDF

Linked Data Overview and Usage in Social Networks

A Formal Definition of RESTful Semantic Web Services. Antonio Garrote Hernández María N. Moreno García

Prof. Dr. Christian Bizer

Reducing Consumer Uncertainty

COMP6217 Social Networking Technologies Web evolution and the Social Semantic Web. Dr Thanassis Tiropanis

INF3580/4580 Semantic Technologies Spring 2015

Update on the TDL Metadata Working Group s activities for

> Semantic Web Use Cases and Case Studies

Linked Data: What Now? Maine Library Association 2017

Alphabet Soup: A Metadata Overview Melanie Schlosser Metadata Librarian

a paradigm for the Semantic Web Linked Data Angelica Lo Duca IIT-CNR Linked Open Data:

Enhancing discovery with entity reconciliation: Use cases from the Linked Data for Libraries (LD4L) project

WorldCat Digital Collection Gateway Visibility for Digital Collections

Semantic Web Systems Linked Open Data Jacques Fleuriot School of Informatics

Database of historical places, persons, and lemmas

The Semantic Web DEFINITIONS & APPLICATIONS

Linking Distributed Data across the Web

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

Linked data implementations who, what, why?

RDF and Digital Libraries

The Emerging Web of Linked Data

SEMANTIC WEB AN INTRODUCTION. Luigi De

Library Technical Services Process Improvement Based on LEAN

Data Governance for the Connected Enterprise

Semantic Web Fundamentals

Resilient Linked Data. Dave Reynolds, Epimorphics

Chinese Geo-Names Calculator A Linked Data Approach

The Linking Open Data Project Bootstrapping the Web of Data

JENA: A Java API for Ontology Management

The Data Web and Linked Data.

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Europeana update: aspects of the data

Towards Open Innovation with Open Data Service Platform

Linked Data and RDF. COMP60421 Sean Bechhofer

Federated Search. Frank Bilotto Muse Global Salt Lake City, Utah, USA

O.Curé [1 ] Mashup, Microformats, RDFa and GRDDL

Semantiska webben DFS/Gbg

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Pipe Dreams: Harvesting Local Collections into Primo Using OAI-PMH

Workflow option for getting an existing CONTENTdm collection ready for IM DPLA harvest

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Proposal for Implementing Linked Open Data on Libraries Catalogue

Sindice Widgets: Lightweight embedding of Semantic Web capabilities into existing user applications.

Linked Data. The World is Your Database

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

The type of organization for which you created the collection and the potential user and their needs.

Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model. Tiziana Possemato Casalini Libri

Delivering your special collections online: Digitization and CONTENTdm

Melvyl Webinar UC and OCLC Roadmap Discussion

Semantic Web Fundamentals

Linked data from your pocket: The Android RDFContentProvider

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

SEPA SPARQL Event Processing Architecture

Ontology Servers and Metadata Vocabulary Repositories

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro

Schema org/microdata Exposing Y our Your Data the Web (The Easy Way) Linked Data vs Schema.org: A Town Hall Debate about the Future of Information

Corso di Biblioteche Digitali

Webinar Annotate data in the EUDAT CDI

SECTION 10 EXCHANGE PROTOCOL

Package rrdf. R topics documented: February 15, Type Package

LINKING WEB DATA WEB:

4 th Linked Data on the Web Workshop (LDOW 2011)

Accessing information about Linked Data vocabularies with vocab.cc

Data Integration and Structured Search

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall

EDEN An Epigraphic Web Database of Ancient Inscriptions

Web 2.0, AJAX and RIAs

warwick.ac.uk/lib-publications

The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017

Linked Open Data: a short introduction

From Raw Sensor Data to Semantic Web Triples Information Flow in Semantic Sensor Networks

Corso di Biblioteche Digitali

Introduction to Linked Data

DC Regional Fedora Users Meeting

Transcription:

Library Faculty Presentations Library Faculty/Staff Scholarship & Research 11-7-2012 Linked Data Demystified: Practical Efforts to Transform CONTENTdm Metadata for the Linked Data Cloud Silvia B. Southwick University of Nevada, Las Vegas, silvia.southwick@unlv.edu Cory K. Lampert University of Nevada, Las Vegas, cory.lampert@unlv.edu Follow this and additional works at: https://digitalscholarship.unlv.edu/libfacpresentation Part of the Cataloging and Metadata Commons, and the Databases and Information Systems Commons Repository Citation Southwick, S. B., Lampert, C. K. (2012, November). Linked Data Demystified: Practical Efforts to Transform CONTENTdm Metadata for the Linked Data Cloud. Presentation at Virtual OCLC CONTENTdm User Group Meeting, Las Vegas, NV. Available at: https://digitalscholarship.unlv.edu/libfacpresentation/99 This Presentation is brought to you for free and open access by the Library Faculty/Staff Scholarship & Research at Digital Scholarship@UNLV. It has been accepted for inclusion in Library Faculty Presentations by an authorized administrator of Digital Scholarship@UNLV. For more information, please contact digitalscholarship@unlv.edu.

LINKED DATA DEMYSTIFIED PRACTICAL EFFORTS TO TRANSFORM CONTENTDM METADATA INTO LINKED DATA

PRESENTERS Silvia Southwick Digital Collections Metadata Librarian UNLV Libraries Cory Lampert Head of the Digital Collections Department UNLV Libraries

OUTLINE Why should I care? What is it? Defining Linked Data / Introduction to Linked Data Concepts / Linked Data Principles Technologies & Standards for Linked Data The Linked Data Cloud How? Applying these concepts to digital collection records Anticipated challenges working with CONTENTdm The UNLV Libraries Linked Data Project How could you start working with Linked Data?

LINKED DATA MYTHS My collections are already visible through Google; so who cares This is a topic for catalogers It s too technical / complicated / boring Actually... Linked data is the future of the Web Data will no longer be in silos (catalog, CONTENTdm) Relationships are powerful and worth the effort

HOW DO WE CURRENTLY CREATE OUR DIGITAL COLLECTIONS? Data (or metadata) are encapsulated in records Records are contained in collections Very few links are created within and/or across collections Links have to be manually created Existing links do not specify the nature of the relationships among records This structure hides potential links within and across collections DATA IS TRAPPED!

UNIQUE LOCAL COLLECTIONS, HIDDEN RELATIONSHIPS Example: A search on water in the OCLC collection of collections resulted in 26 collections that are not crosslinked Digital Collections containing records on water California Water Documents Western Waters Digital Library Bear River Watershed Historical Collection The Historic Landscape of Nevada: Development, Water, and Natural Environment Seattle Power Water Supply Collection Western Waters Digital Library: The Columbia River Basin in Oregon

EXPOSED DATA RELATIONSHIPS POWERFUL, RELATED DATA Example: Google Knowledge Graph http://youtu.be/mmql6vgvx-c

A LEGO METAPHOR FOR CREATING LINKED DATA This is the Data Model

Transforming records into data Publishing data Linking data as you search or browse

DEFINING LINKED DATA Linked Data refers to a set of best practices for publishing and interlinking data on the Web Data needs to be machine-readable Linked data (Web of Data) is an expansion of the Web we know (Web of documents)

WEB IN TRANSITION 1. Two types of data: 1. Human-readable documents (email, brochure, report) 2. Machine-readable data (calendar, playlist, spreadsheet) 2. Shopping example 1. A web page ad (document) says dress, color, price, designer 2. But machines cannot extract data to re-use in another application (e.g., spreadsheet to compare prices) 3. RDF new way to specify relationships and transfer context with data across applications: reusable data 4. The time is now to start to evolve our documents into data

TECHNOLOGIES FOR LINKED DATA Linked data is built in the Web architecture (HTTP, URIs) RDF is a data model (not a format) Most common serializations: RDF/XML RDFa RDF is based on triples/statements SPARQL - SPARQL Protocol and RDF Query Language -- is an query language able to retrieve and manipulate data stored in RDF.

WHAT ARE TRIPLES? Triples are expressed as: Examples: subject predicate object Frank Sinatra -- is an entertainer Frank Sinatra knows Jack Entratter

EXAMPLE TRIPLE RDF Introduction to RDF at http://www.linkeddatatools.com/introducing-rdf

PRINCIPLES OF LINKED DATA 1. Use URIs as names for things (people, organizations, artifacts, abstract concepts, etc.) 2. Use HTTP URIs so that people can look up those names 3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL) 4. Include links to other URI so that they users discover other related items (note: RDF Links have types)

THE LINKED DATA CLOUD

CREATING LINKED DATA FROM ORIGINAL RECORD VS. HARVESTED RECORD

ORIGINAL RECORD Title: Café Monico menu, February 19, 1903Category: regular services Restaurant Name: Café Monico (London, England) Additional Information: Advertisement on back and around edges if the menu. Insert lists Indian curries as special on Mondays and Thursdays Graphic Elements: Borders(Ornament areas); Buildings; Photographs Enclosures: daily specials; advertisements Type of restaurant: Non-specialized restaurant Type of menu: `a la carte Meals served: dinner; lunch City: London..

OCLC WORLDCAT LINKED DATA SAME RECORD (HARVESTED)

HOW CAN WE ADDRESS THIS PROBLEM? Create a complementary data structure that would allow dynamic interlinking among data How? Export records from the collections Deconstruct these records by extracting data from them Apply vocabularies Adopt a common model to express data Publish data in a data space (Linked Data Cloud) where links among data are created automatically

EXAMPLES OF RECORDS

TRANSFORMING RECORDS INTO DATA What are possible triples for this photo? <this photo> <created by> <Las Vegas News Bureau> < this photo > <is a> <photograph> < this photo > <is a> <print> < this photo > <depicts> <Frank Sinatra> < this photo > <depicts> <Jack Entratter> -------------- <Frank Sinatra> <knows> <Jack Entratter> <Jack Entratter> <knows> <Frank Sinatra> <Frank Sinatra> <is an> <entertainer> <Jack Entratter> <is a> <theatrical producer> ----------

GRAPHICAL REPRESENTATION OF THE PHOTO TRIPLES

ADDING TRIPLES FROM THE OTHER RECORDS What are the URIs for subjects, predicates and objects?

VOCABULARIES ALERT: Finally a place in the presentation we feel at home! -------------- Vocabularies are specific terms used in RDF statements to describe specific resources. ---------- Vocabulary examples in linked data (Linked Open Vocabulary): DCMI Type Vocabulary Friend of a Friend Vocabulary Geonames MARC Code List for Relators Creative Commons Rights Expression vocabulary Schema.org Many more at: http://lov.okfn.org/dataset/lov/

UNLV LINKED DATA PROJECT Goals: Study the feasibility of developing a single process that would allow the conversion of our collection records into linked data preserving their original expressivity and richness Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web.

PHASES OF THE PROJECT Literature Review Evaluating Technologies Research existing technologies and best practices Develop small experiments with technologies Make decisions of which technologies to adopt, adapt or develop Data preparation Select and prepare records from digital collections to participate in the project Run process for data transformation Publish on the Linked Data Cloud Assess results

DATA PREPARATION Defining vocabularies that will be adopted for predicates Defining types of triples to be created (literal, outgoing links, incoming links, triples that describe related resources, triples that link to descriptions, triples that indicate provenance of the data, etc.) Specifying URIs for new things Identifying potential URIs for external links (e.g., things that already have URIs) Describing data sets that will be published in the linked data cloud

TECHNOLOGY OPTIONS FOR DATA TRANSFORMATION

Type of Data Structured Data (CONTENTdm) Data Preparation RDF-izers for Excel or XML Data Storage Drupal DB Data Source API RDF Store RDF Files Data Publication Drupal RDFa Linked Data Wrapper Linked Data Interface Web Server Linked Data on the Web Adapted from Linked Data Evolving the Web into a Global Data Space by Heath and Bizer

ANTICIPATED CHALLENGES Developing of a single process for transforming records into data because digital collections adopt different metadata schema Creating URIs for all our unique materials Finding ways to associate URIs to things in CONTENTdm Adopting linked data while it is in early stage of development

TIPS TO CONSIDER WHEN CREATING DIGITAL COLLECTIONS METADATA Avoid mixing different types of data in metadata fields Avoid creating aggregated data fields Record URIs whenever available Reinforce use of controlled vocabularies Monitor how CMS are adopting linked data technologies

HOW WE STARTED Created a study group in the Library (members from various areas of the library) Watched webinars on the topic and have discussions after the webinars Created an internal wiki with linked data resources Participated in linked data interest groups Follow the literature on this topic

QUESTIONS? Contact Information: Silvia Southwick Silvia.Southwick@unlv.edu Cory Lampert Cory.Lampert@unlv.edu Department of Digital Collections UNLV Libraries