WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research code4lib 2006 Oregon State University, Corvallis, Oregon 15 February 2006

Similar documents
Expected and Unexpected Synergies

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

Virtual Collections. Challenges in Harvesting and Transforming Metadata from Harvard Catalogs for Topical Collections

Materials Data Curation System

1 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Terminology Services. Diane Vizine-Goetz Senior Research Scientist OCLC Research

SRW and CQL; Open Source at LC

XML for Java Developers G Session 8 - Main Theme XML Information Rendering (Part II) Dr. Jean-Claude Franchitti

Agenda. Summary of Previous Session. XML for Java Developers G Session 6 - Main Theme XML Information Processing (Part II)

Joining the BRICKS Network - A Piece of Cake

Harvester Service Technical and User Guide 5 June 2008

Comparing Open Source Digital Library Software

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

Building The Czech Digital Mathematics Library upon DSpace System

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 14 Database Connectivity and Web Technologies

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef

Metadata and Encoding Standards for Digital Initiatives: An Introduction

INTRO INTO WORKING WITH MINT

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries

com Spring + Spring-MVC + Spring-Boot + Design Pattern + XML + JMS Hibernate + Struts + Web Services = 8000/-

OAI-PMH. DRTC Indian Statistical Institute Bangalore

KINGS COLLEGE OF ENGINEERING 1

Working with the Seagull Framework. By Demian Turner, Seagull Systems

Building for the Future

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Distributed Multitiered Application

Persistent identifiers, long-term access and the DiVA preservation strategy

Chapter 3. Architecture and Design

D4.8 Report on semantic interoperability with Europeana

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Making DSpace Your Own

Dissemination Web Service. Programmatic access to Eurostat data & metadata

Registry Interchange Format: Collections and Services (RIF-CS) explained

Resilient Linked Data. Dave Reynolds, Epimorphics

Appendix REPOX User Manual

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.

ORCA-Registry v2.4.1 Documentation

Copyright 2014 Blue Net Corporation. All rights reserved

Fedora. CS 431 April 17, 2006 Carl Lagoze Cornell University. Acknowledgements: Sandy Payette (Cornell)

Florida Coastal Everglades LTER Program

Fedora and GSearch in a Research Project about Integrated Search Open Repositories 2009

DISCOVER THE POWER OF VITAL The Solution for Your Digital Collection Management

Network Information System. NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February Mark Servilla

Copyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

XML. Objectives. Duration. Audience. Pre-Requisites

XML Metadata Standards and Topic Maps

If you build it, will they come? Issues in Institutional Repository Implementation, Promotion and Maintenance

Community Platform for Delivering Library Services

IVOA Registry Interfaces Version 0.1

RVOT: A Tool For Making Collections OAI-PMH Compliant

The OCLC Metadata Switch Project

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE

Metadata Standards & Applications. 7. Approaches to Models of Metadata Creation, Storage, and Retrieval

B. Assets are shared-by-copy by default; convert the library into *.jar and configure it as a shared library on the server runtime.

Delivery Options: Attend face-to-face in the classroom or via remote-live attendance.

BIBLID (2004) 93:1 pp (2004.6) 209. NBINet NBINet 92

Metadata Harvesting Framework

Programming Technologies for Web Resource Mining

Introducing Manakin: Overview & Architecture. Scott Phillips, Cody Green, Alexey Maslov, Adam Mikeal, and John Leggett

Creating a National Federation of Archives using OAI-PMH

7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML

Institutional repositories: description of VITAL as an example of a Fedora-based digital assets management system.

Research Data Repository Interoperability Primer

JBPM Course Content. Module-1 JBPM overview, Drools overview

Illustration 1: The Data Page builder inputs specifying the model variable, page and mode

Apache Wink Developer Guide. Draft Version. (This document is still under construction)

MuseKnowledge Hybrid Search

Delivery Options: Attend face-to-face in the classroom or remote-live attendance.

Digital Libraries: Interoperability

Semantic Extensions to Defuddle: Inserting GRDDL into XML

Migrating traditional Java EE applications to mobile

How to Build a Digital Library

Building a missing item in INSPIRE: The Re3gistry

Software Requirements Specification for the Names project prototype

Alter Package Schema Name Package Name Compile Debug Package Specification Body

Intro to XML. Borrowed, with author s permission, from:

XML Update. Royal Society of the Arts London, December 8, Jon Bosak Sun Microsystems

Local Metadatamanagement in a global environment

Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives

Developing an Institutional Repository Service in Chinese Academy of Sciences

References differences between SVG 1.1 Full and SVG 1.2 Tiny

Signed metadata : method and application

Develop Mobile Front Ends Using Mobile Application Framework A - 2

Cross-domain Metadata Interoperability for Integrated Information Services

GeoNetwork opensource

Guidelines for preparing a Z39.50/SRU target to enable metadata harvesting

what and why DSpace AddOn and Component Management System

RDF VISUALIZER. 3/4/ th CRM-SIG meeting M.Doerr, K. Doerr, K.Petrakis, L.Harami, N.Minadakis

BRA BIHAR UNIVERSITY, MUZAFFARPUR DIRECTORATE OF DISTANCE EDUCATION

An RDF NetAPI. Andy Seaborne. Hewlett-Packard Laboratories, Bristol

2 Apache Wink Building Blocks

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

WSDL versioning. Facts Basic scenario. WSDL -Web Services Description Language SAWSDL -Semantic Annotations for WSDL and XML Schema

Increasing access to OA material through metadata aggregation

A beginners guide to MarcEdit and beyond the editor: Advanced tools and techniques for working with metadata

XML and XSLT. XML and XSLT 10 February

Transcription:

WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research jyoung@oclc.org code4lib 2006 Oregon State University, Corvallis, Oregon 15 February 2006

WikiD Overview WikiD is an open-source online database for servicing multiple data collections involving arbitrary schemas of XML J2EE webapp Customizable and extensible Displays (XSL) Crosswalks (XSL) Services (Java abstractions) Built using modular standards-based APIs SRU/SRU Update OpenURL 1.0 OAI-PMH 2.0 RSS 2.0

OAI/WikiD Comparisons OAI-PMH Identifiable distributed repositories oai:gsafd.oclc.org:localid WikiD Identifiable local collections CollectionGsafd Identifiable items oai:gsafd.oclc.org:gsafd000001 Multiple metadataformats Identifiable items CollectionGsafd:GSAFD000001 Multiple recordformats OAI Registry at UIUC oai:urlregistry.uiuc.edu:gsafd.oclc.org CollectionCollection CollectionCollection:CollectionGsa fd Fixed OAI verbs Repository-level branding Read-only Access latest records only Extensible wiki-style actions Collection-level XSL Read/write Access historical records

Primary Bootstrap Collections CollectionCollection Collection items: descriptions of collections CollectionExternalSchemas Collection items: descriptions of XML Schemas that are referenced by items in CollectionCollection Schema title (for use in pull-down menus) Namespace URI Schema URL CollectionWikiPages Collection items: wiki markup pages Provides the user interface for WikiD

Persistent URIs All items in all collections are referred to internally by URIs info:wikid/collectioncollection:collectiongsafd info:wikid/collectiongsafd info:wikid/collectiongsafd:gsafd000001 The WikiD API can recognize and service shorthand forms CollectionCollection:CollectionGsafd CollectionGsafd CollectionGsafd:GSAFD000001 Persistent in the same sense that OAI identifiers persist

Content Storage Overview Updates to items in WikiD are written to the fileserver according to the collection s native format But WikiD can t intelligently index arbitrary XML, so it creates and indexes metadata instead Metadata fields: (with their approximate OAI equivalents) Persistent item identifier ( local-identifier ) Collection identifier ( namespace-identifier ) Sort/brief display value (e.g. dc:title or MARC 245) Date created Date last updated (oai:datestamp) Relative URL containing the native record (dc:identifier) The metadata records for all items in all collections get stored in a single SRU database (and also to the fileserver)

Content Storage Detail: WebEdit A lightweight open-source web service to transform HTML form submittals into a target XML format 1. HTML form fields get encoded in a linear XML structure <document> <fieldname1>fieldvalue1</fieldname1> <fieldname2>fieldvalue2</fieldname2> </document> 2. This document gets transformed into a metadata record by an XSL Stylesheet indicated by a hidden form field 3. Included in this stylesheet are templates to construct the native record and Base64-encode it as an element in the metadata record

Content Storage Detail: SRU Update A standards-based web service protocol developed by the SRU community to update XML records in an SRU database Ralph Levan s open-source implementation automatically handles the strange version-controlled native/metadata bifurcation described earlier Yanks the Base64-encoded native record from the metadata record, decodes it, and stores it on the fileserver Inserts the relative URL of the native record back into the metadata record Touches the metadata record s date fields Stores/indexes the metadata record in the database

GSAFD Collection Description (Metadata) Persistent item identifier CollectionGsafd Collection identifier CollectionCollection Sort/brief display value GSAFD Thesaurus Relative URL containing the native record 2006/01/27/20/inf_3awikid_2fCollectionCollectio n_3acollectiongsafd_5f200601272016502020

GSAFD Item (Metadata) Persistent item identifier GSAFD000001 Collection identifier CollectionGsafd Sort/brief display value (taken from the MARC 245 field) Adventure fiction Relative URL for the native file 2005/06/10/23/inf_3awikid_2fCollectionGsafd_3 agsafd000001_5f20050610232847520

The Need for OpenURL 1.0 SRU is great for interactive searching, but sometimes we want to provide services involving more complex operations Dereference the native items in the metadata records Perform complex query chains Include a pull-down menu of alternative record formats obtained from the collection description record on an item-level record display Result transformations Crosswalks SRU to OAI-PMH SRU to RSS OpenURL 1.0 gives us a single consistent API for performing any and all services that reference these items

WikiD API Although the OpenURL 1.0 API can mediate all conceivable functionality, the URLs involved are hideous http://localhost:8080/wikid/resolver?url_ver=z39.88-2004&url_ctx_fmt=info%3aofi/fmt%3akev%3amtx%3actx&ct x_enc=info%3aofi/enc%3autf- 8&rft_id=info%3Awikid/CollectionWikiPages%3AFrontPage &svc_dat=action%3dedit&rfr_id=info%3asid/oclc.org%3are ferrer/wikid In contrast, the WikiD Servlet accepts wiki-style URLs http://localhost:8080/wikid/frontpage?action=edit WikiD performs a simple mechanical mapping into the OpenURL equivalent and hands it off to the OpenURL resolver Human users will generally prefer the WikiD API, but automated clients might prefer to access the OpenURL, SRU, WebEdit, and SRU Update APIs directly

Servlets and data flows WikiD OpenURL 1.0 WebEdit SRW/U Update Pears DB Engine File system

Reference Links WikiD project page http://www.oclc.org/research/projects/wikid/ WikiD demo http://alcme.oclc.org/wikid/