Design of The PORTA EUROPA Portal (PEP) Pilot Project

Design of The PORTA EUROPA Portal (PEP) Pilot Project
Marco Pirri, Maria Chiara Pettenati
Electronics and Telecommunications Department, University of Florence (Italy)
Library, European University Institute (EUI), San Domenico di Fiesole, Florence (Italy)
OAForum Workshop, 6-7 Dec 2002

The aim of the project
- To conceive a federation system to handle heterogeneous data sources, in the framework of the PEP project
- Goal: to implement a PEP pilot project on a restricted domain: history

The problem of heterogeneous data sources
- Different formats of data (text, audio, video)
- Different types within the same format (for example, in text format: PDF, DOC, TXT)
- Different formats to describe data (metadata)
- Different protocols to retrieve data and build services

Porta Europa Portal pilot project
- Purpose: to present the EUI (European University Institute) on the Web as a leader in the "European debate" and as a natural gateway to high-quality research information in the areas of law, political and social sciences, economics and history
- What is it? A specific portal integrated into the EUI website, offering opportunities to link the currently dispersed information sources
- The pilot project: the Library will create the first prototype based on three history projects

Voices on Europe (Resource 1/3)
- An oral archive of interviews with important European personalities: http://wwwarc.iue.it/webpub/
- Details: Access database with 5 representative tables; input made by staff
- Today: interview papers available in JPEG format (on CD-ROM only); cassettes available only on request to the Archive
- Future: interview papers available in PDF format (online) and audio available (online, with authorised access)

VL History Project (Resource 2/3)
- The WWW VL History Project is part of the Virtual Library, the oldest catalogue of the Web, started by Tim Berners-Lee: http://vlib.iue.it/
- Run by a loose confederation of volunteers, who compile pages of key links for particular areas in which they are expert
- Today: 11 main projects and 6 subprojects are involved; all pages are in HTML and no database is implemented
- Future: create a database, transfer the old data, build new and homogeneous output, and provide an administration interface

Biblio, Innopac (Resource 3/3)
- Biblio is the main catalogue of the EUI Library (more than 260,000 bibliographic records): http://www.iue.it/lib/catalogue/
- Based on the Innopac automation system
- Today: staff administer the catalogue, but there are still some limits on extracting and processing data from it
- Future: Innopac module, OAI compliant

Summary of the three resources
- Voices on Europe: oral archive of interviews; Access database
- WWW Virtual Library History Project: website with a collection of links selected by experts in fields of history, organized by category
- Biblio, Innopac: main catalogue of the Library of the European University Institute; private database

Analysis of the three resources

| Characteristic of the archive | Voices on Europe | Virtual Library | Biblio |
|---|---|---|---|
| Data objects / records | Audio, tran. | HTML pages | Library catalog |
| Collection of metadata structures / format | Archive is organized in an Access DB | Archive is structured in web pages | Proprietary DB, USMARC |
| Collection of services | Access: SQL queries, DB staff management; no log-on or statistics | Access: through the Web, managed by project admin.; no log-on or statistics | Information management through the INNOPAC Library Automation System |
| Domain focus | European history | European history | European history |
| Community of users | Everybody for information search; restricted access for full document consultation; admin. | Everybody for information search; restricted access for full document consultation; admin. | Everybody for information search; restricted access for full document consultation; admin. |

The adopted approach
- Metadata: Dublin Core
  - International standard
  - Encoded in XML
  - Extensible
- Protocol: Open Archives Initiative
  - Data Providers and Service Providers
  - Harvesting (retrieving data from repositories)
- More details on the project home page
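As an illustration of harvesting, the following is a minimal sketch of how a Service Provider might build OAI-PMH request URLs. The base URL and the helper name `build_request` are hypothetical (the slides do not give the repository endpoint); the six verbs and the `metadataPrefix` and `from` arguments come from the OAI-PMH 2.0 protocol.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the slides do not give the repository address.
BASE_URL = "http://example.org/oai"

# The six request verbs defined by OAI-PMH 2.0.
VERBS = {"Identify", "ListMetadataFormats", "ListSets",
         "ListIdentifiers", "ListRecords", "GetRecord"}

def build_request(verb, **args):
    """Return the HTTP GET URL for one OAI-PMH request."""
    if verb not in VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    params = {"verb": verb}
    # "from" is a Python keyword, so callers pass from_ instead.
    for key, value in args.items():
        params[key.rstrip("_")] = value
    return f"{BASE_URL}?{urlencode(params)}"

# Ask for all Dublin Core records changed since a given date.
url = build_request("ListRecords", metadataPrefix="oai_dc", from_="2002-12-01")
print(url)
```

A harvester would issue such a request periodically and store the returned records for its own search services.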

Meta Resource Card

| DC element | Voices on Europe | VL History Project | Biblio Innopac |
|---|---|---|---|
| Title | Interviewee's name | Title | Title |
| Creator | Name of interviewer | Author | Author |
| Subject | Level 1,2,3 (eurovoc) | Type 3 | Subject |
| Rights | User Profile | Free | User Profile |

User Profile

| Function | User |
|---|---|
| General administration (information management) | Administrator and project leaders |
| Information search | Public |
| Full information access | Internal users (IUE members, professors, students, etc.) |
| Restricted information access (the restriction is due to property rights on some resources contained in the archives) | External users, groups of users |
| Personalised services | Registered users |

Architecture
- Layered structure: each layer provides services to the layer above
- OAI federation
- Database
- Independent data sources

Data Source Layer
- Voices on Europe: oral archive of interviews; Access database
- WWW Virtual Library History Project: website with a collection of links selected by experts in fields of history, organized by category
- Biblio, Innopac: main catalogue of the Library of the European University Institute; private database

Adapter Layer
- Extraction tools, SQL queries
- Extraction of data fields: automatic process
- Update frequency: at intervals (every day/week/...)
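A minimal sketch of such an extraction step is given here. An in-memory SQLite database stands in for the Access database, and the table name, the simplified column names and the placeholder row are assumptions made only for illustration; the actual extraction tools were still work in progress.

```python
import sqlite3

# Assumption: in-memory SQLite stands in for the Access database.
conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE interviews (
    reference_code TEXT, surname TEXT, forename TEXT,
    interviewer TEXT, language TEXT)""")
# Placeholder row, not real archive data.
conn.execute("INSERT INTO interviews VALUES (?, ?, ?, ?, ?)",
             ("REF-001", "Surname", "Forename", "Interviewer", "Language"))

def extract_fields(connection):
    """Run the extraction query and return one dict per interview."""
    connection.row_factory = sqlite3.Row  # rows become name-addressable
    rows = connection.execute(
        "SELECT reference_code, surname, forename, interviewer, language "
        "FROM interviews ORDER BY reference_code")
    return [dict(row) for row in rows]

records = extract_fields(conn)
print(records)
```

Running the same function from a scheduler (daily or weekly) would give the update frequency mentioned on the slide.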

Federation Layer & User Interface
- Meta Resource Card that maps the extracted fields
- Management of the OAI repository
- Implementation of the OAI-PMH protocol
- Communication with the user interface (Service Provider)
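One way the federation layer could turn a mapped record into the `oai_dc` payload of an OAI-PMH response is sketched below. The function name `to_oai_dc` and the placeholder values are hypothetical; the two namespace URIs are the standard ones for `oai_dc` and Dublin Core 1.1.

```python
import xml.etree.ElementTree as ET

OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC)
ET.register_namespace("dc", DC)

def to_oai_dc(record):
    """Serialize a dict of Dublin Core elements as an oai_dc:dc element."""
    root = ET.Element(f"{{{OAI_DC}}}dc")
    for element, value in record.items():
        if value:  # unused elements (empty or None) are simply omitted
            child = ET.SubElement(root, f"{{{DC}}}{element}")
            child.text = value
    return ET.tostring(root, encoding="unicode")

# Placeholder values, as on the GetRecord slide.
xml_text = to_oai_dc({"title": "titolo1", "creator": "autore", "source": None})
print(xml_text)
```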

Eui, OAI Repository: XML response (request: GetRecord)

    <?xml version="1.0" encoding="UTF-8"?>
    <OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" ...>    [protocol version (OAI-PMH 2.0), protocol namespace declarations]
      ...    [response date, metadata prefix]
      <GetRecord>    [response to GetRecord request]
        ...    [header (identifier, date, set)]
        <metadata>    [metadata]
          <oai_dc:dc ...    [record namespace declarations]
              ... http://www.openarchives.org/OAI/2.0/oai_dc.xsd">    [schema location]
            <dc:title>titolo1</dc:title>    [Dublin Core metadata]
            <dc:creator>autore</dc:creator>
            <dc:subject>argomento</dc:subject>
            <dc:description>descrizione</dc:description>
            <dc:publisher>editore</dc:publisher>
            <dc:date>data</dc:date>
            <dc:type>tipo</dc:type>
            <dc:format>formato</dc:format>
            <dc:identifier>identificativo</dc:identifier>
            <dc:source>source</dc:source>
            <dc:language>lingua</dc:language>
            <dc:relation>relation</dc:relation>
            <dc:coverage>coverage</dc:coverage>
            <dc:rights>diritti</dc:rights>
          </oai_dc:dc>
        </metadata>    [closing tags]
      </GetRecord>
    </OAI-PMH>
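A Service Provider has to read such a response back. The sketch below parses a small, complete GetRecord response with Python's standard library. The sample document, its identifier `oai:example.org:1` and the function name `parse_get_record` are assumptions for illustration; the `record` and `header` wrappers follow the OAI-PMH 2.0 response structure.

```python
import xml.etree.ElementTree as ET

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical response with placeholder values.
SAMPLE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <GetRecord>
    <record>
      <header>
        <identifier>oai:example.org:1</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>titolo1</dc:title>
          <dc:creator>autore</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
"""

def parse_get_record(xml_text):
    """Return (identifier, {dc element name: [values]}) for one record."""
    root = ET.fromstring(xml_text)
    record = root.find("oai:GetRecord/oai:record", NS)
    identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
    fields = {}
    for child in record.find("oai:metadata/oai_dc:dc", NS):
        name = child.tag.split("}", 1)[1]  # drop the "{namespace}" part
        fields.setdefault(name, []).append(child.text)
    return identifier, fields

identifier, fields = parse_get_record(SAMPLE)
print(identifier, fields)
```

Values are collected in lists because every Dublin Core element may be repeated within a record.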

Conclusion
- Current state
  - Analysis of resources (data source layer)
  - Meta Resource Card
  - PEP architecture
  - Test of the OAI repository (federation layer)
- Work in progress
  - Extraction tools (adapter layer)
  - User interface (OAI Service Provider)
- PEP homepage: http://www.iue.it/personal/staff/pirri

PEP pilot project home page: http://www.iue.it/personal/staff/pirri

Voices on Europe Mapping

| Dublin Core element | Voices on Europe |
|---|---|
| Title | Interviewee's surname |
| Creator | Name of interviewer |
| Subject | Level 1,2,3 (eurovoc) |
| Description | Number of pages |
| Publisher | Eui |
| Contributor | |
| Date | Date of recording |
| Type | Audio/Text |
| Format | Pdf |
| Identifier | Url |
| Source | |
| Language | Language |
| Relation | Additional material |
| Coverage | |
| Rights | User Profile |

Virtual Library History Project Mapping

| Dublin Core element | Virtual Library |
|---|---|
| Title | Title |
| Creator | Author |
| Subject | Type 3 |
| Description | Abstract |
| Publisher | Type 1 |
| Contributor | |
| Date | Date of input |
| Type | Text |
| Format | Html |
| Identifier | Url |
| Source | |
| Language | English |
| Relation | |
| Coverage | |
| Rights | free |

Biblio Innopac Mapping

| Dublin Core element | Biblio |
|---|---|
| Title | Title |
| Creator | Author |
| Subject | Subject |
| Description | Note |
| Publisher | Imprint |
| Contributor | |
| Date | Cat Date |
| Type | Text |
| Format | Pdf |
| Identifier | Isbn |
| Source | |
| Language | Lang |
| Relation | |
| Coverage | |
| Rights | User Profile |

Mapping Summary: Meta Resource Card

| Dublin Core element | Voices on Europe | Virtual Library | Biblio |
|---|---|---|---|
| Title | Interviewee's surname | Title | Title |
| Creator | Name of interviewer | Author | Author |
| Subject | Level 1,2,3 (eurovoc) | Type 3 | Subject |
| Description | Full text interview | Abstract | Note |
| Publisher | Eui | Type 1 | Imprint |
| Contributor | | | |
| Date | Date of recording | Date of insertion | Cat Date |
| Type | Video/Audio/Text | Text (Html) | Text |
| Format | Pdf | Html | Pdf |
| Identifier | Url | Url | Isbn |
| Source | | | |
| Language | Language | English | Lang |
| Relation | Additional material | | |
| Coverage | | | |
| Rights | User Profile | free | User Profile |

Voices on Europe: table Interviews
Fields: Reference code; Collection title; Interviewee's surname; Interviewee's forename; Title; Date of birtd; Place of birtd; Nationality; Sex; Biographical notes; Type of recorder; Total number of tape; Type of format; Type of tape; Speed; Clearance Cassettes; Copyright agreement; Individual consultation; Public broadcast; Partial or entire publication; Pages; Topicsen; Full name; imgfo; appo1; Date(s) of recording (from); Date(s) of recording (to); Location of interview; Name of interviewer; Language; Original or copy; Transcription; Number of pages; Additional material; Clearance Transcription; Partial or entire reproduction; Closed until; Names of persons; Comments; Topics; appo2; appo3; appo4; Pagesen; tmp; dd1; mm1; yy1; National program

Fields Description
- Reference code: the code that identifies the interview
- Interviewee's surname: the surname of the person interviewed
- Interviewee's forename: the forename of the person interviewed
- Date(s) of recording (from): the start date of the interview
- Name of interviewer: the name of the person who conducted the interview
- Language: the language of the person interviewed
- Number of pages: the number of pages where the person interviewed is mentioned
- Additional material: the additional material accompanying the interview

Voices on Europe: table Evoc and description
- Fields: Eurovoc; Language; level1; level2; level3; level4; level5; todos
- Level 1, 2, 3: Eurovoc thesaurus. This European multilingual listing of expressions has been developed for one-to-one indexing of documentary information from the European institutions

Dublin Core Fields Mapping

Fields extracted from tables
1) Reference code: Identifier
2) Interviewee's surname and name: Title
3) Date(s) of recording (from): Date
4) Name of interviewer: Creator
5) Language: Language
6) Number of pages: Description
7) Additional material: Relation
8) Level 1,2,3 (eurovoc): Subject

Fixed fields
9) EUI: Publisher
10) Audio/Text: Type
11) Pdf: Format
12) User Profile: Rights

Fields not used
13) Contributor
14) Source
15) Coverage
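The mapping above can be sketched as a small function. The function name `map_interview`, the dict-based row and the placeholder values are assumptions for illustration; the column names and the target Dublin Core elements are the ones listed on the slides.

```python
# Constant values listed under "Fixed fields" (items 9 to 12).
FIXED_FIELDS = {
    "publisher": "EUI",
    "type": "Audio/Text",
    "format": "Pdf",
    "rights": "User Profile",
}

def map_interview(row, eurovoc_levels):
    """Map one Interviews row (dict keyed by column name) to Dublin Core."""
    surname = row["Interviewee's surname"]
    forename = row["Interviewee's forename"]
    record = {
        "identifier": row["Reference code"],
        "title": f"{surname} {forename}",
        "date": row["Date(s) of recording (from)"],
        "creator": row["Name of interviewer"],
        "language": row["Language"],
        "description": row["Number of pages"],
        "relation": row["Additional material"],
        # Levels 1 to 3 of the Eurovoc thesaurus become repeated subjects.
        "subject": list(eurovoc_levels[:3]),
    }
    record.update(FIXED_FIELDS)
    # Contributor, Source and Coverage are not used, so they are not emitted.
    return record

# Placeholder row, not real archive data.
row = {
    "Reference code": "REF-001",
    "Interviewee's surname": "Surname",
    "Interviewee's forename": "Forename",
    "Date(s) of recording (from)": "2002-12-06",
    "Name of interviewer": "Interviewer",
    "Language": "Language",
    "Number of pages": "10",
    "Additional material": "None",
}
dc_record = map_interview(row, ["level1", "level2", "level3", "level4"])
print(dc_record)
```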

Voices on Europe: table Interviews, normalization

| Dublin Core element | Normalization |
|---|---|
| Title | No |
| Creator | No |
| Subject | No |
| Description | No |
| Publisher | No |
| Contributor | Not used |
| Date | Yes: UTC format, ISO 8601 (OAI) |
| Type | Yes: Audio / Text |
| Format | Yes: Pdf / mp3 |
| Identifier | No |
| Source | Not used |
| Language | No |
| Relation | No |
| Coverage | Not used |
| Rights | Yes: User Profile / free |
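Date normalization could look like the following sketch. The input format `dd/mm/yyyy` and both function names are assumptions, since the slides do not say how dates are stored; the output forms (`YYYY-MM-DD` and `YYYY-MM-DDThh:mm:ssZ`) are the ISO 8601 UTC forms used by OAI-PMH.

```python
from datetime import datetime, timedelta, timezone

def normalize_date(raw, input_format="%d/%m/%Y"):
    """Convert a stored date string to ISO 8601 day granularity."""
    return datetime.strptime(raw, input_format).date().isoformat()

def utc_datestamp(moment):
    """Format a timezone-aware datetime as a UTC datestamp."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")

print(normalize_date("07/12/2002"))

# 10:30 at UTC+1 is 09:30 in UTC.
local = datetime(2002, 12, 6, 10, 30, tzinfo=timezone(timedelta(hours=1)))
print(utc_datestamp(local))
```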