A service-oriented national e-thesis information system and repository

Similar documents
A service-oriented national e-theses information system and repository

Pandektis: Implementing a repository of greek historical and cultural material with DSpace

OpenData Hackathon Δημόσια, Ανοικτά Δεδομένα H εμπειρία του Εθνικού Κέντρου Τεκμηρίωσης

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

National Documentation Centre Open access in Cultural Heritage digital content

From Open Data to Data- Intensive Science through CERIF

Nuno Freire National Library of Portugal Lisbon, Portugal

VI-SEEM Data Repository. Presented by: Panayiotis Charalambous

SciX Open, self organising repository for scientific information exchange. D15: Value Added Publications IST

How to contribute information to AGRIS

The DART-Europe E-theses Portal

Implementing enhanced OAI-PMH requirements for Europeana

Managing Data in the long term. 11 Feb 2016

SobekCM. Compiled for presentation to the Digital Library Working Group School of Oriental and African Studies

Adding OAI ORE Support to Repository Platforms

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT

15/06/2018 In Out, In Out, And Shake It All About. A Moving Story of Data

Introduction

Registry Interchange Format: Collections and Services (RIF-CS) explained

Comparing Open Source Digital Library Software

is an electronic document that is both user friendly and library friendly

Implementation of OpenAIRE Guidelines for CRIS Managers to Finnish VIRTA Publication Information Service

Persistent identifiers, long-term access and the DiVA preservation strategy

Ing. José A. Mejía Villar M.Sc. Computing Center of the Alfred Wegener Institute for Polar and Marine Research

Online repositories of Eu informations sources. European Documentation Centres: Online repositories of EU information sources

Research Data Edinburgh: MANTRA & Edinburgh DataShare. Stuart Macdonald EDINA & Data Library University of Edinburgh

Igitur Archive: Institutional Repository Utrecht University. May , Martin Slabbertje

Institutional repositories: description of VITAL as an example of a Fedora-based digital assets management system.

WIReDSpace (Wits Institutional Repository Environment on DSpace)

CREATING DIGITAL REPOSITORIES PRESENTED BY CHAMA MPUNDU MFULA CHIEF LIBRARIAN NATIONAL ASSEMBLY OF ZAMBIA


Open Source Software Packages for E-Resource Management

SNHU Academic Archive Policies

ProQuest Dissertations and Theses Overview. Austin McLean and Marlene Coles CGS Summer Workshop, July 2017

Non-text theses as an integrated part of the University Repository

ScienceDirect. Providing an application-specific interface over a CERIF back-end: challenges and solutions. Dragan Ivanović a, Nikos Houssos b *

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Open source software for building open access repositories. Imma Subirats Coll knowledge and information management officer FAO of the United Nations

Building for the Future

CLARIN for Linguists Portal & Searching for Resources. Jan Odijk LOT Summerschool Nijmegen,

Contents 1. What is Digitalback Books Who can access Digitalback Books How do I access Digitalback Books

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Building a Digital Repository on a Shoestring Budget

Citation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton

Survey of Existing Services in the Mathematical Digital Libraries and Repositories in the EuDML Project

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

StratusLab Cloud Distribution Installation. Charles Loomis (CNRS/LAL) 3 July 2014

MINT METADATA INTEROPERABILITY SERVICES

Citation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton

Functional Requirements - DigiRepWiki

Data is the new Oil (Ann Winblad)

CONTENTdm & The Digital Collection Gateway New Looks for Discovery and Delivery

TRANSFORMING CUSTOMS THROUGH TECHNOLOGY

Site Administrator Help

Is it possible to use your institutional repository as a records management system?

Its All About The Metadata

ScholarOne Manuscripts. Publisher Level Reporting Guide

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Open Archives Forum - Technical Validation -

Appendix REPOX User Manual

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

The Design of a DLS for the Management of Very Large Collections of Archival Objects

Greenstone Publications

The Necessity of a New Culture of Electronic Publishing C A S L I N

The Open Archives Initiative in Practice:

Oracle VM Server for x86: Administration

May 2012 Oracle Spatial User Conference

OnCommand Insight 7.1 Planning Guide

Usability Inspection Report of NCSTRL

Repositório institucional do IST

DataSTORRE Deposit Guide

USER MANUAL. for. Examination / Result Review Application Functions for Student (MyUPSI PORTAL) Prepared By:

OAI-PMH for dummies: how to build an institutional repository with limited resources?

Backup Solution. User Guide. Issue 01 Date

Building Virtual Collections

Showing it all a new interface for finding all Norwegian research output

WEB-BASED COLLECTION MANAGEMENT FOR LIBRARIES

Software Requirements Specification for the Names project prototype

Omeka. iskills Workshop February 16, Leslie Barnes, Digital Scholarship Librarian

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

OnCommand Insight 7.2

BPMN Processes for machine-actionable DMPs

SciX Open, self organising repository for scientific information exchange. D9a: Architecture Synthesis IST

Preservation Planning in the OAIS Model

Interoperability Framework Recommendations

Project Horizon Technical Overview. Bob Rullo GM; Presentation Architecture

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *

A Comparative Study of the Search and Retrieval Features of OAI Harvesting Services

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

Building a Digital Library Software

Ubiquitous and Open Access: the NextGen library. Edmund Balnaves, Phd. Information Officer, IFLA IT Section

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade

Grant agreement no N4U. NeuGRID for you: expansion of NeuGRID services and outreach to new user communities

The OAIS Reference Model: current implementations

NARCIS: The Gateway to Dutch Scientific Information

Archive II. The archive. 26/May/15

Harvesting of Additional Metadata Schema into DSpace through OAI-PMH: Issues and Challenges

Working with Islandora

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

One Body, Many Heads for Repository-Powered Library Applications

Transcription:

Title of the presentation Date 1 A service-oriented national e-thesis information system and repository Nikos Houssos Panagiotis Stathopoulos Ioanna Sarantopoulou Dimitris Zavaliadis Evi Sachini National Documentation Centre (EKT) 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Agenda Title of the presentation Date 2 The Hellenic National Archive of Doctoral Dissertations (HEDI) - Introduction System overview and architecture Design choices and technical challenges Repositories interoperating in a SOA environment

Hellenic National Archive of PhD Theses Title of the presentation Date 3 By law - all theses awarded by Greek Universities Theses of Greek scholars for PhDs obtained in foreign universities In operation at EKT (print archive) since 1985: submission of theses by universities to EKT, centralised cataloguing in UNIMARC Open online access to metadata and full-text since late 90 s Z39.50 compliant bibliographic system (ARGO - http://argo.ekt.gr) 24000 theses in total (since 1901) 25% theses not yet digitized 1200-1400 arriving every year

HEDI Goals for new version Title of the presentation Date 4 Make theses available through DSpace Develop administration application to support and monitor workflows for processing incoming material Create separate authority servers Migrate to an open source software infrastructure from operating system to repository software platform Make the whole system work with individual components in a SOA configuration

Title of the presentation Date 5 System architecture and overview EKT digitization expert Back-end digital content processing Extended ETD-MS app profile and full text SOAP External systems (e.g. DART Europe) OAI-PMH interface DSpace Repository Open access to theses User-friendly environment Submitted material, electronic and print Theses admin system Internet user UNIMARC records Z39.50 Extended Service ARGO System Access / download / transformation Of bibliographic records EKT librarian REST Authorities Server Z39.50 interface

The administration application functions Title of the presentation Date 6 Record incoming material and monitor its status throughout its lifecycle both for print and electronic material Support and monitor internal workflows for processing digital content Reporting on aspects like throughput of internal workflows, numbers of material in each processing state, etc. Cataloguing and export in UNIMARC Production of appropriate UNIMARC and DC records and update of both ARGO and DSpace repository

Title of the presentation Date 7 Benefits of introducing DSpace User friendly interface, much better browsing facilities Easier compatibility with aggregators / harvesters (OAI-PMH), persistent identifier systems, search engines, web 2.0 systems, etc.

The DSpace e-thesis e repository Title of the presentation Date 8 Available at http://phdtheses.ekt.gr Contributor to DART Europe via OAI-PMH harvesting Main customisations: Web services for CRUD operations to metadata records Service for uploading digital material to DSpace Custom application profile based on ETD-MS Frascati-based classification based initially on department / lab / research group information Enhance search to support stress independent search Various UI customisations

Title of the presentation Date 9 Implementation technologies Groovy / Grails great web applications framework, great integration with Java and Spring Lightweight workflow management engine built inhouse Models workflows as finite state machines Uses Spring dependency injection for configuration SOAP and REST web services Ruby implementation of the Z39.50 Database Update Extended used to update ARGO Custom solution based on Spring framework to upload digital file(s) to DSpace

Title of the presentation Date 10 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Title of the presentation Date 11 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Title of the presentation Date 12 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Title of the presentation Date 13 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Title of the presentation Date 14 5 TH INTERNATIONAL CONFERENCE ON OPEN REPOSITORIES, Madrid, July 2010

Title of the presentation Date 15

Title of the presentation Date 16

Title of the presentation Date 17

Title of the presentation Date 18

Title of the presentation Date 19

Online reading Title of the presentation Date 20 An alternative way to present content opening a range of possibilities Better promotion of the overall system - back links to repository Put burden on digital content handling Using Internet Archive Book Reader On-going work Full-text search with hit highlighting Bookmarking Detailed access statistics navigation / bookmarking

Title of the presentation Date 21

Title of the presentation Date 22

Title of the presentation Date 23

Design choices and technical challenges Title of the presentation Date 24 (1) Adoption of a SOA approach with DSpace being an crucial distinct part of the overall architecture Continuation of UNIMARC cataloguing support of the ARGO bibliographic portal Adoption of software stacks consisting of open source or home grown components

Design choices and technical challenges Title of the presentation Date 25 (2) Support of distributed transactions System should gracefully recover from update failures Ad hoc implementation of transaction behaviour Cleaning and enhancing metadata records for theses Data migration

Title of the presentation Date 26 The underlying infrastructure Gradual migration of modules in parallel with the software development process From closed source software, & proprietary hardware to a fully open source stack using a virtualisation platform over commodity hardware. From Filenet, Oracle 9, and 2 proprietary Operating Systems to DSpace, Postgress, and the CentOS distribution End result: Fully open source, fully virtualised. Virtualisation: computing and storage resources allocated, are flexible and dynamic. Includes each VM memory, number of processors, disk space, without reboot for many of them Resource pool of up to 48 processors and160gb memory to be allocated for the servers comprising the system, TBs of disk space

Resources, Title of the presentation Date 27 monitoring and mgt Resources allocated based on observed performance No overdimensioning, need to have resources when needed monitor and control loop (broadly basde on IT Service Lifecycle definition) Monitoring system for the full range of each of the platform elements, tiers, software modules in all levels: From hosts to NFS fileshares and DBs and random s pages content sensitive monitoring SOA puts stress on that, large number of elements that must be monitored Virtualization: added value, Green Hedi system, can check at http://code.google.com/p/e-vigr/ (OK it s the whole platform not just HEDI!)

Further work Title of the presentation Date 28 Application of automated metadata extraction from full-text theses to assist cataloguing Integration of digital content back-end processing workflows full migration from a commercial to an open source environment Automation of the procedure of digital file quality checking Interconnection with CRIS systems linking theses with authors, organisations and projects which have specifically funded PhD theses

Thank you Title of the presentation Date 29 More information: Nikos Houssos / nhoussos AT ekt.gr Panagiotis Stathopoulos / pstath AT ekt.gr