Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface

Similar documents
Mokka, main guidelines and future

Syrtis: New Perspectives for Semantic Web Adoption

Setup of epiphytic assistance systems with SEPIA

QuickRanking: Fast Algorithm For Sorting And Ranking Data

Blind Browsing on Hand-Held Devices: Touching the Web... to Understand it Better

KeyGlasses : Semi-transparent keys to optimize text input on virtual keyboard

The Connectivity Order of Links

BoxPlot++ Zeina Azmeh, Fady Hamoui, Marianne Huchard. To cite this version: HAL Id: lirmm

Linked data from your pocket: The Android RDFContentProvider

Open Digital Forms. Hiep Le, Thomas Rebele, Fabian Suchanek. HAL Id: hal

HySCaS: Hybrid Stereoscopic Calibration Software

Service Reconfiguration in the DANAH Assistive System

SIM-Mee - Mobilizing your social network

Multimedia CTI Services for Telecommunication Systems

How to simulate a volume-controlled flooding with mathematical morphology operators?

Assisted Policy Management for SPARQL Endpoints Access Control

DANCer: Dynamic Attributed Network with Community Structure Generator

Structuring the First Steps of Requirements Elicitation

Tacked Link List - An Improved Linked List for Advance Resource Reservation

An FCA Framework for Knowledge Discovery in SPARQL Query Answers

Relabeling nodes according to the structure of the graph

Fault-Tolerant Storage Servers for the Databases of Redundant Web Servers in a Computing Grid

The optimal routing of augmented cubes.

The Proportional Colouring Problem: Optimizing Buffers in Radio Mesh Networks

Linux: Understanding Process-Level Power Consumption

Study on Feebly Open Set with Respect to an Ideal Topological Spaces

Simulations of VANET Scenarios with OPNET and SUMO

THE COVERING OF ANCHORED RECTANGLES UP TO FIVE POINTS

Catalogue of architectural patterns characterized by constraint components, Version 1.0

YAM++ : A multi-strategy based approach for Ontology matching task

NP versus PSPACE. Frank Vega. To cite this version: HAL Id: hal

Representation of Finite Games as Network Congestion Games

Teaching Encapsulation and Modularity in Object-Oriented Languages with Access Graphs

LaHC at CLEF 2015 SBS Lab

Moveability and Collision Analysis for Fully-Parallel Manipulators

Malware models for network and service management

From medical imaging to numerical simulations

YANG-Based Configuration Modeling - The SecSIP IPS Case Study

Natural Language Based User Interface for On-Demand Service Composition

The New Territory of Lightweight Security in a Cloud Computing Environment

BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs

Experimental Evaluation of an IEC Station Bus Communication Reliability

Every 3-connected, essentially 11-connected line graph is hamiltonian

A Voronoi-Based Hybrid Meshing Method

Scalewelis: a Scalable Query-based Faceted Search System on Top of SPARQL Endpoints

Fuzzy sensor for the perception of colour

Change Detection System for the Maintenance of Automated Testing

Accurate Conversion of Earth-Fixed Earth-Centered Coordinates to Geodetic Coordinates

Regularization parameter estimation for non-negative hyperspectral image deconvolution:supplementary material

Comparison of spatial indexes

A N-dimensional Stochastic Control Algorithm for Electricity Asset Management on PC cluster and Blue Gene Supercomputer

Using a Medical Thesaurus to Predict Query Difficulty

QAKiS: an Open Domain QA System based on Relational Patterns

Very Tight Coupling between LTE and WiFi: a Practical Analysis

Comparator: A Tool for Quantifying Behavioural Compatibility

Sliding HyperLogLog: Estimating cardinality in a data stream

UsiXML Extension for Awareness Support

The Sissy Electro-thermal Simulation System - Based on Modern Software Technologies

Robust IP and UDP-lite header recovery for packetized multimedia transmission

MUTE: A Peer-to-Peer Web-based Real-time Collaborative Editor

lambda-min Decoding Algorithm of Regular and Irregular LDPC Codes

Multi-atlas labeling with population-specific template and non-local patch-based label fusion

Branch-and-price algorithms for the Bi-Objective Vehicle Routing Problem with Time Windows

A 64-Kbytes ITTAGE indirect branch predictor

Emerging and scripted roles in computer-supported collaborative learning

Quasi-tilings. Dominique Rossin, Daniel Krob, Sebastien Desreux

Traffic Grooming in Bidirectional WDM Ring Networks

IntroClassJava: A Benchmark of 297 Small and Buggy Java Programs

A Methodology for Improving Software Design Lifecycle in Embedded Control Systems

FIT IoT-LAB: The Largest IoT Open Experimental Testbed

Real-Time and Resilient Intrusion Detection: A Flow-Based Approach

Comparison of radiosity and ray-tracing methods for coupled rooms

Application of RMAN Backup Technology in the Agricultural Products Wholesale Market System

An Efficient Numerical Inverse Scattering Algorithm for Generalized Zakharov-Shabat Equations with Two Potential Functions

A Resource Discovery Algorithm in Mobile Grid Computing based on IP-paging Scheme

Monitoring Air Quality in Korea s Metropolises on Ultra-High Resolution Wall-Sized Displays

Zigbee Wireless Sensor Network Nodes Deployment Strategy for Digital Agricultural Data Acquisition

Quality of Service Enhancement by Using an Integer Bloom Filter Based Data Deduplication Mechanism in the Cloud Storage Environment

Real-time FEM based control of soft surgical robots

Deformetrica: a software for statistical analysis of anatomical shapes

Stream Ciphers: A Practical Solution for Efficient Homomorphic-Ciphertext Compression

Use of the Hydra/Sufia repository and Portland Common Data Model for research data description, organization, and access

A Practical Evaluation Method of Network Traffic Load for Capacity Planning

Modularity for Java and How OSGi Can Help

Taking Benefit from the User Density in Large Cities for Delivering SMS

Reverse-engineering of UML 2.0 Sequence Diagrams from Execution Traces

XML Document Classification using SVM

DSM GENERATION FROM STEREOSCOPIC IMAGERY FOR DAMAGE MAPPING, APPLICATION ON THE TOHOKU TSUNAMI

X-Kaapi C programming interface

Framework for Hierarchical and Distributed Smart Grid Management

From Microsoft Word 2003 to Microsoft Word 2007: Design Heuristics, Design Flaws and Lessons Learnt

COM2REACT: V2V COMMUNICATION FOR COOPERATIVE LOCAL TRAFFIC MANAGEMENT

Technical Overview of F-Interop

ASAP.V2 and ASAP.V3: Sequential optimization of an Algorithm Selector and a Scheduler

SDLS: a Matlab package for solving conic least-squares problems

Prototype Selection Methods for On-line HWR

[Demo] A webtool for analyzing land-use planning documents

Scan chain encryption in Test Standards

Self-optimisation using runtime code generation for Wireless Sensor Networks Internet-of-Things

An Experimental Assessment of the 2D Visibility Complex

Transcription:

Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface Marcin Trzmielewski, Claudio Gnoli, Marco Lardera, Gaia Heidi Pallestrini, Matea Sipic To cite this version: Marcin Trzmielewski, Claudio Gnoli, Marco Lardera, Gaia Heidi Pallestrini, Matea Sipic. Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface. Catalogue Index, Library Association, Cataloguing and Indexing Group, 2017, <https://archive.cilip.org.uk>. <hal-01886550> HAL Id: hal-01886550 https://hal.archives-ouvertes.fr/hal-01886550 Submitted on 2 Oct 2018 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface Marcin Trzmielewski, Claudio Gnoli, Marco Lardera, Gaia Heidi Pallestrini and Matea Sipic, Library Service, University of Pavia Introduction The plurality and diversity of KOSs on the horizon of documentary institutions and the missing of index unification at different levels (local, national, international) disturbs the interoperability of bibliographic data and makes a documentary research less pertinent. Web 3.0 is still a vague promise of interconnection between machines and tools and of an easy exchange of bibliographic data. The first step into the interoperability of the records in the OPAC is the provision of tools allowing mapping of different index vocabularies in the same documentary area or institution. With the information explosion and new trends in science that prefer interdisciplinary studies, plenty of contemporary published documents now deal with several subjects touched on the same work. Because of the limitations of enumerative classification schemes (often chosen for indexing documents in university libraries) where the combination of concepts from several disciplines is complicated or impossible, interdisciplinary aspects are not made retrievable enough. Today, with relational database systems and PHP, Java or HTML languages, adequate online tools linking different disciplines can finally be proposed. SciGator 1,one of the rare tools that join mapping of index vocabularies and linking of related subjects on one and the same interface, was created in December 2014 by the Library Service at University of Pavia, Italy. As a general reference KOS, SciGator adopts the Dewey Decimal Classification (DDC), which is also used as a basis for linking related classes and mapping other schemes. The DDC-based browsing library interface was presented on June 28, 2017 during the Symposium of the EDUG Annual Meeting held at the National Library of France in Paris 2. With the present paper, we invite you to discover this innovative tool and to navigate with us in the OPAC. DDC in Italy Nowadays, DDC is the most used classification scheme to index documents in Italian libraries, which increases the potential interoperability of the local catalogues with other information resources and tools. An Italian version of the online DDC schedules exists, called WebDewey Italiana, to which the scientific libraries of the University of Pavia are subscribed. These libraries were quite scattered until 2009, when they were reorganized into 8 libraries and three of those, The Science Library, the Science and Technology Library and the Medical Library, were converted to a DDC-base shelving system. Conversion of documents in these libraries still progresses, while other libraries are considering to join this classification network as well. Origins of mapping and linking interface Unfortunately, each of the Pavia scientific libraries is still physically divided into several sections, each with a different tradition of shelving based on local schemes. Moreover, they are in different positions in the town. So, books on related subjects or on the same subject are often shelved in different places, which is a potential source of confusion for users. In this situation, the Library Service staff realized that DDC may work as a virtual bridge between different local schemes and shelvings. 1. SciGator interface: http://scigator.unipv.it/ 2. EDUG, Annual meetings, 27-28 June 2017: http://edug.pansoft.de/tiki-index.php?page=2017+meeting. A short report of the meeting is available on 025.431: The Dewey Blog at http://ddc.typepad.com/025431/2017/07/2017-european-ddc-users-group-meeting.html 30

Moreover, input of a subject heading in the bibliographic records during the cataloguing process is not an available option. So, bibliographic records in Pavia are only indexed by subject when they are derived from the SBN national bibliographic database. For the other records, shelfmarks are the only database field where subject information can be stored. Furthermore, a well-known limitation of DDC is that it forces every item to be shelved or indexed under a specific discipline. That implies a loss of information on related subjects discussed in the same work. For example, many books owned by our libraries deal with mathematical subjects applied to physics, or with building subjects applied to architecture. This factor stimulated our staff to look for a tool allowing more interdisciplinary searches. All the above considered, the Library Service came up with the idea of SciGator, an interface allowing to browse DDC classes adopted in Pavia libraries and at the same time to navigate from these to related DDC classes or to equivalent classes in different KOSs that are also used to shelve documents. SciGator database structure Like many of the modern search tools, SciGator is a relational MySQL data base structured in 6 main fields, as we may see in Fig. 1: 1. notation a DDC number, for example: 628.9 2. caption - a WebDewey Italiana name of the DDC class, for example: protezione antincendio 3. captione - English translation of the DDC class, for example: fire prevention 4. scopenote - notes 5. seealso - a DDC notation of a related class, for example: 363.1 (public security) 6. equivalent - notation of an equivalent class in a local scheme, for example: AR16 (fires) Fig. 1 The SciGator database fields 31

SciGator Web interface SciGator interface is written in PHP language and managed by phpmyadmin software tool and has a tree structure. Subjects can be found under their respective disciplinary class, or in a linked class in the network of cross references, as you may observe in Fig. 2. So, in this case 628.9 (fire prevention) is our root notation. Then, its related class 363.1 (public security) is connected by an arrow and its equivalent in a local scheme AR16 is connected by around sign. Fig. 2 The SciGator interface Browsing functionalities At present, SciGator interface proposes 3 browsing functionalities in the form of 3 buttons displayed between the root caption class and the related classes (Fig. 2). The first one, shelf (icon with black book spines), allows the user to retrieve all documents having a shelf mark that begins with the corresponding notation. Its subclasses will also be included because of the default right truncation. The second one, catalogue (icon with a lens and a blue list of records) enables the user to retrieve all documents having the corresponding notation as their shelf mark or as a subject metadata coming from SBN - Italian National Catalogue. The last one, expand, (icon with four divergent arrows), enables query expansion by retrieving all documents in the previous options plus documents shelved or indexed by related classes, including both associated DDC classes and equivalent classes in local schemes. Then, after each browsing action, the user is redirected to the OPAC interface when he may consult the results of its research. All the obtained records are sorted by descending date of publication to answer better the scientific needs of users, asking generally for the most recently edited documents. Some limitations The limitations of this tool are those of the online catalogue in which searches are launched. For example, the final notation for all the learning and exercises books, especially used in mathematics textbooks, is 07. But in the catalogue only right truncation is possible. So, in SciGator as well, a search for only exercises books (of calculus or geometry ones for example) won t be allowed because the search by left truncation is not available. Furthermore, all the 3 browsing options proposed by SciGator interface often produce a great number of results, but the last one especially ( expand in the catalogue ) may also increase noise, due to the well-known inverse proportionality between recall and precision. So, each user should be informed about that and take it into consideration during her expanded search. 32

Future work Ideas for possible developments of SciGator increase day after day. Among others, we plan to eliminate the catalogue button in the case where a class is from a local classification, as in practice only DDC classes can be found in catalogue records besides shelfmarks. Another need would be to collect statistical data about the use of SciGator, which may allow us to identify categories of users and estimate their number. We suppose that SciGator interface is used the most by librarians, but we would like to reach other categories of users. We are also aware that the Web interface should be improved through more user-friendly graphic design. Moreover, adding buttons for searching the same DDC classes in the SBN national catalogue and/or WorldCat browse button seems to be especially useful for interlibrary loan services. Conclusion In short, SciGator seems to be an effective mapping tool between DDC classes and their equivalents in local schemes and makes interdisciplinary search possible. This experience also shows how classification schemes have the potential to provide more powerful search tools than is currently the case on most information services. Cooperation around a single tool between staff who came from different disciplines: librarians, information computer specialists, ERASMUS+ interns and the boys and girls of Alternative Civilian Service, including the authors of the present paper, proved to be successful. Librarians working at the front desk found SciGator a useful tool. We are now waiting for users feedback, which would allow us to ameliorate its interface and to make it more user friendly. Similar tools are starting to emerge on the European field. Norwegian mapping software CCMapper 3 or German project coli-conc 4 give evidence of the need of this type of search tools. At the EDUG Annual Meeting 2017 the opportunity of a potential collaboration between developers of such tools has emerged. 3. About Mapping through CCMapper: http://www.uio.no/for-ansatte/enhetssider/ub/prosjekter/mapping-for-sluttbrukertjenester/deltedokumenter/brosjyre-engelsk-endelig.pdf 4. About coli-conc project: https://coli-conc.gbv.de 33