Automatic creation of mappings between classification systems for bibliographic data

Size: px
Start display at page:

Download "Automatic creation of mappings between classification systems for bibliographic data"

Transcription

1 Automatic creation of mappings between classification systems for bibliographic data Prof. Magnus Pfeffer Stuttgart Media University

2 Agenda Motivation Instance-based matching Current implementation Experimental evaluation Further work October 14th, 2016 DC

3 Motivation October 14th, 2016 DC

4 Current situation in Germany Five regional library unions Subject headings Predominantly RSWK ( Regeln für den Schlagwortkatalog - Rules for the subject catalogue ) using a shared authority file Classification systems RVK (Regensburg Union Classification) BK (Basic Classification) DDC (Dewey Decimal Classification) Various local classification systems Low proportion of indexed titles (25-30%) October 14th, 2016 DC

5 Current situation in Germany National library Subject headings Predominantly RSWK ( Regeln für den Schlagwortkatalog - Rules for the subject catalogue ) using a shared authority file Classification systems DDC (Dewey Decimal Classification) Coarse categories DDC only for titles published since 2007 Only Reihe A (print trade publications) is fully indexed with RSWK October 14th, 2016 DC

6 Mapping: Goals Re-use existing indexing information National level BK is used mainly in northern Germany RVK mainly in southern Germany DDC mainly by the National Library International level Make RVK data more accessible to DDC users Use DDC indexing information available from e.g. the Library of Congress October 14th, 2016 DC

7 Mappings: Application ideas Use of appropriate classification systems Facetted search in resource discovery systems Should be monohierarchical Should have limited number of classes DDC (first digits) or BK Browsing of similar titles Should be fine-grained DDC (full) or RVK Multi-lingual retrieval October 14th, 2016 DC

8 Mappings: Application ideas Enable the use of existing tools and visualisations Denton (2012) Legrady (2005) October 14th, 2016 DC

9 Current projects Coli-conc Coordinated by the Gemeinsamer Bibliotheksverbund Library union of the states in northern Germany Aim: Exhaustive concordances from DDC to other library classification systems Main work on DDC RVK mapping Mainly manual mapping Use of statistical analysis Development of software to support mapping process October 14th, 2016 DC

10 Current projects Austrian National Library Manual creation of mappings from RVK BK Very time-consuming process Several partial mappings completed Enrichment of local data Titles with existing RVK classification are enriched with BK classes from mapping Main use-case: Facetted browsing of large result lists in the resource discovery system See Plößnig (2014) October 14th, 2016 DC

11 Instance-based matching October 14th, 2016 DC

12 Ontology matching Well-studied problem in computer science Several approaches Based on the descriptors Based on the structure Based on the manifestations (instances) October 14th, 2016 DC

13 Instances Entries in catalogues with multiple classifications October 14th, 2016 DC

14 Instance-based matching Assumptions Classes with semantic overlap appear together The more often these classes co-occur, the stronger the overlap Preparation Extraction of all pairs of classifications from the data Analysis of the extracted pairs October 14th, 2016 DC

15 Example October 14th, 2016 DC

16 Normalisation Comparing absolute numbers is useless Some classes are used more often than others Number of pairs correlates with the number of entries that are classified using a given class Instead: Use proportion of co-occurrence single occurrence E c1 E c2 E c1 E c2 number of entries with both classifications divided by number of entries with either classification (Jaccard measure for overlap of sets) October 14th, 2016 DC

17 Prior and related work Isaac (2007), Schopman (2012) Application of instance-based matching in the library domain Several projects using data from the Dutch national library Exhaustive analysis of different statistical measures Overall very encouraging results Probstmeyer (2009) Application of instance-based matching to German library data Mapping classification system to index terms October 14th, 2016 DC

18 Current implementation October 14th, 2016 DC

19 Challenges Lack of dually annotated items BK and RVK annotations located in different library union catalogues Austrian data already contains enrichment results Scaling Merging data from all German catalogues results in >100 million records International open data catalogs should be included as well October 14th, 2016 DC

20 Clustering Multiple editions Multiple document types October 14th, 2016 DC

21 Clustering Skewed data Multiple editions More pairs Some co-occurrences could appear stronger than others Solution: Pre-clustering individual titles on the work level Each cluster contributes only once Benefit: Increases chance for instances with more than one classifications October 14th, 2016 DC

22 Scaling Fast clustering Based solely on author and title information Generates and stores match keys and key equivalences for each record Generates the full closure of equivalent keys by iterating over the key equivalences Backend All software uses basic key-value access and search features Allows use of different NoSQL products to adapt to local infrastructure October 14th, 2016 DC

23 Experimental evaluation October 14th, 2016 DC

24 Data sets Library union catalogs with BK or RVK data All Entries Monographic entries Monographic with RVK Monographic with BK GBV 32,027,977 24,267, ,976,154 SWB 18,789,185 16,447,890 4,383,273 0 BVB 26,680,083 23,658,674 7,215,483 0 October 14th, 2016 DC

25 Data results Clusters Using only authors and corporate bodies (no editors, other persons) Using only main title or uniform title (no subtitles) 21,653,606 clusters 904,876 contain both BK and RVK annotations Co-occurrence 1,155,552 different RVK-BK pairs October 14th, 2016 DC

26 Evaluation Parameters Jaccard-like measure: the ratio of clusters with a given RVK-BK classes pair clusters containing the RVK class (we are only looking for RVK BK mappings) Absolute number of clusters with a given pair Gold standard Partial mapping RVK BK from the field of economics Manually created by a domain expert analysing class descriptions and structure October 14th, 2016 DC

27 Precision num num num num num num October 14th, 2016 DC

28 Recall num num num num num num October 14th, 2016 DC

29 f-measure num num num num num num October 14th, 2016 DC

30 Additional analysis All mappings from 0.6 and number 6 that were not part of the manual gold standard Manual analysis by reading class descriptions Results: 49 pairs 31 considered to be correct 12 partially correct 1 false 5 contained RVK classes that are no longer in active use Usable for improvement of manual mapping October 14th, 2016 DC

31 Results Combined data contains enough annotation information to use instance-based methods Very robust, fast and scalable implementation Encouraging results from comparison to gold standard Potential fur further improvement October 14th, 2016 DC

32 Future work October 14th, 2016 DC

33 More! More bibliographic data National catalogs International catalogs Bibliographies More experiments Different clustering parameters Different statistical analysis October 14th, 2016 DC

34 Share! Publish open source code Implement data import stage in a data modelling toolkit, e.g. knime.org Test different open source backends Publishing results in an accessible format October 14th, 2016 DC

35 Thank you for listening. Slides available online This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. October 14th, 2016 DC

36 References Isaac, Antoine, Lourens van der Meij, Stefan Schlobach, and Shenghui Wang (2007). An empirical study of instance-based ontology matching. In Karl Aberer (Editor), The Semantic Web. 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC ASWC 2007, Busan, Korea, November 11-15, 2007, (Lecture Notes in Computer Science, 4825). Berlin: Springer Plößnig, Veronika; Christoph Steiner (2014). Klassifikationen: Konkordanzen, Anreicherungsprojekte und RVK - Datenkorrekturen im Österreichischen Bibliothekenverbund. Ein Update. Retrieved September 12th, 2016, from regensburg.de/34088/1/ppt%20plnig_steiner-rvk-bk- October pdf 14th, 2016 DC

Automatic Creation of Mappings between Classification Systems for Bibliographic Data

Automatic Creation of Mappings between Classification Systems for Bibliographic Data Automatic Creation of Mappings between Classification Systems for Bibliographic Data Magnus Pfeffer Stuttgart Media University, Germany pfeffer@hdm-stuttgart.de Abstract In this paper, the implementation

More information

Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case

Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case : Usage scenarios, deployment and evaluation in a library case Antoine Isaac Henk Matthezing Lourens van der Meij Stefan Schlobach Shenghui Wang Claus Zinn Introduction Alignment technology can help solving

More information

Simple library thesaurus alignment with SILAS

Simple library thesaurus alignment with SILAS Simple library thesaurus alignment with SILAS Roelant Ossewaarde 1 Linguistics Department University at Buffalo, the State University of New York rao3@buffalo.edu Abstract. This paper describes a system

More information

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library SWIB14 Semantic Web in Libraries Conference Bonn, December 2014 Outline Introduction to The

More information

Linking library data: contributions and role of subject data. Nuno Freire The European Library

Linking library data: contributions and role of subject data. Nuno Freire The European Library Linking library data: contributions and role of subject data Nuno Freire The European Library Outline Introduction to The European Library Motivation for Linked Library Data The European Library Open Dataset

More information

Lider Roadmapping Workshop

Lider Roadmapping Workshop Deutsche Nationalbibliothek Software-supported Bibliographic Recording and Linked Data Lider Roadmapping Workshop Mark Zöpfgen Leipzig, 02.09.2014 1 Overview - DNB German National Library - Activities

More information

Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model. Tiziana Possemato Casalini Libri

Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model. Tiziana Possemato Casalini Libri Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model Tiziana Possemato Casalini Libri - @Cult New cooperative scenarios New context: new ways of cooperating between institutions

More information

Mapping Project coli-conc

Mapping Project coli-conc Mapping Project coli-conc Progress, learning & next steps U. Balakrishnan, J. Agne, J. Voß Content Aim & Project Start Partners coli-conc-key Objectives & Approaches Work Packages Survey JSKOS Dataformat

More information

Putting ontology alignment in context: usage scenarios, deployment and evaluation in a library case

Putting ontology alignment in context: usage scenarios, deployment and evaluation in a library case Putting ontology alignment in context: usage scenarios, deployment and evaluation in a library case Antoine Isaac 1,2, Henk Matthezing 2, Lourens van der Meij 1,2, Stefan Schlobach 1, Shenghui Wang 1,2,

More information

Lazy Big Data Integration

Lazy Big Data Integration Lazy Big Integration Prof. Dr. Andreas Thor Hochschule für Telekommunikation Leipzig (HfTL) Martin-Luther-Universität Halle-Wittenberg 16.12.2016 Agenda Integration analytics for domain-specific questions

More information

Extending the Facets concept by applying NLP tools to catalog records of scientific literature

Extending the Facets concept by applying NLP tools to catalog records of scientific literature Extending the Facets concept by applying NLP tools to catalog records of scientific literature *E. Picchi, *M. Sassi, **S. Biagioni, **S. Giannini *Institute of Computational Linguistics **Institute of

More information

Mymory: Enhancing a Semantic Wiki with Context Annotations

Mymory: Enhancing a Semantic Wiki with Context Annotations Mymory: Enhancing a Semantic Wiki with Context Annotations Malte Kiesel, Sven Schwarz, Ludger van Elst, and Georg Buscher Knowledge Management Department German Research Center for Artificial Intelligence

More information

ALOE - A Socially Aware Learning Resource and Metadata Hub

ALOE - A Socially Aware Learning Resource and Metadata Hub ALOE - A Socially Aware Learning Resource and Metadata Hub Martin Memmel & Rafael Schirru Knowledge Management Department German Research Center for Artificial Intelligence DFKI GmbH, Trippstadter Straße

More information

Interaction Design and Implementation for Multimodal Mobile Semantic Web Interfaces

Interaction Design and Implementation for Multimodal Mobile Semantic Web Interfaces HCI International, Beijing, China, 27th July 2007 Interaction Design and Implementation for Multimodal Mobile Semantic Web Interfaces Daniel Sonntag German Research Center for Artificial Intelligence 66123

More information

Linking library metadata to the web: the German experience

Linking library metadata to the web: the German experience Linking library metadata to the web: the German experience Gabriele Meßmer «What is the value of a catalogue of more than 23 million records?»was one of the questions we discussed when starting the linked

More information

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library Nuno Freire Chief data officer The European Library Pacific Neighbourhood Consortium 2014 Annual

More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs The Smart Book Recommender: An Ontology-Driven Application for Recommending Editorial Products

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Supporting interoperability of distributed digital archives using authority-controlled ontologies ABSTRACT

Supporting interoperability of distributed digital archives using authority-controlled ontologies ABSTRACT Supporting interoperability of distributed digital archives using authority-controlled ontologies Alfons Ruch (1) (1) University of Passau 94030 Passau, Germany EMail: Alfons.Ruch@uni-passau.de ABSTRACT

More information

Outline. Structures for subject browsing. Subject browsing. Research issues. Renardus

Outline. Structures for subject browsing. Subject browsing. Research issues. Renardus Outline Evaluation of browsing behaviour and automated subject classification: examples from KnowLib Subject browsing Automated subject classification Koraljka Golub, Knowledge Discovery and Digital Library

More information

Lars G. Svensson. DDC as Linked Data: DNB efforts, other efforts and opportunities

Lars G. Svensson. DDC as Linked Data: DNB efforts, other efforts and opportunities Lars G. Svensson DDC as Linked Data: DNB efforts, other efforts and opportunities 1 30 EMEA 2011: Future Search for Technical Services March 2, 2011 When people search for information to-day, the usually

More information

Whitestein Series in software Agent Technologies. About whitestein Technologies

Whitestein Series in software Agent Technologies. About whitestein Technologies Whitestein Series in software Agent Technologies Series Editors: Marius Walliser Stefan Brantschen Monique Calisti Thomas Hempfling This series reports new developments in agent-based software technologies

More information

Crowdsourcing the Dewey Decimal Classification: When Users Become Contributors

Crowdsourcing the Dewey Decimal Classification: When Users Become Contributors Submitted on: 22.10.2017 2016 Satellite meeting - Subject Access: Unlimited Opportunities 11 12 August 2016 State Library of Ohio, Columbus, Ohio, USA Crowdsourcing the Dewey Decimal Classification: When

More information

Lecture 17 MaRC as Metadata

Lecture 17 MaRC as Metadata IMS2603 Information Management in Organisations Lecture 17 MaRC as Metadata Revision Last lecture looked at philosophical bases for thinking about metadata, in particular looking at ontology as an approach

More information

Math Information Retrieval: User Requirements and Prototype Implementation. Jin Zhao, Min Yen Kan and Yin Leng Theng

Math Information Retrieval: User Requirements and Prototype Implementation. Jin Zhao, Min Yen Kan and Yin Leng Theng Math Information Retrieval: User Requirements and Prototype Implementation Jin Zhao, Min Yen Kan and Yin Leng Theng Why Math Information Retrieval? Examples: Looking for formulas Collect teaching resources

More information

Visual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017

Visual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017 Visual Concept Detection and Linked Open Data at the TIB AV- Portal Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017 Agenda 1. TIB and TIB AV-Portal 2. Automated Video Analysis 3. Visual

More information

Ontology Matching with CIDER: Evaluation Report for the OAEI 2008

Ontology Matching with CIDER: Evaluation Report for the OAEI 2008 Ontology Matching with CIDER: Evaluation Report for the OAEI 2008 Jorge Gracia, Eduardo Mena IIS Department, University of Zaragoza, Spain {jogracia,emena}@unizar.es Abstract. Ontology matching, the task

More information

Appendix. Description of Parameters Necessary to run EARTH. The EARTH simulation requires that 29 parameters be specified. A brief

Appendix. Description of Parameters Necessary to run EARTH. The EARTH simulation requires that 29 parameters be specified. A brief Appendix Description of Parameters Necessary to run EARTH The EARTH simulation requires that 29 parameters be specified. A brief description of each parameter and its minimum and maximum possible values

More information

NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by

NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by Roman S. Panchyshyn Catalog Librarian, Assistant Professor Kent State University Libraries This presentation will address

More information

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved

More information

LOD in Digital Libraries - Current Issues

LOD in Digital Libraries - Current Issues LOD in Digital Libraries - Current Issues Ansgar Scherp a.scherp@zbw.eu FG Datenbanken March 2014 Braunschweig Index Newly Acquired Media Ancient world: Library of Alexandria Today: database-oriented systems

More information

Hunting for semantic clusters

Hunting for semantic clusters Hunting for semantic clusters Hierarchical structuring of Cultural Heritage objects within large aggregations Shenghui Wang 1 Antoine Isaac 2 Valentine Charles 2 Rob Koopman 1 Anthi Agoropoulou 2 Titia

More information

Semantic Web and Natural Language Processing

Semantic Web and Natural Language Processing Semantic Web and Natural Language Processing Wiltrud Kessler Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Semantic Web Winter 2014/2015 This work is licensed under a Creative Commons

More information

Assessing Metadata Utilization: An Analysis of MARC Content Designation Use

Assessing Metadata Utilization: An Analysis of MARC Content Designation Use Assessing Metadata Utilization: An Analysis of MARC Content Designation Use William E. Moen , Penelope Benardino School of Library and Information Sciences, Texas Center

More information

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Predictive Analysis: Evaluation and Experimentation. Heejun Kim Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training

More information

Mineração de Dados Aplicada

Mineração de Dados Aplicada Data Exploration August, 9 th 2017 DCC ICEx UFMG Summary of the last session Data mining Data mining is an empiricism; It can be seen as a generalization of querying; It lacks a unified theory; It implies

More information

Evaluation and Design Issues of Nordic DC Metadata Creation Tool

Evaluation and Design Issues of Nordic DC Metadata Creation Tool Evaluation and Design Issues of Nordic DC Metadata Creation Tool Preben Hansen SICS Swedish Institute of computer Science Box 1264, SE-164 29 Kista, Sweden preben@sics.se Abstract This paper presents results

More information

DARIAH-DE Geo-Browser and Datasheet Editor

DARIAH-DE Geo-Browser and Datasheet Editor Förderkennzeichen 01UG1610A bis J DARIAH-DE Geo-Browser and Datasheet Editor Thomas Kollatz Steinheim-Institut, Essen Workshop 6. September 2016 Stuttgart de.dariah.eu DARIAH-DE GEO-BROWSER 27/09/2016

More information

Oshiba Tadahiko National Diet Library Tokyo, Japan

Oshiba Tadahiko National Diet Library Tokyo, Japan http://conference.ifla.org/ifla77 Date submitted: June 30, 2011 A service of the National Diet Library, Japan, to the semantic web community Oshiba Tadahiko National Diet Library Tokyo, Japan Meeting:

More information

Organizing Economic Information

Organizing Economic Information Organizing Economic Information An Overview of Application and Reuse Scenarios of an Economics Knowledge Organization System Tobias Rebholz, Andreas Oskar Kempf, Joachim Neubert ZBW Leibniz Information

More information

Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF)

Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF) Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF) The general catalogue of the BnF First computer catalogue for the users of the

More information

Linked data for manuscripts in the Semantic Web

Linked data for manuscripts in the Semantic Web Linked data for manuscripts in the Semantic Web Gordon Dunsire Summer School in the Study of Historical Manuscripts Zadar, Croatia, 26 30 September 2011 Topic II: New Conceptual Models for Information

More information

Automatically Generating Queries for Prior Art Search

Automatically Generating Queries for Prior Art Search Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation

More information

Gene Clustering & Classification

Gene Clustering & Classification BINF, Introduction to Computational Biology Gene Clustering & Classification Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Introduction to Gene Clustering

More information

Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey.

Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey. Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey. Chapter 1: Organization of Recorded Information The Need to Organize The Nature of Information Organization

More information

Business to Consumer Markets on the Semantic Web

Business to Consumer Markets on the Semantic Web Workshop on Metadata for Security (W-MS) International Federated Conferences (OTM '03) Business to Consumer Markets on the Semantic Web Prof. Dr.-Ing. Robert Tolksdorf, Dipl.-Kfm. Christian Bizer Freie

More information

Ex Libris Integrated and Consortia Solutions.

Ex Libris Integrated and Consortia Solutions. Ex Libris Integrated and Consortia Solutions Heal-Link Workshop September 2007 www.exlibrisgroup.com Agenda Ex Libris who and where we are New trends an challenges Primo and our integrated solutions Consortia

More information

Grid Resources Search Engine based on Ontology

Grid Resources Search Engine based on Ontology based on Ontology 12 E-mail: emiao_beyond@163.com Yang Li 3 E-mail: miipl606@163.com Weiguang Xu E-mail: miipl606@163.com Jiabao Wang E-mail: miipl606@163.com Lei Song E-mail: songlei@nudt.edu.cn Jiang

More information

Semantic Interoperability. Being serious about the Semantic Web

Semantic Interoperability. Being serious about the Semantic Web Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA 1 Being serious about the Semantic Web It is not one person s ontology It is not several people s common

More information

A Bagging Method using Decision Trees in the Role of Base Classifiers

A Bagging Method using Decision Trees in the Role of Base Classifiers A Bagging Method using Decision Trees in the Role of Base Classifiers Kristína Machová 1, František Barčák 2, Peter Bednár 3 1 Department of Cybernetics and Artificial Intelligence, Technical University,

More information

Europeana and semantic alignment of vocabularies

Europeana and semantic alignment of vocabularies Europeana and semantic alignment of vocabularies Antoine Isaac Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber Europeana & Vrije Universiteit Amsterdam NKOS workshop, Berlin, Sept.

More information

Portale und Ontologien

Portale und Ontologien Portale und Ontologien Kerstin Zimmermann http://www.deri.org DERI Innsbruck 1 Inhalt Digitale Bibliothek Fachportal Semantic Portal Ontologien Semantic Library Beispiele 2 Die Digitale Bibliothek 3 vascoda

More information

THIS LECTURE. How do we know if our results are any good? Results summaries: Evaluating a search engine. Making our good results usable to a user

THIS LECTURE. How do we know if our results are any good? Results summaries: Evaluating a search engine. Making our good results usable to a user EVALUATION Sec. 6.2 THIS LECTURE How do we know if our results are any good? Evaluating a search engine Benchmarks Precision and recall Results summaries: Making our good results usable to a user 2 3 EVALUATING

More information

Natural Language Processing. SoSe Question Answering

Natural Language Processing. SoSe Question Answering Natural Language Processing SoSe 2017 Question Answering Dr. Mariana Neves July 5th, 2017 Motivation Find small segments of text which answer users questions (http://start.csail.mit.edu/) 2 3 Motivation

More information

CHIP demonstrator: Semantics-driven recommendations and museum tour generation Aroyo, L.M.; Stash, N.; Wang, Y.; Gorgels, P.; Rutledge, L.W.

CHIP demonstrator: Semantics-driven recommendations and museum tour generation Aroyo, L.M.; Stash, N.; Wang, Y.; Gorgels, P.; Rutledge, L.W. CHIP demonstrator: Semantics-driven recommendations and museum tour generation Aroyo, L.M.; Stash, N.; Wang, Y.; Gorgels, P.; Rutledge, L.W. Published in: Proceedings of the 6th International Semantic

More information

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor

More information

WSMO Working Draft 04 October 2004

WSMO Working Draft 04 October 2004 Page 1 of 10 D17 WSMO Tutorial WSMO Working Draft 04 October 2004 This version: http://www.wsmo.org/2004/d17/20041004/ Latest version: http://www.wsmo.org/2004/d17/ Previous version: http://www.wsmo.org/2004/d17/v0.1/20040913/

More information

CSE 7/5337: Information Retrieval and Web Search Document clustering I (IIR 16)

CSE 7/5337: Information Retrieval and Web Search Document clustering I (IIR 16) CSE 7/5337: Information Retrieval and Web Search Document clustering I (IIR 16) Michael Hahsler Southern Methodist University These slides are largely based on the slides by Hinrich Schütze Institute for

More information

A Set of Annotations for supporting a TTS Application for Folktales

A Set of Annotations for supporting a TTS Application for Folktales A Set of Annotations for supporting a TTS Application for Folktales Thierry Declerck Multilingual Technologies German Research Center for Artificial Intelligence (DFKI GmbH) E-mail: declerck@dfki.de Abstract

More information

PUBLICATION OF INSPIRE-BASED AGRICULTURAL LINKED DATA

PUBLICATION OF INSPIRE-BASED AGRICULTURAL LINKED DATA This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No 732064 This project is part of BDV PPP PUBLICATION OF INSPIRE-BASED AGRICULTURAL

More information

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017 Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin Frankfurt Book Fair 2017 October 11, 2017 1 Springer Nature s Metadata Mission Statement We understand metadata as the gateway

More information

An information retrieval system may include 3 categories of information: Factual Bibliographical Institutional Exchange and sharing of these

An information retrieval system may include 3 categories of information: Factual Bibliographical Institutional Exchange and sharing of these An information retrieval system may include 3 categories of information: Factual Bibliographical Institutional Exchange and sharing of these categories of information across different user communities

More information

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R,

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R, Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R, statistics foundations 5 Introduction to D3, visual analytics

More information

Nesnelerin İnternetinde Veri Analizi

Nesnelerin İnternetinde Veri Analizi Bölüm 4. Frequent Patterns in Data Streams w3.gazi.edu.tr/~suatozdemir What Is Pattern Discovery? What are patterns? Patterns: A set of items, subsequences, or substructures that occur frequently together

More information

Application of Hierarchical Clustering to Find Expression Modules in Cancer

Application of Hierarchical Clustering to Find Expression Modules in Cancer Application of Hierarchical Clustering to Find Expression Modules in Cancer T. M. Murali August 18, 2008 Innovative Application of Hierarchical Clustering A module map showing conditional activity of expression

More information

Information System on Literature in the Field of ICT for Environmental Sustainability

Information System on Literature in the Field of ICT for Environmental Sustainability International Environmental Modelling and Software Society (iemss) 2010 International Congress on Environmental Modelling and Software Modelling for Environment s Sake, Fifth Biennial Meeting, Ottawa,

More information

data elements (Delsey, 2003) and by providing empirical data on the actual use of the elements in the entire OCLC WorldCat database.

data elements (Delsey, 2003) and by providing empirical data on the actual use of the elements in the entire OCLC WorldCat database. Shawne D. Miksa, William E. Moen, Gregory Snyder, Serhiy Polyakov, Amy Eklund Texas Center for Digital Knowledge, University of North Texas Denton, Texas, U.S.A. Metadata Assistance of the Functional Requirements

More information

The OASIS Applications Semantic (Inter-) Connection Framework Dionisis Kehagias, CERTH/ITI

The OASIS Applications Semantic (Inter-) Connection Framework Dionisis Kehagias, CERTH/ITI ISWC 2011 - OASIS Symposium Monday, 24th October 2011 The OASIS Applications Semantic (Inter-) Connection Framework Dionisis Kehagias, CERTH/ITI Contents of this presentation Interoperability problems

More information

LOGICAL DATA MODELING

LOGICAL DATA MODELING LOGICAL DATA MODELING INTEGRATED SERIES IN INFORMATION SYSTEMS Professor Ramesh Sharda Oklahoma State University Series Editors Prof. Dr. Stefan VoB Universitat Hamburg Expository and Research Monographs

More information

Do we need metadata? An on-line survey in german archives

Do we need metadata? An on-line survey in german archives Do we need metadata? An on-line survey in german archives Marcel Ruhl University of Applied Science Potsdam, Faculty of Information Sciences, Friedrich-Ebert-Str. 4, 14467 Potsdam, Germany ruhl@fh-potsdam.de

More information

Local Metadatamanagement in a global environment

Local Metadatamanagement in a global environment Frankfurt 16 June 2010 Local Metadatamanagement in a global environment Daniel van Spanje Global Productmanager Metadata Services OCLC metadata has become the structure on which we re building information

More information

OCLC Pica CBS. The Central Library system

OCLC Pica CBS. The Central Library system OCLC Pica CBS The Central Library system A generic solution for: creation and maintenance of union catalogues controlled document ordering and delivery Fourth generation CBS4 Technical data Hardware: SUN

More information

'Mixed Methods' Indexing: Building-Up a Multi-Level Infrastructure for Subject Indexing

'Mixed Methods' Indexing: Building-Up a Multi-Level Infrastructure for Subject Indexing Submitted on: 14.11.2017 2016 Satellite meeting - Subject Access: Unlimited Opportunities 11 12 August 2016 State Library of Ohio, Columbus, Ohio, USA 'Mixed Methods' Indexing: Building-Up a Multi-Level

More information

3. Finding Components in Component Repositories

3. Finding Components in Component Repositories 3. Finding Components in Component Repositories 1. Component Search with Metadata 2. Searching and Browsing with Faceted Classication 3. Faceted Component Stores 4. Searching by Conformance to Protocols

More information

3. Finding Components in Component Repositories Component Search. Obligatory Literature. References

3. Finding Components in Component Repositories Component Search. Obligatory Literature. References 3. Finding Components in Component Repositories 1. Component Search with Metadata 2. Searching and Browsing with Faceted Classication 3. Faceted Component Stores 4. Searching by Conformance to Protocols

More information

Investigating Collaboration Dynamics in Different Ontology Development Environments

Investigating Collaboration Dynamics in Different Ontology Development Environments Investigating Collaboration Dynamics in Different Ontology Development Environments Marco Rospocher DKM, Fondazione Bruno Kessler rospocher@fbk.eu Tania Tudorache and Mark Musen BMIR, Stanford University

More information

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213) Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding

More information

Europeana Core Service Platform

Europeana Core Service Platform Europeana Core Service Platform DELIVERABLE D7.1: Strategic Development Plan, Architectural Planning Revision Final Date of submission 30 October 2015 Author(s) Marcin Werla, PSNC Pavel Kats, Europeana

More information

Riding the Wave: Move Beyond Text TIB's strategy in the context of non-textual materials. Uwe Rosemann, Irina Sens IATUL Conference Singapur

Riding the Wave: Move Beyond Text TIB's strategy in the context of non-textual materials. Uwe Rosemann, Irina Sens IATUL Conference Singapur Riding the Wave: Move Beyond Text TIB's strategy in the context of non-textual materials Uwe Rosemann, Irina Sens IATUL Conference Singapur Outline TIB Role and functions Requirements Politicians - Funders

More information

IKR EmuLib. A Library for Seamless Integration of Simulation and Emulation. Marc Necker, Christoph Gauger [necker

IKR EmuLib. A Library for Seamless Integration of Simulation and Emulation. Marc Necker, Christoph Gauger [necker Universität Stuttgart INSTITUT FÜR NACHRICHTENVERMITTLUNG UND DATENVERARBEITUNG Prof. Dr.-Ing. Dr. h. c. mult. P. J. Kühn INSTITUT FÜR KOMMUNIKATIONSNETZE UND RECHNERSYSTEME Prof. Dr.-Ing. Dr. h. c. mult.

More information

The Dutch case The vocabulary work of RKD and the National Strategy for Digital Heritage

The Dutch case The vocabulary work of RKD and the National Strategy for Digital Heritage The Dutch case The vocabulary work of RKD and the National Strategy for Digital Heritage Pacific Neighbourhood Consortium 2017, Annual conference, November 6-9 intainan Taiwan by: Reem Weda Information

More information

BIBLIODATA. Subject Coverage. File Type. Features Thesaurus None. Record Content. File Size. Coverage Updates. Language.

BIBLIODATA. Subject Coverage. File Type. Features Thesaurus None. Record Content. File Size. Coverage Updates. Language. Subject Coverage File Type The database is multidisciplinary. Parts of the German National Bibliography, which are included in : A: Monographs and periodicals from the publishers' book trade B: Monographs

More information

THEORY AND PRACTICE OF CLASSIFICATION

THEORY AND PRACTICE OF CLASSIFICATION THEORY AND PRACTICE OF CLASSIFICATION Ms. Patience Emefa Dzandza pedzandza@ug.edu.gh College of Education: School of Information and Communication Department of Information Studies ICT and Library Classification

More information

RLC RLC RLC. Merge ToolBox MTB. Getting Started. German. Record Linkage Software, Version RLC RLC RLC. German. German.

RLC RLC RLC. Merge ToolBox MTB. Getting Started. German. Record Linkage Software, Version RLC RLC RLC. German. German. German RLC German RLC German RLC Merge ToolBox MTB German RLC Record Linkage Software, Version 0.742 Getting Started German RLC German RLC 12 November 2012 Tobias Bachteler German Record Linkage Center

More information

Improving data quality at Europeana New requirements and methods for better measuring metadata quality

Improving data quality at Europeana New requirements and methods for better measuring metadata quality Improving data quality at Europeana New requirements and methods for better measuring metadata quality Péter Király 1, Hugo Manguinhas 2, Valentine Charles 2, Antoine Isaac 2, Timothy Hill 2 1 Gesellschaft

More information

Social Business Intelligence in Action

Social Business Intelligence in Action Social Business Intelligence in ction Matteo Francia, nrico Gallinucci, Matteo Golfarelli, Stefano Rizzi DISI University of Bologna, Italy Introduction Several Social-Media Monitoring tools are available

More information

Architekturen für die Cloud

Architekturen für die Cloud Architekturen für die Cloud Eberhard Wolff Architecture & Technology Manager adesso AG 08.06.11 What is Cloud? National Institute for Standards and Technology (NIST) Definition On-demand self-service >

More information

Testbed a walk-through

Testbed a walk-through Testbed a walk-through Digital Preservation Planning: Principles, Examples and the Future with Planets, July 2008 Matthew Barr HATII at the University of Glasgow Contents Definitions and goals Achievements

More information

FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative

FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative Marc Ehrig Institute AIFB University of Karlsruhe 76128 Karlsruhe, Germany ehrig@aifb.uni-karlsruhe.de

More information

Crossing the Archival Borders

Crossing the Archival Borders IST-Africa 2008 Conference Proceedings Paul Cunningham and Miriam Cunningham (Eds) IIMC International Information Management Corporation, 2008 ISBN: 978-1-905824-07-6 Crossing the Archival Borders Fredrik

More information

Syrtis: New Perspectives for Semantic Web Adoption

Syrtis: New Perspectives for Semantic Web Adoption Syrtis: New Perspectives for Semantic Web Adoption Joffrey Decourselle, Fabien Duchateau, Ronald Ganier To cite this version: Joffrey Decourselle, Fabien Duchateau, Ronald Ganier. Syrtis: New Perspectives

More information

Ontology Generation from Session Data for Web Personalization

Ontology Generation from Session Data for Web Personalization Int. J. of Advanced Networking and Application 241 Ontology Generation from Session Data for Web Personalization P.Arun Research Associate, Madurai Kamaraj University, Madurai 62 021, Tamil Nadu, India.

More information

Introduction

Introduction Introduction EuropeanaConnect All-Staff Meeting Berlin, May 10 12, 2010 Welcome to the All-Staff Meeting! Introduction This is a quite big meeting. This is the end of successful project year Project established

More information

Emergency Services: Process, Rules and Events

Emergency Services: Process, Rules and Events Emergency Services: Process, Rules and Events Mauricio Salatino, Esteban Aliverti, and Demian Calcaprina Plugtree salaboy@gmail.com Abstract. The Emergency Service Application was built as a blue print

More information

Document Clustering for Mediated Information Access The WebCluster Project

Document Clustering for Mediated Information Access The WebCluster Project Document Clustering for Mediated Information Access The WebCluster Project School of Communication, Information and Library Sciences Rutgers University The original WebCluster project was conducted at

More information

Introduction to Information Retrieval. Lecture Outline

Introduction to Information Retrieval. Lecture Outline Introduction to Information Retrieval Lecture 1 CS 410/510 Information Retrieval on the Internet Lecture Outline IR systems Overview IR systems vs. DBMS Types, facets of interest User tasks Document representations

More information

Inferring Protocol State Machine from Network Traces: A Probabilistic Approach

Inferring Protocol State Machine from Network Traces: A Probabilistic Approach Inferring Protocol State Machine from Network Traces: A Probabilistic Approach Yipeng Wang, Zhibin Zhang, Danfeng(Daphne) Yao, Buyun Qu, Li Guo Institute of Computing Technology, CAS Virginia Tech, USA

More information

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering Gurpreet Kaur M-Tech Student, Department of Computer Engineering, Yadawindra College of Engineering, Talwandi Sabo,

More information

Describing Knowledge Organization Systems in BARTOC and JSKOS

Describing Knowledge Organization Systems in BARTOC and JSKOS Describing Knowledge Organization Systems in BARTOC and JSKOS Andreas Ledl 1 and Jakob Voß 2 1 University Library of Basel, Basel 2 Verbundzentrale des GBV (VZG), Göttingen Abstract. This paper introduces

More information

Enhancing information services using machine to machine terminology services

Enhancing information services using machine to machine terminology services Enhancing information services using machine to machine terminology services Gordon Dunsire Presented to the IFLA 2009 Satellite Conference Looking at the past and preparing for the future 20-21 Aug 2009,

More information