BreXLiMe: A Semantically Enriched Brexit Dataset Using Cross-Lingual and Cross-Media Knowledge Extraction

Size: px
Start display at page:

Download "BreXLiMe: A Semantically Enriched Brexit Dataset Using Cross-Lingual and Cross-Media Knowledge Extraction"

Transcription

1 BreXLiMe: A Semantically Enriched Brexit Dataset Using Cross-Lingual and Cross-Media Knowledge Extraction Lei Zhang 1, Michael Färber 2, Steffen Thoma 3, Maribel Acosta 3, and Achim Rettinger 3 1 FIZ Karlsruhe Leibniz Institute for Information Infrastructure, Germany lei.zhang@fiz-karlsruhe.de 2 University of Freiburg, Germany michael.faerber@cs.uni-freiburg.de 3 Karlsruhe Institute of Technology (KIT), Germany {steffen.thoma acosta rettinger}@kit.edu Abstract. The withdrawal of the UK from the EU, known as Brexit, has already been subject to a number of sociological and economic studies using data from social media especially Twitter. However, comprehensive studies based on datasets supporting different media and languages are still missing so far. In this paper, we introduce such a dataset collected from various media sources (including online news sites, social media and live-tv) in three languages (i.e., English, German, and Spanish), which has been further semantically enriched with annotations of both entities and categories from DBpedia. This dataset is provided online in the RDF serialization formats Turtle and N-Triples and can serve as a data basis for applications and studies of various disciplines on the case of Brexit. Keywords: semantic annotation, semantic integration, semantic search, cross-lingual, cross-modality, Brexit. 1 Introduction The outcome of the Brexit referendum held on June 23, 2016 in the UK was until the last moment unclear and, hence, thrilling. In retro-perspective, besides the pure result (exiting the EU), the case of Brexit is from a sociological, political and psychological perspective particularly interesting and relevant. For instance, psychologists and sociologists are interested in how topics and opinions spread in the public discussions (visible/transmitted via public media such as news articles, social media, and TV shows) and how those topics and opinions relate to specific items such as persons in the public (David Cameron etc.) and subjects of public concern (Euroscepticism etc.). Also, with the ongoing discussion about fake media there is interest in investigating the difference of media coverage between traditional (news and TV) versus social media. Recently, many approaches have been proposed for prediction and analysis concerning Brexit. Most related work is based on data from Twitter [1 4] and

2 2 other social media platforms [5 7], where Celli [7] deals with multilingual social media data, but all other work only addresses the English language. In order to facilitate a more comprehensive investigation of Brexit, there is an impending need for datasets containing multilingual data from various multimedia sources. Also, the existing analysis of the Brexit use case ranges from using the most frequently mentioned words, topics, themes, sentiments and URLs, however, no explicit semantics in knowledge bases has been utilized for semantic processing. In this regard, we present the BreXLiMe dataset, which has been collected from the multilingual data streams regarding BreXit in different media channels and further semantically enriched by Cross-Lingual and Cross-Media knowledge extraction technologies developed in our xlime project [8]. To summarize, the main contributions of this work are threefold: 1. The BreXLiMe dataset provides a semantically enriched collection of media coverage on the Brexit referendum from different media sources (including online news sites, social media and live-tv streams) in three languages (i.e., English, German and Spanish). To the best of our knowledge, this is the first dataset regarding Brexit that contains multilingual media data from several channels. Besides the pure meta-information (such as publication date and source), the media content has been semantically enriched with DBpedia [9] entities and categories. This dataset is available in the RDF formats Turtle and N-Triples at and licensed under CC BY-NC-SA 4.0. Due to licensing issues, we cannot provide the full content of the media items, which is accessible via the provided links. 2. The BreXLiMe development architecture shows a methodology for creating such a dataset that is capable of cross-lingual and cross-media knowledge extraction from media streams (see Sec. 2) as well as a data model based on RDF that is suited to describe and integrate knowledge in different media and languages (see Sec. 3). This architecture provides a general way of semantic data processing and can be easily adapted to other use cases on request. 3. The BreXLiMe dataset has been utilized to support BreXearch, our semantic search system with a focus on the case of Brexit, which enables cross-lingual and cross-media Brexit data retrieval and analytics (see Sec. 4). Besides that, we believe that this dataset can also be beneficial to many other applications and studies. In particular, it allows researchers to investigate differences and commonalities among these media channels and languages. 2 Development Methodology The development architecture of BreXLiMe is illustrated in Fig. 1. In this section, we briefly introduce the used media sources and the main development modules including data filtering, entity linking and entity-based text categorization.

3 3 2.1 Media Sources Fig. 1. BreXLiMe development architecture. Within the context of our xlime project, we first extracted textual data from different media sources, where three main data providers (JSI Newsfeed 4, VICO 5 and Zattoo 6 ) have delivered the multimedia data as streams: JSI NewsFeed provides a real-time aggregated multilingual stream of news articles from around 75,000 news websites across the world, such as The New York Times, Bloomberg News and Spiegel Online. VICO harvests large amounts of social media data in multiple languages not only from large social networks like Twitter, Facebook, Google+, and YouTube, but also from a broad spectrum of forums, blogs and review sites. Zattoo provides live TV streams consisting of video frames and audio for around 150 multilingual channels, such as CNN International, BBC World, N24 and Tageschau24. The textual data is then extracted from these media sources accompanied by preprocessing tools for converting image and speech to text, i.e., optical character recognition (OCR) and automatic speech recognition (ASR), in the case of the video and audio streams from the TV content. 2.2 Data Filtering Based on the multilingual data streams extracted from the above media sources, the first development module of data filtering aims to collect a custom dataset of media items, which are related to the Brexit referendum held on June 23,

4 4 News Social Media TV #Items #Entities #Cate. #Items #Entities #Cate. #Items #Entities #Cate. English 166.4K 7.04M 832K 10.48M 30.4M 52.4K K 3290 German 50.2K 1.21M 251K 426.7K 1.03M 2.13M K 1120 Spanish 27.9K 856K 139.5K 1.06M 2.34M 5.3M Total 244.5K 8.25M 1.22M 11.97M 33.77M 59.85M K 4410 Table 1. Statistics about the BreXLiMe dataset. As part of the xlime project, the dataset was gathered in June 2016 by adding a set of filters on the extracted multilingual data streams. Based on the xlime use cases, we first limited the data streams of newsfeed and social media to three languages, i.e., English, German, Spanish, and selected a subset of Zattoo s available TV streams, which cover both English and German. In addition, we have implemented a subscription service that allows us to use queries to filter the full streams and the terms being filtered are related to the Brexit referendum, such as Brexit, UK and EU, which resulted in around 240 thousand new articles, 12 million microposts and 900 TV programs in a month. The statistics of the media items in the dataset is shown in Table 1. The mechanism for adding new filters to these media streams is straightforward and can be easily adapted to other use cases on request. 2.3 Cross-lingual Entity Linking As the BreXLiMe dataset serves for analyzing how topics and opinions about Brexit spread in the public discussions and how they are related to entities of public interest, the development module of cross-lingual entity linking aims to detect not only named entities (e.g., David Cameron) but also nominal entities (e.g., Prime Minister of the United Kingdom) in multilingual text extracted from the media items using DBpedia as the knowledge base. In order to match words and phrases in different languages against DBpedia entities, we have built our cross-lingual linked data lexica, called xlid-lexica, by exploiting multilingual Wikipedia to extract the cross-lingual groundings of entities. With the goal of addressing the challenges of correctness, completeness and emergence of mention detection, we employ our recent work [10] to recognize the boundaries of mentions in text that are likely to denote both named entities and nominal entities. For each detected mention, its candidate entities have been then extracted using xlid-lexica. Then, a graph-based disambiguation method has been employed to determine the final entity for each mention based on both features of mention-entity compatibility and entity-entity coherence [11]. The statistics about the detected entities in the dataset is shown in Table Entity-based Text Categorization Besides entities, it is also interesting to study how public discussions on different media channels correlate with the subjects of concern, e.g., Euroscepticism, which can be addressed by making use of the full potential of the structured knowledge base in the background. In most of the semantic knowledge bases like DBpedia, entities are organized in a category hierarchy. For example, the entity Brexit has

5 5 Fig. 2. The schema used in BreXLiMe. Apart from sioc:has creator (used for social media items) and dcterms:publisher (used for both news and social media items), the listed classes and relations are used for data from all three media sources. its parent category Category:Euroscepticism in the United Kingdom, which in turn is a subcategory of Category:Euroscepticism. By utilizing this category hierarchy, the next development module of entity-based text categorization aims to derive the categories related to media items based on their mentioned entities. For this purpose, we firstly let sociologists decide on the relevant categories in DBpedia for the Brexit use case, resulting in a set of 73 candidate categories, e.g., Category:Immigration to Europe, Category:European Union law and Category:Economy of the European Union. Given a media item, each detected entity and all its reachable candidate categories in DBpedia are then added into a directed graph, whree the scores of all categories are computed by a random walk algorithm based on both entity-category associations and category-category dependencies (see more details in [12]). Finally, the top-5 categories with the highest scores are output for each media item. The statistics about the derived categories in the dataset is shown in Table 1. 3 Data Modeling The semantic integration of cross-lingual and cross-media data streams poses a new challenge to identify a common model that suits the diversity of media data from different sources and the output of the development methodology discussed

6 6 Fig. 3. Example of an annotated social media item in RDF (Turtle fomat). in Sec. 2. To address the above challenge, we introduce a general data model in the media domain, which enables semantic integration of media data on multiple modalities, languages and sources, and thus allows for a seamless semantic access to media data streams in combination with additional background knowledge. This data model used by BreXLiMe is defined as an RDF vocabulary and tailored specifically to the different modalities: text, audio and video. It extends other vocabularies, such as the Dublin Core 7, SIOC 8 and KDO 9. Its main schema is depicted in Fig. 2. Similarly to the Web Annotation Model 10, it enables to relate text and video or audio streams to entities and categories in the knowledge base. In this work, we refrained from using the Web Annotation Model to reduce the amount of unnecessary blank nodes and thus, at query time, joins. For each entity annotation, the predicates that define the start and end positions of the entity mention are used in a flexible manner and may define character positions, in the case of text, or milliseconds/frame numbers in case of audio/video. Each category annotation captures one topic of the media content. In any case, each entity mentioned in or each topic covered by any media item should relate to a resource in the knowledge base, namely an entity or a category in DBpedia. Based on the schema shown in Fig. 2, we model the annotated media data as RDF triples, which are available online as RDF dumps for further processing and analysis. An example of annotated social media item modeled by RDF is shown in Fig. 3. A SPARQL endpoint is provided for querying the annotated media data. This enables restrictions and aggregates on multiple modalities, languages and

7 7 (a) (b) Fig. 4. Examples of SPARQL queries for (a) Brexit data retrieval and (b) analytics. media sources as well as a combination with additional background knowledge in the knowledge base, which will be discussed in Sec Applications The availability of the BreXLiMe dataset can facilitate many applications for the use case of Brexit. To demonstrate this, we present BreXearch 11, a semantic search system for cross-lingual and cross-media data retrieval and analytics for Brexit. In the following, we show two major features of BreXearch. Brexit Data Retrieval. Modern search engines are limited in their semantic processing capabilities: the retrieved Web content has to be in the same language as the search keywords and cannot be integrated across different media channels. BreXearch aims to break the barriers in between languages and modalities for a seamless semantic access to media data regarding Brexit. Through the semantic integration of BreXLiMe on multiple modalities, languages and media sources, BreXearch supports cross-lingual and cross-media Brexit data retrieval by means of both entities and categories. For example, to find the latest 100 media items in German about the subject Immigration to Europe from all three media channels, the SPARQL query in Fig. 4 (a) can be used to retrieve the results. Brexit Data Analytics. Advanced data analytics in the media domain has become a major necessity, which currently cannot be supported by modern search engines. Using the knowledge extracted by BreXLiMe from different media and languages in combination with additional background knowledge in DBepdia, BreXearch allows us to ask complex questions regarding Brexit, such as Which 11

8 8 politicians from the Conservative Party of UK were most present in social media in the last two weeks before the Brexit referendum in different languages?, which can be answered by the SPARQL query as shown in Fig. 4 (b). More importantly, based on the varieties of BreXLiMe, BreXearch provides us the ability to study differences and commonalities among these media channels and languages. 5 Conclusions In this paper, we present BreXLiMe, a semantically enriched dataset regarding Brexit supporting different media and languages. Besides the dataset itself, the BreXLiMe development methodology provides a general solution to cross-lingual and cross-media knowledge extraction from various multilingual media sources. In addition, the data model used by BreXLiMe for describing and integrating knowledge extracted from different media and languages is based on RDF and Linked Open Data standards and thus can serve as a blueprint for publishing other datasets in the media domain. Furthermore, our semantic search system BreXearch shows that BreXLiMe can serve as a data basis for applications and studies on the case of Brexit. As for the future work, we would like to provide datasets for research based on other use cases, such as the US president election in 2016, by applying the presented development methodology and data modeling. References 1. Howard, P.N., Kollanyi, B.: Bots, #strongerin, and #brexit: Computational propaganda during the UK-EU referendum. CoRR abs/ (2016) 2. Llewellyn, C., Cram, L.: Brexit? analyzing opinion on the UK-EU referendum within twitter. In: Proceedings of the Tenth International Conference on Web and Social Media, Cologne, Germany, May 17-20, (2016) Khatua, A., Khatua, A.: Leave or remain? deciphering brexit deliberations on twitter. In: IEEE International Conference on Data Mining Workshops, ICDM Workshops 2016, December 12-15, 2016, Barcelona, Spain. (2016) Hürlimann, M., Davis, B., Cortis, K., Freitas, A., Handschuh, S., Fernández, S.: A twitter sentiment gold standard for the brexit referendum. In: Proceedings of the 12th International Conference on Semantic Systems, SEMANTICS 2016, Leipzig, Germany, September 12-15, (2016) Vicario, M.D., Zollo, F., Caldarelli, G., Scala, A., Quattrociocchi, W.: The anatomy of brexit debate on facebook. CoRR abs/ (2016) 6. Lansdall-Welfare, T., Dzogang, F., Cristianini, N.: Change-point analysis of the public mood in UK twitter during the brexit referendum. In: IEEE International Conference on Data Mining Workshops, ICDM Workshops 2016, December 12-15, 2016, Barcelona, Spain. (2016) Celli, F., Stepanov, E.A., Poesio, M., Riccardi, G.: Predicting Brexit: Classifying Agreement is Better than Sentiment and Pollsters. In: The Workshop on Computational Modeling of People s Opinions, Personality, and Emotions in Social Media. (2016)

9 9 8. Zhang, L., Thalhammer, A., Rettinger, A., Farber, M., Mogadala, A., Denaux, R.: The xlime system: Cross-lingual and cross-modal semantic annotation, search and recommendation over live-tv, news and social media streams. Web Semantics: Science, Services and Agents on the World Wide Web (2017) 9. Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - A crystallization point for the Web of Data. J. Web Sem. 7(3) (2009) Zhang, L., Dong, Y., Rettinger, A.: Towards entity correctness, completeness and emergence for entity recognition. In: WWW (Companion Volume). (2015) Zhang, L., Rettinger, A.: X-LiSA: Cross-lingual Semantic Annotation. PVLDB 7(13) (2014) Zhang, L., Xu, Y., Rettinger, A.: A joint method for entity linking and text categorization by exploiting knowledge bases. Technical report, Institut AIFB, KIT, el tc.pdf (2017)

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia-An Advancement Towards Content Extraction From Wikipedia DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting

More information

Open-Source Natural Language Processing and Computational Archival Science

Open-Source Natural Language Processing and Computational Archival Science Open-Source Natural Language Processing and Computational Archival Science Kalina Bontcheva University of Sheffield @kbontcheva The University of Sheffield, 1995-2018 This work is licensed under the Creative

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A proof of concept in the Open Educational Resources domain Abstract One of the reasons why Web resources

More information

Linking Entities in Chinese Queries to Knowledge Graph

Linking Entities in Chinese Queries to Knowledge Graph Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn

More information

Using Linked Data to Reduce Learning Latency for e-book Readers

Using Linked Data to Reduce Learning Latency for e-book Readers Using Linked Data to Reduce Learning Latency for e-book Readers Julien Robinson, Johann Stan, and Myriam Ribière Alcatel-Lucent Bell Labs France, 91620 Nozay, France, Julien.Robinson@alcatel-lucent.com

More information

The Open Government Data Stakeholder Survey

The Open Government Data Stakeholder Survey The Open Government Data Stakeholder Survey Michael Martin+, Martin Kaltenböck*, Helmut Nagy*, and Sören Auer+ + Universität Leipzig, Institut für Informatik, AKSW, Postfach 100920, D-04009 Leipzig, Germany,

More information

LinDA: A Service Infrastructure for Linked Data Analysis and Provision of Data Statistics

LinDA: A Service Infrastructure for Linked Data Analysis and Provision of Data Statistics LinDA: A Service Infrastructure for Linked Data Analysis and Provision of Data Statistics Nicolas Beck, Stefan Scheglmann, and Thomas Gottron WeST Institute for Web Science and Technologies University

More information

3 Publishing Technique

3 Publishing Technique Publishing Tool 32 3 Publishing Technique As discussed in Chapter 2, annotations can be extracted from audio, text, and visual features. The extraction of text features from the audio layer is the approach

More information

Query Expansion using Wikipedia and DBpedia

Query Expansion using Wikipedia and DBpedia Query Expansion using Wikipedia and DBpedia Nitish Aggarwal and Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway firstname.lastname@deri.org

More information

3XL News: a Cross-lingual News Aggregator and Reader

3XL News: a Cross-lingual News Aggregator and Reader 3XL News: a Cross-lingual News Aggregator and Reader Evgenia Belyaeva 12, Jan Berčič 1, Katja Berčič 1, Flavio Fuart 1, Aljaž Košmerlj 1, Andrej Muhič 1, Aljoša Rehar 3, Jan Rupnik 1, and Mitja Trampuš

More information

Development of an Ontology-Based Portal for Digital Archive Services

Development of an Ontology-Based Portal for Digital Archive Services Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw

More information

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data FedX: A Federation Layer for Distributed Query Processing on Linked Open Data Andreas Schwarte 1, Peter Haase 1,KatjaHose 2, Ralf Schenkel 2, and Michael Schmidt 1 1 fluid Operations AG, Walldorf, Germany

More information

Enhancing applications with Cognitive APIs IBM Corporation

Enhancing applications with Cognitive APIs IBM Corporation Enhancing applications with Cognitive APIs After you complete this section, you should understand: The Watson Developer Cloud offerings and APIs The benefits of commonly used Cognitive services 2 Watson

More information

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data Ahsan Morshed Intelligent Sensing and Systems Laboratory, CSIRO, Hobart, Australia {ahsan.morshed, ritaban.dutta}@csiro.au

More information

Social media as a data source for research

Social media as a data source for research Social media as a data source for research Neli Blagus, Slavko Žitnik and Marko Bajec University of Ljubljana Faculty for computer and information science 17 April 2018 Neli Blagus et al. (FRI) Social

More information

August 2012 Daejeon, South Korea

August 2012 Daejeon, South Korea Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions

More information

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty Thomas Schandl, Andreas Blumauer punkt. NetServices GmbH, Lerchenfelder Gürtel 43, 1160 Vienna, Austria

More information

Linking Thesauri to the Linked Open Data Cloud for Improved Media Retrieval

Linking Thesauri to the Linked Open Data Cloud for Improved Media Retrieval biblio.ugent.be The UGent Institutional Repository is the electronic archiving and dissemination platform for all UGent research publications. Ghent University has implemented a mandate stipulating that

More information

Topic Classification in Social Media using Metadata from Hyperlinked Objects

Topic Classification in Social Media using Metadata from Hyperlinked Objects Topic Classification in Social Media using Metadata from Hyperlinked Objects Sheila Kinsella 1, Alexandre Passant 1, and John G. Breslin 1,2 1 Digital Enterprise Research Institute, National University

More information

ANNUAL REPORT Visit us at project.eu Supported by. Mission

ANNUAL REPORT Visit us at   project.eu Supported by. Mission Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing

More information

Visual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017

Visual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017 Visual Concept Detection and Linked Open Data at the TIB AV- Portal Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017 Agenda 1. TIB and TIB AV-Portal 2. Automated Video Analysis 3. Visual

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A implementation to classify Open Educational Resources Janneth Chicaiza, Nelson Piedra and Jorge López Universidad

More information

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base

Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base Andias Wira-Alam and Brigitte Mathiak GESIS - Leibniz-Institute for the Social Sciences Unter Sachsenhausen 6-8, 50667 Köln,

More information

TISA Methodology Threat Intelligence Scoring and Analysis

TISA Methodology Threat Intelligence Scoring and Analysis TISA Methodology Threat Intelligence Scoring and Analysis Contents Introduction 2 Defining the Problem 2 The Use of Machine Learning for Intelligence Analysis 3 TISA Text Analysis and Feature Extraction

More information

Man vs. Machine Dierences in SPARQL Queries

Man vs. Machine Dierences in SPARQL Queries Man vs. Machine Dierences in SPARQL Queries Laurens Rietveld 1 and Rinke Hoekstra 1,2 1 Department of Computer Science, VU University Amsterdam, The Netherlands {laurens.rietveld,rinke.hoekstra}@vu.nl

More information

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2016 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020

More information

Enriching an Academic Knowledge base using Linked Open Data

Enriching an Academic Knowledge base using Linked Open Data Enriching an Academic Knowledge base using Linked Open Data Chetana Gavankar 1,2 Ashish Kulkarni 1 Yuan Fang Li 3 Ganesh Ramakrishnan 1 (1) IIT Bombay, Mumbai, India (2) IITB-Monash Research Academy, Mumbai,

More information

SemSor: Combining Social and Semantic Web to Support the Analysis of Emergency Situations

SemSor: Combining Social and Semantic Web to Support the Analysis of Emergency Situations SemSor: Combining Social and Semantic Web to Support the Analysis of Emergency Situations Philipp Heim philipp.heim@vis.unistuttgart.de Dennis Thom dennis.thom@vis.unistuttgart.de Thomas Ertl thomas.ertl@vis.unistuttgart.de

More information

16th International World Wide Web Conference Developers Track, May 11, DBpedia. Querying Wikipedia like a Database

16th International World Wide Web Conference Developers Track, May 11, DBpedia. Querying Wikipedia like a Database 16th International World Wide Web Conference Developers Track, May 11, 2007 DBpedia Querying Wikipedia like a Database Christian Bizer, Freie Universität Berlin Sören Auer, Universität Leipzig Georgi Kobilarov,

More information

TRENTINOMEDIA: Exploiting NLP and Background Knowledge to Browse a Large Multimedia News Store

TRENTINOMEDIA: Exploiting NLP and Background Knowledge to Browse a Large Multimedia News Store TRENTINOMEDIA: Exploiting NLP and Background Knowledge to Browse a Large Multimedia News Store Roldano Cattoni 1, Francesco Corcoglioniti 1,2, Christian Girardi 1, Bernardo Magnini 1, Luciano Serafini

More information

FAGI-gis: A tool for fusing geospatial RDF data

FAGI-gis: A tool for fusing geospatial RDF data FAGI-gis: A tool for fusing geospatial RDF data Giorgos Giannopoulos 1, Nick Vitsas 1, Nikos Karagiannakis 1, Dimitrios Skoutas 1, and Spiros Athanasiou 1 IMIS Institute, Athena Research Center Abstract.

More information

TIB AV-Portal. Margret Plank 19th of January 2015 TACC Meeting

TIB AV-Portal. Margret Plank 19th of January 2015 TACC Meeting TIB AV-Portal Margret Plank 19th of January 2015 TACC Meeting German National Library of Science and Technology (TIB) German National Library of Science and Technology for all areas of engineering as well

More information

Context Sensitive Search Engine

Context Sensitive Search Engine Context Sensitive Search Engine Remzi Düzağaç and Olcay Taner Yıldız Abstract In this paper, we use context information extracted from the documents in the collection to improve the performance of the

More information

CriES 2010

CriES 2010 CriES Workshop @CLEF 2010 Cross-lingual Expert Search - Bridging CLIR and Social Media Institut AIFB Forschungsgruppe Wissensmanagement (Prof. Rudi Studer) Organizing Committee: Philipp Sorg Antje Schultz

More information

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS Nilam B. Lonkar 1, Dinesh B. Hanchate 2 Student of Computer Engineering, Pune University VPKBIET, Baramati, India Computer Engineering, Pune University VPKBIET,

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Omar Qawasmeh 1, Maxime Lefranois 2, Antoine Zimmermann 2, Pierre Maret 1 1 Univ. Lyon, CNRS, Lab. Hubert Curien UMR

More information

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library SWIB14 Semantic Web in Libraries Conference Bonn, December 2014 Outline Introduction to The

More information

Specific requirements on the da ra metadata schema

Specific requirements on the da ra metadata schema Specific requirements on the da ra metadata schema Nicole Quitzsch GESIS - Leibniz Institute for the Social Sciences Workshop: Metadata and Persistent Identifiers for Social and Economic Data 07-08 May

More information

Demo: Linked Open Statistical Data for the Scottish Government

Demo: Linked Open Statistical Data for the Scottish Government Demo: Linked Open Statistical Data for the Scottish Government Bill Roberts 1 1 Swirrl IT Limited http://swirrl.com Abstract. This paper describes the approach taken by the Scottish Government, supported

More information

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala Master Project Various Aspects of Recommender Systems May 2nd, 2017 Master project SS17 Albert-Ludwigs-Universität Freiburg Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue

More information

COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE

COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE Project Acronym: CENDARI Project Grant No.: 284432 Theme: FP7-INFRASTRUCTURES-2011-1 Project Start Date: 01 February 2012 Project End Date: 31 January

More information

Social Business Intelligence in Action

Social Business Intelligence in Action Social Business Intelligence in ction Matteo Francia, nrico Gallinucci, Matteo Golfarelli, Stefano Rizzi DISI University of Bologna, Italy Introduction Several Social-Media Monitoring tools are available

More information

Wikipedia Infobox Type Prediction Using Embeddings

Wikipedia Infobox Type Prediction Using Embeddings Wikipedia Infobox Type Prediction Using Embeddings Russa Biswas 1,2, Rima Türker 1,2, Farshad Bakhshandegan-Moghaddam 1,2, Maria Koutraki 1,2, and Harald Sack 1,2 1 FIZ Karlsruhe Leibniz Institute for

More information

Available online at ScienceDirect. Procedia Computer Science 52 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 52 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 52 (2015 ) 1071 1076 The 5 th International Symposium on Frontiers in Ambient and Mobile Systems (FAMS-2015) Health, Food

More information

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad

More information

Open Government Data on the Web: A Semantic Approach

Open Government Data on the Web: A Semantic Approach 1 Open Government Data on the Web: A Semantic Approach Julia Hoxha Institute of Applied Informatics and Formal Description Methods (AIFB) Karlsruhe Institute of Technology, Germany Julia.Hoxha@kit.edu

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

NLP Final Project Fall 2015, Due Friday, December 18

NLP Final Project Fall 2015, Due Friday, December 18 NLP Final Project Fall 2015, Due Friday, December 18 For the final project, everyone is required to do some sentiment classification and then choose one of the other three types of projects: annotation,

More information

Linked Data Evolving the Web into a Global Data Space

Linked Data Evolving the Web into a Global Data Space Linked Data Evolving the Web into a Global Data Space Anja Jentzsch, Freie Universität Berlin 05 October 2011 EuropeanaTech 2011, Vienna 1 Architecture of the classic Web Single global document space Web

More information

Cluster-based Instance Consolidation For Subsequent Matching

Cluster-based Instance Consolidation For Subsequent Matching Jennifer Sleeman and Tim Finin, Cluster-based Instance Consolidation For Subsequent Matching, First International Workshop on Knowledge Extraction and Consolidation from Social Media, November 2012, Boston.

More information

DBpedia Extracting structured data from Wikipedia

DBpedia Extracting structured data from Wikipedia DBpedia Extracting structured data from Wikipedia Anja Jentzsch, Freie Universität Berlin Köln. 24. November 2009 DBpedia DBpedia is a community effort to extract structured information from Wikipedia

More information

Theme Identification in RDF Graphs

Theme Identification in RDF Graphs Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets Arjumand Younus 1,2, Colm O Riordan 1, and Gabriella Pasi 2 1 Computational Intelligence Research Group,

More information

SPARQL Protocol And RDF Query Language

SPARQL Protocol And RDF Query Language SPARQL Protocol And RDF Query Language John Julian Carstens March 15, 2012 1 Introduction Beyond doubt, the world wide web has become central to the business reality of companies and to the personal reality

More information

Durchblick - A Conference Assistance System for Augmented Reality Devices

Durchblick - A Conference Assistance System for Augmented Reality Devices Durchblick - A Conference Assistance System for Augmented Reality Devices Anas Alzoghbi 1, Peter M. Fischer 1, Anna Gossen 2, Peter Haase 2, Thomas Hornung 1, Beibei Hu 2, Georg Lausen 1, Christoph Pinkel

More information

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM 1 1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL In the context of federation of repositories of Semantic Interoperability s, a number of entities are relevant. The primary entities to be described by ADMS are the

More information

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Dong Han and Kilian Stoffel Information Management Institute, University of Neuchâtel Pierre-à-Mazel 7, CH-2000 Neuchâtel,

More information

Reimplementing the Mathematics Subject Classification (MSC) as a Linked Open Dataset

Reimplementing the Mathematics Subject Classification (MSC) as a Linked Open Dataset Reimplementing the Mathematics Subject Classification (MSC) as a Linked Open Dataset Christoph Lange 1,2,3, Patrick Ion 4,5, Anastasia Dimou 5, Charalampos Bratsas 5, Joseph Corneli 6, Wolfram Sperber

More information

University of Rome Tor Vergata DBpedia Manuel Fiorelli

University of Rome Tor Vergata DBpedia Manuel Fiorelli University of Rome Tor Vergata DBpedia Manuel Fiorelli fiorelli@info.uniroma2.it 07/12/2017 2 Notes The following slides contain some examples and pictures taken from: Lehmann, J., Isele, R., Jakob, M.,

More information

Topics in Opinion Mining. Dr. Paul Buitelaar Data Science Institute, NUI Galway

Topics in Opinion Mining. Dr. Paul Buitelaar Data Science Institute, NUI Galway Topics in Opinion Mining Dr. Paul Buitelaar Data Science Institute, NUI Galway Opinion: Sentiment, Emotion, Subjectivity OBJECTIVITY SUBJECTIVITY SPECULATION FACTS BELIEFS EMOTION SENTIMENT UNCERTAINTY

More information

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated

More information

ONTOLOGY MODELLING OF MULTIMODALITY IMAGE RETRIEVAL SYSTEM FOR SPORT NEWS DOMAIN

ONTOLOGY MODELLING OF MULTIMODALITY IMAGE RETRIEVAL SYSTEM FOR SPORT NEWS DOMAIN ONTOLOGY MODELLING OF MULTIMODALITY IMAGE RETRIEVAL SYSTEM FOR SPORT NEWS DOMAIN Y. I. A. Khalid and S. A. Noah Knowledge Technology Research Group, Faculty of Information Science, Universiti Kebangsaan

More information

Exploring and Using the Semantic Web

Exploring and Using the Semantic Web Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring

More information

Available online at ScienceDirect. Procedia Computer Science 60 (2015 )

Available online at   ScienceDirect. Procedia Computer Science 60 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 60 (2015 ) 1720 1727 19th International Conference on Knowledge Based and Intelligent Information and Engineering Systems

More information

PROJECT PERIODIC REPORT

PROJECT PERIODIC REPORT PROJECT PERIODIC REPORT Grant Agreement number: 257403 Project acronym: CUBIST Project title: Combining and Uniting Business Intelligence and Semantic Technologies Funding Scheme: STREP Date of latest

More information

Master projects, internships and other opportunities

Master projects, internships and other opportunities Master projects, internships and other opportunities Utrecht, February 2011 2/6/2011 1 PROPOSAL 1. Table of Contents 1. Table of Contents... 2 2. About Teezir... 3 3. Automatically detect emerging topics

More information

Kuphi An Investigation Tool for Searching for and via Semantic Relations

Kuphi An Investigation Tool for Searching for and via Semantic Relations Kuphi An Investigation Tool for Searching for and via Semantic Relations Michael Färber, Lei Zhang, Achim Rettinger Karlsruhe Institute of Technology (KIT), 76131 Karlsruhe, Germany {michael.faerber,l.zhang,rettinger}@kit.edu

More information

BOTS, TROLLS AND SOCIAL BOTS: THE GOOD, THE BAD AND THE UGLY

BOTS, TROLLS AND SOCIAL BOTS: THE GOOD, THE BAD AND THE UGLY BOTS, TROLLS AND SOCIAL BOTS: THE GOOD, THE BAD AND THE UGLY Juan Carlos Medina Serrano Data Scientist/PhD Candidate Political Data Science Technische Universität München Contents Bots The Bad and the

More information

Linking library data: contributions and role of subject data. Nuno Freire The European Library

Linking library data: contributions and role of subject data. Nuno Freire The European Library Linking library data: contributions and role of subject data Nuno Freire The European Library Outline Introduction to The European Library Motivation for Linked Library Data The European Library Open Dataset

More information

NERITS - A Machine Translation Mashup System Using Wikimeta and DBpedia. NEBHI, Kamel, NERIMA, Luka, WEHRLI, Eric. Abstract

NERITS - A Machine Translation Mashup System Using Wikimeta and DBpedia. NEBHI, Kamel, NERIMA, Luka, WEHRLI, Eric. Abstract Proceedings Chapter NERITS - A Machine Translation Mashup System Using Wikimeta and DBpedia NEBHI, Kamel, NERIMA, Luka, WEHRLI, Eric Abstract Recently, Machine Translation (MT) has become a quite popular

More information

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Dipak J Kakade, Nilesh P Sable Department of Computer Engineering, JSPM S Imperial College of Engg. And Research,

More information

Linked European Television Heritage

Linked European Television Heritage Linked European Television Heritage Editor(s): Name Surname, University, Country Solicited review(s): Name Surname, University, Country Open review(s): Name Surname, University, Country Nikolaos Simou

More information

Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics

Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics Media Intelligence Business intelligence (BI) Uses data mining techniques and tools for the transformation of raw data into meaningful

More information

Financial Dataspaces: Challenges, Approaches and Trends

Financial Dataspaces: Challenges, Approaches and Trends Financial Dataspaces: Challenges, Approaches and Trends Finance and Economics on the Semantic Web (FEOSW), ESWC 27 th May, 2012 Seán O Riain ebusiness Copyright 2009. All rights reserved. Motivation Changing

More information

Entity-centric Topic Extraction and Exploration: A Network-based Approach

Entity-centric Topic Extraction and Exploration: A Network-based Approach Entity-centric Topic Extraction and Exploration: A Network-based Approach Andreas Spitz and Michael Gertz March 27, 2018 ECIR 2018, Grenoble Heidelberg University, Germany Database Systems Research Group

More information

A Korean Knowledge Extraction System for Enriching a KBox

A Korean Knowledge Extraction System for Enriching a KBox A Korean Knowledge Extraction System for Enriching a KBox Sangha Nam, Eun-kyung Kim, Jiho Kim, Yoosung Jung, Kijong Han, Key-Sun Choi KAIST / The Republic of Korea {nam.sangha, kekeeo, hogajiho, wjd1004109,

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information

VOICE ENABLING THE AUTOSCOUT24 CAR SEARCH APP. 1 Introduction. 2 Previous work

VOICE ENABLING THE AUTOSCOUT24 CAR SEARCH APP. 1 Introduction. 2 Previous work VOICE ENABLING THE AUTOSCOUT24 CAR SEARCH APP Felix Burkhardt*, Jianshen Zhou*, Stefan Seide*, Thomas Scheerbarth*, Bernd Jäkel+ and Tilman Buchner+ * Deutsche Telekom Laboratories, Berlin, + AutoScout24,

More information

Semantic Multimedia Information Retrieval Based on Contextual Descriptions

Semantic Multimedia Information Retrieval Based on Contextual Descriptions Semantic Multimedia Information Retrieval Based on Contextual Descriptions Nadine Steinmetz and Harald Sack Hasso Plattner Institute for Software Systems Engineering, Potsdam, Germany, nadine.steinmetz@hpi.uni-potsdam.de,

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Semantic Interoperability of Basic Data in the Italian Public Sector Giorgia Lodi

Semantic Interoperability of Basic Data in the Italian Public Sector Giorgia Lodi Semantic Interoperability of Basic Data in the Italian Public Sector Giorgia Lodi SEMIC conference 2013 21 June 2013 Dublin (AgID) Created last year, AgID is a public body that merged three different existing

More information

How Co-Occurrence can Complement Semantics?

How Co-Occurrence can Complement Semantics? How Co-Occurrence can Complement Semantics? Atanas Kiryakov & Borislav Popov ISWC 2006, Athens, GA Semantic Annotations: 2002 #2 Semantic Annotation: How and Why? Information extraction (text-mining) for

More information

Programming the Semantic Web

Programming the Semantic Web Programming the Semantic Web Steffen Staab, Stefan Scheglmann, Martin Leinberger, Thomas Gottron Institute for Web Science and Technologies, University of Koblenz-Landau, Germany Abstract. The Semantic

More information

Automated Visualization Support for Linked Research Data

Automated Visualization Support for Linked Research Data Automated Visualization Support for Linked Research Data Belgin Mutlu 1, Patrick Hoefler 1, Vedran Sabol 1, Gerwald Tschinkel 1, and Michael Granitzer 2 1 Know-Center, Graz, Austria 2 University of Passau,

More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs Social Web Communities Conference or Workshop Item How to cite: Alani, Harith; Staab, Steffen and

More information

XETA: extensible metadata System

XETA: extensible metadata System XETA: extensible metadata System Abstract: This paper presents an extensible metadata system (XETA System) which makes it possible for the user to organize and extend the structure of metadata. We discuss

More information

Search Engines. Information Retrieval in Practice

Search Engines. Information Retrieval in Practice Search Engines Information Retrieval in Practice All slides Addison Wesley, 2008 Beyond Bag of Words Bag of Words a document is considered to be an unordered collection of words with no relationships Extending

More information

An open source GATE toolkit for social media analysis. Diana Maynard University of Sheffield, UK

An open source GATE toolkit for social media analysis. Diana Maynard University of Sheffield, UK An open source GATE toolkit for social media analysis Diana Maynard University of Sheffield, UK GATE for text engineering Tool for developing and deployment of Text Mining technology http://gate.ac.uk

More information

The Dacura Data Curation System

The Dacura Data Curation System The Dacura Data Curation System Kevin Feeney (B) Knowledge and Data Engineering Group, School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland kevin.feeney@cs.tcd.ie Abstract.

More information

Harvesting Open Government Data with DCAT-AP

Harvesting Open Government Data with DCAT-AP Fraunhofer FOKUS Institute for Open Communication Systems AlanDavidRobb / Pixabay (CC0) Harvesting Open Government Data with DCAT-AP Fabian Kirstein, 21. March 2018 The European Data Portal offers more

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN: IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T

More information

The Emerging Web of Linked Data

The Emerging Web of Linked Data 4th Berlin Semantic Web Meetup 26. February 2010 The Emerging Web of Linked Data Prof. Dr. Christian Bizer Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data Web APIs and Linked

More information

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac 14 th August 2014, IFLA2014 satellite meeting, Paris The European

More information

Share.TEC System Architecture

Share.TEC System Architecture Share.TEC System Architecture Krassen Stefanov 1, Pavel Boytchev 2, Alexander Grigorov 3, Atanas Georgiev 4, Milen Petrov 5, George Gachev 6, and Mihail Peltekov 7 1,2,3,4,5,6,7 Faculty of Mathematics

More information

Give me the info I need now Setting the text analysis strategy at Belga News Agency

Give me the info I need now Setting the text analysis strategy at Belga News Agency Give me the info I need now Setting the text analysis strategy at Belga News Agency Tom Wuytack ICT Manager, wut@belga.be Powered by Belga News Agency 2015 A few words on Belga News Agency Each day the

More information