Exploring Search Log Data. Theodora Tsikrika University of Applied Sciences Western Switzerland (HES-SO) Switzerland

1 Exploring Search Log Data Theodora Tsikrika University of Applied Sciences Western Switzerland (HES-SO) Switzerland University of Copenhagen, February 22, 2012 CLEF 2011, Sept 21,

2 Sierre, Switzerland

3 HES-SO Sierre: 1,500 students. Institutes: Business Information Systems, Economy, Tourism. Research in focussed domains: Internet of things, RFID; Mobile applications; Energy, Green ICT; SAP centre; eHealth; Information Retrieval.

4 MedGIFT research group: Henning Müller (Professor); Theodora Tsikrika (Postdoc); Antonio Foncubierta (Ph.D. student); Adrien Depeursinge (Postdoc); Dimitrios Markonis (Ph.D. student); Manfredo Atzori (Postdoc); Alba Garcia (Ph.D. student); Alexandre Cotting (Project manager); Ivan Eggel (Developer); Alejandro Vargas (Geneva) (Medical Doctor); Roger Schaer (Developer).

5 MedGIFT research & projects: Medical (multidimensional) image analysis and retrieval; Multimedia information retrieval; Information retrieval evaluation; Test collection creation (including images and signals); User testing and task analysis; Infrastructures for computation.

6 Exploring search log data : researcher at CWI, Amsterdam, The Netherlands : Database Architectures and Information Access group : Interactive Information Access group. VITALAS: Video & image Indexing and Retrieval in the Large Scale (FP6 IP): use-case driven project that built a prototype system dedicated to intelligent access services to multimedia professional archives; advanced solutions for indexing, searching and accessing large-scale multimedia content that was not previously (or only partly) annotated; novel contributions in cross-media (audio/speech, video, image, text) indexing, content enrichment, and interactive retrieval methods.

13 Search log data. Examine users' information searching behaviour: unobtrusive / naturalistic settings; broad range of user-system interactions / significant time periods; large amounts of data / sizable number of users. No qualitative user aspects: context, situation, decision process, user satisfaction remain implicit. Benefits: understand system usage; improve user experience and system effectiveness. Exploitation: core ranking / automatic query expansion / ad matching / user modelling; Web caching; query assistance (dynamic query suggestions as you type, query recommendations). Media Search cluster meeting

14 Overview: Motivation; Search log analysis; Semantic search log analysis method; Study on professional image search log data; Query recommendation; Exploitation of clickthrough data; Image annotation; Image search; Conclusions

15 Search log analysis. Logged data: timestamp, sessionid, userid, query, clicks. Level of analysis: term, query, session. Analysis of user behavioural patterns: query submission (formulation); query modification at the syntactic level (term-based) and at the semantic level.

16 Term-based query modification analysis. Addition = specification: 'University of Copenhagen' → 'University of Copenhagen ranking'. Elimination = generalisation: 'University of Copenhagen ranking' → 'University of Copenhagen'. Substitution = reformulation: 'University of Copenhagen' → 'University of Aarhus'. Lexical variation: 'University of Copenhagen' → 'Universities of Copenhagen'.
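
The four term-based classes above can be illustrated with a minimal Python sketch. This is not the speaker's implementation: the function names, the "identical" label, and the crude plural-stripping stemmer used to detect lexical variation are all assumptions made for illustration.

```python
def crude_stem(term: str) -> str:
    """Very rough plural stripping (hypothetical helper, not a real stemmer)."""
    if term.endswith("ies"):
        return term[:-3] + "y"
    if term.endswith("s") and not term.endswith("ss"):
        return term[:-1]
    return term


def classify_modification(q1: str, q2: str) -> str:
    """Classify the change from query q1 to query q2 by comparing term sets."""
    t1, t2 = set(q1.lower().split()), set(q2.lower().split())
    if t1 == t2:
        return "identical"
    # Same terms after stemming: only the word forms differ.
    if {crude_stem(t) for t in t1} == {crude_stem(t) for t in t2}:
        return "lexical variation"
    if t1 < t2:
        return "addition"        # specification
    if t2 < t1:
        return "elimination"     # generalisation
    return "substitution"        # reformulation
```

Because the comparison only looks at terms, 'Vanessa Williams' → 'Serena Williams' is labelled a substitution although the two queries refer to unrelated people, which is the limitation the next slide raises.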

17 Term-based query modification analysis: limitations. Reformulations: 'Posthuma tour france' → 'Posthuma tour 2008'; 'Vanessa Williams' → 'Serena Williams'. Undetermined: 'University of Copenhagen' → 'Royal School of Library Information Science'; 'University of Copenhagen' → 'Denmark'. No semantics! Can we add a semantic dimension? How? Can we exploit the Linked Open Data?

18 Linked Open Data. Entity = URI. RDF triple = encodes what is predicated about specific entities. Example triples: (Subject: Beckham, Predicate: , Object: 'David Beckham'); (Subject: , Predicate: , Object: 'soccer player'); two further triples whose subject, predicate and object are not preserved.
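
As a rough illustration of the triple structure, the following Python sketch stores triples as named tuples. The URIs on the slide were lost, so the prefixed names used here (`dbpedia:David_Beckham`, `rdfs:label`, `rdf:type`, `dbpedia:SoccerPlayer`) are stand-ins chosen for the example, not the slide's actual values.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    subject: str    # entity, identified by a URI
    predicate: str  # property, also a URI
    obj: str        # another entity URI or a literal value


# Hypothetical triples in the spirit of the slide's Beckham example.
TRIPLES = [
    Triple("dbpedia:David_Beckham", "rdfs:label", "David Beckham"),
    Triple("dbpedia:David_Beckham", "rdf:type", "dbpedia:SoccerPlayer"),
    Triple("dbpedia:SoccerPlayer", "rdfs:label", "soccer player"),
]


def objects(triples: List[Triple], subject: str, predicate: str) -> List[str]:
    """Return every object asserted for the given subject and predicate."""
    return [t.obj for t in triples
            if t.subject == subject and t.predicate == predicate]
```

Following `rdf:type` and then `rdfs:label` from the Beckham entity leads to the literal 'soccer player', which is how a query string can be connected to a type.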

22 Linked Open Data cloud [figure: number of sources and billions of triples; the values are not preserved]

23 Semantic search log analysis method. Input: list of search sessions (queries, query pairs); RDF triples from Linked Open Data sources. Output: query types + relative frequencies; query modification patterns + support & confidence values. Steps: 1. Map queries from logs to entities in RDF triples (rdfs:label). 2. Determine types of entities and count occurrence frequencies. 3. Determine semantic relations between entities of query pairs. 4. Abstract semantic relations → semantic patterns. 5. Count occurrence frequencies. 6. Rank semantic patterns based on their support and confidence.
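
Steps 1 and 2 can be sketched as follows, assuming triples are plain `(subject, predicate, object)` tuples, that queries match labels after lower-casing (the talk does not specify the matching rule), and that types are given by an `rdf:type` predicate. All function names are hypothetical.

```python
from collections import Counter


def build_label_index(triples):
    """Step 1 helper: map each lower-cased rdfs:label to the entities carrying it."""
    index = {}
    for s, p, o in triples:
        if p == "rdfs:label":
            index.setdefault(o.lower(), set()).add(s)
    return index


def query_type_frequencies(queries, triples):
    """Steps 1-2: match queries to entities, then return relative type frequencies."""
    index = build_label_index(triples)
    types = {}
    for s, p, o in triples:
        if p == "rdf:type":
            types.setdefault(s, set()).add(o)
    counts = Counter()
    for query in queries:
        for entity in index.get(query.strip().lower(), ()):
            counts.update(types.get(entity, ()))
    total = sum(counts.values())
    return {t: n / total for t, n in counts.items()} if total else {}
```

Queries without a matching label simply contribute nothing, which mirrors the fact that only part of the log can be typed.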

27 Abstract semantic relations → semantic patterns. Query pair 'David Beckham' → 'Joe Cole': DBPedia:David_Beckham -DBPedia:Nationalteam→ DBPedia:England_national_football_team ←DBPedia:Nationalteam- DBPedia:Joe_Cole; pattern: Q1 -DBPedia:Nationalteam→ X ←DBPedia:Nationalteam- Q2. Query pair 'Nicolas Sarkozy' → 'Carla Bruni': DBPedia:Nicolas_Sarkozy -DBPedia:spouse→ DBPedia:Carla_Bruni; pattern: Q1 -DBPedia:spouse→ Q2.
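
A minimal sketch of this abstraction step in Python, assuming triples are `(subject, predicate, object)` tuples and considering only direct relations and sibling relations through one shared node X. The string notation for patterns and the function name are choices made for this example; the original method may cover more relation shapes.

```python
def find_patterns(e1, e2, triples):
    """Return the abstract patterns connecting entity e1 (query 1) to e2 (query 2)."""
    out1 = {(p, o) for s, p, o in triples if s == e1}
    out2 = {(p, o) for s, p, o in triples if s == e2}
    patterns = set()
    # Direct relations in either direction.
    for p, o in out1:
        if o == e2:
            patterns.add(f"Q1 -{p}-> Q2")
    for p, o in out2:
        if o == e1:
            patterns.add(f"Q1 <-{p}- Q2")
    # Sibling relations: both entities point to the same node X.
    for p1, x in out1:
        for p2, y in out2:
            if x == y and x not in (e1, e2):
                patterns.add(f"Q1 -{p1}-> X <-{p2}- Q2")
    return patterns
```

The concrete entities are replaced by the placeholders Q1, Q2 and X, so that the Beckham/Cole pair and any other pair of team-mates produce the same pattern and can be counted together.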

28 Rank semantic patterns. Which patterns are the most important? The ones that occur with higher frequency? What if these patterns are not informative and simply occur too often in the linked data? Compute expected frequencies of patterns: compute the frequency of patterns between random queries. Support = relative frequency. Confidence = Support_session / (Support_session + Support_random).
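
The support and confidence computation can be sketched in Python as below. Each query pair is represented by the list of patterns found for it; pairs from sessions and pairs of random queries are passed separately. Sorting by confidence first and support second is an assumption, since the slide only says that both values are used for ranking.

```python
from collections import Counter


def support(pattern_lists):
    """Relative frequency: fraction of query pairs in which each pattern occurs."""
    n = len(pattern_lists)
    counts = Counter(p for patterns in pattern_lists for p in set(patterns))
    return {p: c / n for p, c in counts.items()}


def rank_patterns(session_pairs, random_pairs):
    """Return (pattern, support, confidence) tuples, best first."""
    s_session = support(session_pairs)
    s_random = support(random_pairs)
    scored = []
    for pattern, s in s_session.items():
        # Confidence = Support_session / (Support_session + Support_random)
        confidence = s / (s + s_random.get(pattern, 0.0))
        scored.append((pattern, s, confidence))
    return sorted(scored, key=lambda item: (-item[2], -item[1]))
```

A pattern that is just as common between random queries as within sessions gets a confidence of 0.5, while a pattern never seen between random queries gets 1.0.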

30 Professional image search logs analysis. European news agency: commercial picture portal; millions of photographic images; professional users. Search log data: 10 months; ~1 million queries / 0.5 million sessions. Linked Open Data sources (22 million RDF triples): DBpedia, WordNet, Cornetto, Getty geographical names, Getty Art and Architecture thesaurus.

31 Search log statistics (October 2008 to July 2009)

32 Query frequency distribution

33 Query types: found matching URI for 79% of all queries; identified type for 68% of matched queries (about half of all queries)

34 Query types: conceptual queries, specific queries

35 Query types: DBpedia:Person

36 Query modification patterns: 24% of query pairs classified using the semantic analysis

37 Query modification patterns: identity relation

38 Query modification patterns: partner of a person

39 Query modification patterns: common property

40 Query modification patterns: same type, e.g., tennis players, townships,

41 Query modification patterns: close relation, e.g., prince and princess

42 Query modification classes. Sibling relations: 19% (Q1 -R→ X ←R- Q2), e.g., common property, WordNet hyponyms. Direct few-to-few relations: 10%, e.g., spouse. Other relations: 71%.

43 Term-based query modification analysis

44 Term-based vs. semantic query modification analysis: 25% of query pairs classified using the term-based analysis; 24% of query pairs classified using the semantic analysis; complementary approaches

45 Accuracy of the method. Semantic search log analysis method: 1. Match the query to linked data entities. 2. Determine query types. 3. Identify query modification patterns. Accuracy of query modification pattern identification: 100 query pairs randomly selected; 4 judges identified the most prominent relation for 25 query pairs each (ground truth); 3 raters assessed the patterns identified by the system against the ground truth; system choice classified as incorrect, approximately correct, or correct. Agreement among raters: 0.69 (Fleiss' kappa). Agreement between system and ground truth: 0.61 (lenient mapping). System moderately successful.
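
For readers unfamiliar with the agreement measure, here is a small Python sketch of the standard Fleiss' kappa formula. The input layout (one row per item, one count per category) is an assumption of this example; the talk does not show how the ratings were stored.

```python
def fleiss_kappa(ratings):
    """Fleiss' kappa for a table where ratings[i][j] is the number of raters
    who assigned item i to category j. Every row must sum to the same number
    of raters (at least 2), and more than one category must be in use."""
    n_items = len(ratings)
    n_raters = sum(ratings[0])
    n_cats = len(ratings[0])
    # Share of all assignments that went to each category.
    p_j = [sum(row[j] for row in ratings) / (n_items * n_raters)
           for j in range(n_cats)]
    # Per-item agreement: proportion of agreeing rater pairs.
    p_i = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
           for row in ratings]
    p_bar = sum(p_i) / n_items          # observed agreement
    p_e = sum(p * p for p in p_j)       # agreement expected by chance
    return (p_bar - p_e) / (1 - p_e)
```

With three raters and the categories incorrect, approximately correct and correct, each row would hold three counts summing to 3, for example `[0, 1, 2]`.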

47 Query recommendation: approaches. Existing approaches: document-based methods; search log-based methods: co-occurring queries (Not previously submitted queries? Infrequent queries?); ontology-based methods (Which links to select?); combinations of the above. Based on semantic patterns. Given: a query mapped to concept(s); semantic patterns ranked by their support. Apply patterns to concepts. Suggestions ranked by their support value; ties broken by occurrence frequency in logs.
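
The pattern-based recommendation step might look like the following Python sketch. It is deliberately simplified: only direct patterns of the form Q1 -p→ Q2 are applied, so a pattern is identified by its predicate alone. The function name, the support values and the log frequencies in the usage example are invented for illustration.

```python
def suggest(entity, triples, pattern_support, log_frequency, k=10):
    """Apply direct patterns to an entity and rank the resulting suggestions."""
    best = {}
    for s, p, o in triples:
        if s == entity and p in pattern_support:
            # Keep the highest support among the patterns that yield this suggestion.
            best[o] = max(best.get(o, 0.0), pattern_support[p])
    # Rank by support; break ties by how often the suggestion occurs in the logs.
    ranked = sorted(best, key=lambda e: (-best[e], -log_frequency.get(e, 0), e))
    return ranked[:k]


# Usage with made-up numbers:
example_triples = [
    ("DBPedia:Nicolas_Sarkozy", "DBPedia:spouse", "DBPedia:Carla_Bruni"),
    ("DBPedia:Nicolas_Sarkozy", "DBPedia:birthPlace", "DBPedia:Paris"),
    ("DBPedia:Nicolas_Sarkozy", "DBPedia:party", "DBPedia:UMP"),
]
example_support = {"DBPedia:spouse": 0.02, "DBPedia:birthPlace": 0.01, "DBPedia:party": 0.01}
example_log_frequency = {"DBPedia:Paris": 50, "DBPedia:UMP": 5}
```

Because suggestions come from the linked data rather than from the logs, this step can propose queries that no user has submitted before.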

48 Query recommendation: experiments. Applied approach: baseline: search log-based method, top-10 co-occurring queries in the same session; if suggestions < 10, then add suggestions based on semantic patterns. Datasets: 1,105,766 queries; 332,809 sessions; 80% of sessions used for training (417,633 query pairs); 20% of sessions used for testing (64,767 query pairs); 44 semantic patterns.
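
The fill-up rule of the applied approach is simple enough to sketch directly in Python. The function name is hypothetical; skipping suggestions that the log-based list already contains is an assumption.

```python
def combine_suggestions(log_based, semantic, k=10):
    """Take up to k log-based suggestions, then fill remaining slots with
    pattern-based suggestions that are not already present."""
    combined = list(log_based[:k])
    for suggestion in semantic:
        if len(combined) >= k:
            break
        if suggestion not in combined:
            combined.append(suggestion)
    return combined
```

For frequent queries the log-based list is usually already full, so the semantic suggestions only come into play for rare or unseen queries.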

49 Query recommendation: results. Compared: log-based statistics vs. log-based statistics + semantic patterns. Reported for all queries and for queries that occur 5 times or less (36% of queries): success rate (%) and coverage (%) [values not preserved]. Coverage: # times at least one suggestion is found. Success rate: # times that suggestions include the ground truth. Ground truth = the query immediately following the user's query in a session.
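
The two measures can be computed as in the Python sketch below. Expressing both as percentages of all test query pairs is an assumption; the slide defines them only as counts. `recommend` stands for any function that maps a query to a list of suggestions.

```python
def evaluate(test_pairs, recommend):
    """test_pairs: (query, next_query) tuples taken from the test sessions."""
    covered = hits = 0
    for query, next_query in test_pairs:
        suggestions = recommend(query)
        if suggestions:
            covered += 1                 # at least one suggestion was found
            if next_query in suggestions:
                hits += 1                # suggestions include the ground truth
    n = len(test_pairs)
    return {"coverage": 100.0 * covered / n,
            "success_rate": 100.0 * hits / n}
```

Success rate can never exceed coverage under this definition, since a hit requires a non-empty suggestion list.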

50 Semantic search log analysis V. Hollink, T. Tsikrika, and A. P. de Vries. Semantic Search Log Analysis: a Method and a Study on Professional Image Search. JASIST, 62(4): , V. Hollink, T. Tsikrika, and A. P. de Vries. The semantics of query modification. In Proceedings of the 9th International Conference on Adaptivity, Personalization and Fusion of Heterogeneous Information (RIAO 2010), April 28-30, Paris, France,

52 Concept-based Image Annotation Aim: unambiguously describe the visual content of images Bridge Canal Red houses... Caption : Pretty Copenhagen

53 Concept-based Image Annotation. Challenges when using supervised machine learning techniques: require labelled samples as training data; laborious and expensive task when performed manually; large number of semantic concepts; poor generalisation of concept classifiers in other domains. How can we automatically supplement/replace the manually annotated training samples?

54 Approach. Automatically generate annotated training samples from: user-defined tags (e.g., Flickr); keywords extracted from Web pages where images are embedded; clickthrough data collected in search logs (e.g., traffic). Advantages: large quantities, no user intervention, available to all content owners, collective annotations (assessments). Disadvantages: sparse, noisy, user queries not based on strict visual criteria.

55 Research questions. 1) How can we build classifiers for annotating images with concepts using clickthrough data? Methods for searchlog-based positive sample selection; random negative sample selection. 2) What is the effectiveness of these concept classifiers? Experiments using data provided by BELGA news agency: ~100k photographic images (with their text metadata); clickthrough data.

56 Concept definition. A concept is a clearly defined, non-ambiguous entity represented by a short name, keywords, and a free-text short description. Example concept: Name: traffic. Keywords: traffic, traffic jam, cars, road, highway. Description: Image showing a high density of vehicles when on a road or highway

57 Positive sample selection using search logs. Method exact: select images clicked for queries exactly matching the concept name. Methods textual similarity (based on IR language models): annotate each image with all queries for which it has been clicked; apply stemming (yes/no); select images retrieved for query (i) concept name, (ii) concept keywords, using retrieval model (i) language model (LM), (ii) smoothed LM (LMS). Method clickgraph: images clicked for the same query are likely to be relevant to each other.
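
A minimal Python sketch of the exact method and of the first step of the textual similarity methods, assuming the clickthrough log is available as `(query, image_id)` pairs and that matching is case-insensitive. The language-model retrieval and the clickgraph method are not covered here, and the function names are hypothetical.

```python
from collections import defaultdict


def exact_positives(clicks, concept_name):
    """Method 'exact': images clicked for queries equal to the concept name."""
    name = concept_name.lower()
    return {image for query, image in clicks if query.lower() == name}


def image_query_annotations(clicks):
    """Annotate each image with all queries for which it has been clicked.
    The resulting pseudo-documents could then be indexed and searched with a
    retrieval model using the concept name or keywords as the query."""
    annotations = defaultdict(list)
    for query, image in clicks:
        annotations[image].append(query.lower())
    return dict(annotations)
```

The exact method is precise but sparse; treating the clicked queries as a text document per image lets related queries such as 'traffic jam' contribute as well.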

58 Reliability of clickthrough-based annotations: varies greatly across concepts; around 20% of the total number of concepts for each method reach agreement of at least 0.8.

59 Building concept classifiers. Positive samples for concept c: Nc,m images selected using one of the methods m: exact, textual similarity (6 language modelling variants), clickgraph. Negative samples for concept c: Nc',m = max( Nc,m,, Nc,m) images randomly selected. Low-level features: visual features FW (120-d vector), based on integrated Weibull distribution of edges (texture descriptor), compare region distributions to distributions of a set of reference images (J. C. van Gemert et al. Robust Scene Categorization by Learning Image Statistics in Context. In International Workshop on Semantic Learning Applications in Multimedia,); text features FT. SVM classifier with RBF kernel.
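
Random negative sample selection can be sketched in Python as below. The number of negatives is passed in as a parameter because the formula on the slide is incomplete; the function name and the fixed seed are assumptions for reproducibility of the example.

```python
import random


def sample_negatives(all_images, positives, n_negative, seed=0):
    """Randomly pick negative samples among the images that were not
    selected as positives for the concept."""
    pool = sorted(set(all_images) - set(positives))
    rng = random.Random(seed)
    return rng.sample(pool, min(n_negative, len(pool)))
```

Randomly drawn negatives may occasionally contain true positives, which is one source of the noise discussed in the talk.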

60 Experiments: datasets (provided by Belga news agency). Image collection: 97,628 photographic images; ~1,000 images manually annotated for each VITALAS concept. Search logs: 101 days (June to October 2007); professional users; 9,605 unique ('lightly' normalised) queries: conversion to lower case; removal of punctuation, quotes, and methods; removal of names of major photo agencies; 35,894 clicked images (out of the 97,628). Evaluation datasets: 25 concepts. Training: (manual annotations) (positive samples) (negative samples). Test: 56,605 images.
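
The 'light' normalisation could be approximated with the Python sketch below. The list of agency names is not given in the talk, so it is left as a parameter, and `exampleagency` in the usage note is a made-up placeholder.

```python
import string


def normalise_query(query, agency_names=()):
    """Lower-case the query, strip punctuation and quotes, and drop tokens
    that are names of photo agencies."""
    q = query.lower()
    q = q.translate(str.maketrans("", "", string.punctuation))
    tokens = [t for t in q.split() if t not in agency_names]
    return " ".join(tokens)
```

For instance, `normalise_query('"Traffic Jam", Brussels!')` yields `traffic jam brussels`, and passing `agency_names={"exampleagency"}` removes that token from any query containing it.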

61 Experiments: results [figure: results per method for text features (searchlog-based, manual, manual+searchlog-based) and visual features (manual+searchlog-based, manual, searchlog-based)]. For visual features: combination of manual and searchlog-based training samples performs best consistently over all methods. For text features: searchlog-based training samples produced by less noisy methods perform best. Text features outperform visual features.

64 Concept: soccer [figure panels: manually annotated positive samples; search log based annotated positive samples; test set results (visual features, search log based training)]. View all results at:

65 Image annotation using clickthrough data: main findings. Contribution of search-log training data in image annotation when using supervised machine learning is positive. Scales to a large number of concepts. Can take into account emerging concepts. Available to all content owners: avoids the generalisation problem.

66 Image annotation using clickthrough data. T. Tsikrika, C. Diou, A. P. de Vries, and A. Delopoulos. Image Annotation Using Clickthrough Data. In Proceedings of CIVR. T. Tsikrika, C. Diou, A. P. de Vries, and A. Delopoulos. Reliability and Effectiveness of Clickthrough Data for Automatic Image Annotation. Multimedia Tools & Applications, 55(1),

68 Topic modelling of clickthrough data D. Morrison, T. Tsikrika, V. Hollink, A. P. de Vries, É. Bruno, S. Marchand-Maillet. Topic modelling of clickthrough data in image search. Multimedia Tools & Applications, Springer (to appear)

69 Conclusions. Semantic search log analysis: implications for system design, search support, content management. Query recommendation: beneficial for infrequent queries or queries entered for the first time (long tail); suggestions not occurring in logs (serendipitous discoveries); explain relations between query and suggestions; combination of search logs with linked data. Image annotation using clickthrough data: clickthrough data alone can lead to satisfactory effectiveness; combination with manual annotations improves the effectiveness; scalability in the number of concept detectors; possibility to dynamically adapt the detector set. Optimal sample size? Noise reduction?

70 Thank you!


More information

Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information

Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department of Information

More information

Knowledge-Driven Video Information Retrieval with LOD

Knowledge-Driven Video Information Retrieval with LOD Knowledge-Driven Video Information Retrieval with LOD Leslie F. Sikos, Ph.D., Flinders University ESAIR 15, 23 October 2015 Melbourne, VIC, Australia Knowledge-Driven Video IR Outline Video Retrieval Challenges

More information

Supervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Running Example. Mention Pair Model. Mention Pair Example

Supervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Running Example. Mention Pair Model. Mention Pair Example Supervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Many machine learning models for coreference resolution have been created, using not only different feature sets but also fundamentally

More information

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor

More information

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS Nilam B. Lonkar 1, Dinesh B. Hanchate 2 Student of Computer Engineering, Pune University VPKBIET, Baramati, India Computer Engineering, Pune University VPKBIET,

More information

Extracting Rankings for Spatial Keyword Queries from GPS Data

Extracting Rankings for Spatial Keyword Queries from GPS Data Extracting Rankings for Spatial Keyword Queries from GPS Data Ilkcan Keles Christian S. Jensen Simonas Saltenis Aalborg University Outline Introduction Motivation Problem Definition Proposed Method Overview

More information

An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features

An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features S.Najimun Nisha 1, Mrs.K.A.Mehar Ban 2, 1 PG Student, SVCET, Puliangudi. najimunnisha@yahoo.com 2 AP/CSE,

More information

Hybrid Approach for Query Expansion using Query Log

Hybrid Approach for Query Expansion using Query Log Volume 7 No.6, July 214 www.ijais.org Hybrid Approach for Query Expansion using Query Log Lynette Lopes M.E Student, TSEC, Mumbai, India Jayant Gadge Associate Professor, TSEC, Mumbai, India ABSTRACT Web

More information

An Efficient Methodology for Image Rich Information Retrieval

An Efficient Methodology for Image Rich Information Retrieval An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,

More information

SYSTEM PROFILES IN CONTENT-BASED INDEXING AND RETRIEVAL

SYSTEM PROFILES IN CONTENT-BASED INDEXING AND RETRIEVAL 1 SYSTEM PROFILES IN CONTENT-BASED INDEXING AND RETRIEVAL Esin Guldogan esin.guldogan@tut.fi 2 Outline Personal Media Management Text-Based Retrieval Metadata Retrieval Content-Based Retrieval System Profiling

More information

Bibster A Semantics-Based Bibliographic Peer-to-Peer System

Bibster A Semantics-Based Bibliographic Peer-to-Peer System Bibster A Semantics-Based Bibliographic Peer-to-Peer System Peter Haase 1, Björn Schnizler 1, Jeen Broekstra 2, Marc Ehrig 1, Frank van Harmelen 2, Maarten Menken 2, Peter Mika 2, Michal Plechawski 3,

More information

Semantic Web. Tahani Aljehani

Semantic Web. Tahani Aljehani Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,

More information

Extracting Algorithms by Indexing and Mining Large Data Sets

Extracting Algorithms by Indexing and Mining Large Data Sets Extracting Algorithms by Indexing and Mining Large Data Sets Vinod Jadhav 1, Dr.Rekha Rathore 2 P.G. Student, Department of Computer Engineering, RKDF SOE Indore, University of RGPV, Bhopal, India Associate

More information

MESH. Multimedia Semantic Syndication for Enhanced News Services. Project Overview

MESH. Multimedia Semantic Syndication for Enhanced News Services. Project Overview MESH Multimedia Semantic Syndication for Enhanced News Services Project Overview Presentation Structure 2 Project Summary Project Motivation Problem Description Work Description Expected Result The MESH

More information

Disambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity

Disambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity Disambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity Tomáš Kramár, Michal Barla and Mária Bieliková Faculty of Informatics and Information Technology Slovak University

More information

HealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes

HealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes HealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes PhD Research Project Maged Nabih Kamel Boulos MBBCh, MSc (Derm & Vener), MSc (Medical Informatics) 1 Summary The application

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

An Entity Name Systems (ENS) for the [Semantic] Web

An Entity Name Systems (ENS) for the [Semantic] Web An Entity Name Systems (ENS) for the [Semantic] Web Paolo Bouquet University of Trento (Italy) Coordinator of the FP7 OKKAM IP LDOW @ WWW2008 Beijing, 22 April 2008 An ordinary day on the [Semantic] Web

More information

over Multi Label Images

over Multi Label Images IBM Research Compact Hashing for Mixed Image Keyword Query over Multi Label Images Xianglong Liu 1, Yadong Mu 2, Bo Lang 1 and Shih Fu Chang 2 1 Beihang University, Beijing, China 2 Columbia University,

More information

A Multilingual Social Media Linguistic Corpus

A Multilingual Social Media Linguistic Corpus A Multilingual Social Media Linguistic Corpus Luis Rei 1,2 Dunja Mladenić 1,2 Simon Krek 1 1 Artificial Intelligence Laboratory Jožef Stefan Institute 2 Jožef Stefan International Postgraduate School 4th

More information

T MULTIMEDIA RETRIEVAL SYSTEM EVALUATION

T MULTIMEDIA RETRIEVAL SYSTEM EVALUATION T-61.6030 MULTIMEDIA RETRIEVAL SYSTEM EVALUATION Pauli Ruonala pruonala@niksula.hut.fi 25.4.2008 Contents 1. Retrieve test material 2. Sources of retrieval errors 3. Traditional evaluation methods 4. Evaluation

More information

Using a Medical Thesaurus to Predict Query Difficulty

Using a Medical Thesaurus to Predict Query Difficulty Using a Medical Thesaurus to Predict Query Difficulty Florian Boudin, Jian-Yun Nie, Martin Dawes To cite this version: Florian Boudin, Jian-Yun Nie, Martin Dawes. Using a Medical Thesaurus to Predict Query

More information

Evaluation and image retrieval

Evaluation and image retrieval Evaluation and image retrieval Henning Müller Thomas Deselaers Overview Information retrieval evaluation TREC Multimedia retrieval evaluation TRECVID, ImageEval, Benchathlon, ImageCLEF Past Future Information

More information

Semantic Annotation of Stock Photography for CBIR using MPEG-7 standards

Semantic Annotation of Stock Photography for CBIR using MPEG-7 standards P a g e 7 Semantic Annotation of Stock Photography for CBIR using MPEG-7 standards Balasubramani R Dr.V.Kannan Assistant Professor IT Dean Sikkim Manipal University DDE Centre for Information I Floor,

More information

An Archiving System for Managing Evolution in the Data Web

An Archiving System for Managing Evolution in the Data Web An Archiving System for Managing Evolution in the Web Marios Meimaris *, George Papastefanatos and Christos Pateritsas * Institute for the Management of Information Systems, Research Center Athena, Greece

More information

An Ontology Based Question Answering System on Software Test Document Domain

An Ontology Based Question Answering System on Software Test Document Domain An Ontology Based Question Answering System on Software Test Document Domain Meltem Serhatli, Ferda N. Alpaslan Abstract Processing the data by computers and performing reasoning tasks is an important

More information

Information mining and information retrieval : methods and applications

Information mining and information retrieval : methods and applications Information mining and information retrieval : methods and applications J. Mothe, C. Chrisment Institut de Recherche en Informatique de Toulouse Université Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Overview of the medical task of ImageCLEF Alba G. Seco de Herrera Stefano Bromuri Roger Schaer Henning Müller

Overview of the medical task of ImageCLEF Alba G. Seco de Herrera Stefano Bromuri Roger Schaer Henning Müller Overview of the medical task of ImageCLEF 2016 Alba G. Seco de Herrera Stefano Bromuri Roger Schaer Henning Müller Tasks in ImageCLEF 2016 Automatic image annotation Medical image classification Sub-tasks

More information

Linked Open Europeana: Semantics for the Digital Humanities

Linked Open Europeana: Semantics for the Digital Humanities Linked Open Europeana: Semantics for the Digital Humanities Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin / School of Library and Information Science stefan.gradmann@ibi.hu-berlin.de 1 Overview

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

Deep Character-Level Click-Through Rate Prediction for Sponsored Search

Deep Character-Level Click-Through Rate Prediction for Sponsored Search Deep Character-Level Click-Through Rate Prediction for Sponsored Search Bora Edizel - Phd Student UPF Amin Mantrach - Criteo Research Xiao Bai - Oath This work was done at Yahoo and will be presented as

More information

MUCKE Multimedia and User Credibility Knowledge Extraction

MUCKE Multimedia and User Credibility Knowledge Extraction MUCKE Multimedia and User Credibility Knowledge Extraction http://ifs.tuwien.ac.at/~mucke/ Mihai Lupu (1), Alexandru Ginsca (2) (1) Vienna University of Technology (2) CEA LIST lupu@ifs.tuwien.ac.at Chist-Era

More information

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities 112 Outline Morning program Preliminaries Semantic matching Learning to rank Afternoon program Modeling user behavior Generating responses Recommender systems Industry insights Q&A 113 are polysemic Finding

More information

Exploring and Using the Semantic Web

Exploring and Using the Semantic Web Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17

More information

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 95-96 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity (Matching)

More information

New Generation Open Content Delivery Networks

New Generation Open Content Delivery Networks Open ContEnt Aware Networks New Generation Open Content Delivery Networks Yannick Le Louédec Orange Labs Workshop Future Media Distribution. November 10 th, 2011 www.ict-ocean.eu The research leading to

More information

Exploiting Semantics Where We Find Them

Exploiting Semantics Where We Find Them Vrije Universiteit Amsterdam 19/06/2018 Exploiting Semantics Where We Find Them A Bottom-up Approach to the Semantic Web Prof. Dr. Christian Bizer Bizer: Exploiting Semantics Where We Find Them. VU Amsterdam,

More information

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved

More information

Lab for Media Search, National University of Singapore 1

Lab for Media Search, National University of Singapore 1 1 2 Word2Image: Towards Visual Interpretation of Words Haojie Li Introduction Motivation A picture is worth 1000 words Traditional dictionary Containing word entries accompanied by photos or drawing to

More information

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk

More information

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi Journal of Energy and Power Engineering 10 (2016) 405-410 doi: 10.17265/1934-8975/2016.07.004 D DAVID PUBLISHING Shirin Abbasi Computer Department, Islamic Azad University-Tehran Center Branch, Tehran

More information

Search Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]

Search Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Search Evaluation Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Table of Content Search Engine Evaluation Metrics for relevancy Precision/recall F-measure MAP NDCG Difficulties in Evaluating

More information

OWLS-SLR An OWL-S Service Profile Matchmaker

OWLS-SLR An OWL-S Service Profile Matchmaker OWLS-SLR An OWL-S Service Profile Matchmaker Quick Use Guide (v0.1) Intelligent Systems and Knowledge Processing Group Aristotle University of Thessaloniki, Greece Author: Georgios Meditskos, PhD Student

More information

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured

More information

Annotation Component in KiWi

Annotation Component in KiWi Annotation Component in KiWi Marek Schmidt and Pavel Smrž Faculty of Information Technology Brno University of Technology Božetěchova 2, 612 66 Brno, Czech Republic E-mail: {ischmidt,smrz}@fit.vutbr.cz

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

Medical Image Annotation in ImageCLEF 2008

Medical Image Annotation in ImageCLEF 2008 Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,

More information

Accessing information about Linked Data vocabularies with vocab.cc

Accessing information about Linked Data vocabularies with vocab.cc Accessing information about Linked Data vocabularies with vocab.cc Steffen Stadtmüller 1, Andreas Harth 1, and Marko Grobelnik 2 1 Institute AIFB, Karlsruhe Institute of Technology (KIT), Germany {steffen.stadtmueller,andreas.harth}@kit.edu

More information

Class 5: Attributes and Semantic Features

Class 5: Attributes and Semantic Features Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Project

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

Information Retrieval and Knowledge Organisation

Information Retrieval and Knowledge Organisation Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Mining the Web 2.0 to improve Search

Mining the Web 2.0 to improve Search Mining the Web 2.0 to improve Search Ricardo Baeza-Yates VP, Yahoo! Research Agenda The Power of Data Examples Improving Image Search (Faceted Clusters) Searching the Wikipedia (Correlator) Understanding

More information