Exploring Search Log Data. Theodora Tsikrika University of Applied Sciences Western Switzerland (HES-SO) Switzerland
|
|
- Silas Bennett
- 5 years ago
- Views:
Transcription
1 Exploring Search Log Data Theodora Tsikrika University of Applied Sciences Western Switzerland (HES-SO) Switzerland University of Copenhagen, February 22, 2012 CLEF 2011, Sept 21,
2 Sierre, Switzerland 2 2
3 HES-SO Sierre 1,500 students Institutes: Business Information Systems, Economy, Tourism Research in focussed domains: Internet of things, RFID Mobile applications Energy, Green ICT SAP centre ehealth Information Retrieval 3 3
4 MedGIFT research group Henning HenningMüller Müller Professor Professor Theodora Tsikrika Antonio Foncubierta Postdoc Ph.D. student Adrien Depeursinge Dimitriοs Markonis Postdoc Ph.D. student Manfredo Atzori Alba Garcia Postdoc Ph.D. student Alexandre Cotting Ivan Eggel Project manager Developer Alejandro Vargas (Geneva) Roger Schaer Medical Doctor Developer 4 4
5 MedGIFT research & projects Medical (multidimensional) image analysis and retrieval Multimedia information retrieval Information retrieval evaluation Test collection creation (including images and signals) User testing and task analysis Infrastructures for computation 5 5
6 Exploring search log data : researcher at CWI, Amsterdam, The Netherlands : Database Architectures and Information Access group : Interactive Information Access group VITALAS: Video & image Indexing and Retrieval in the Large Scale (FP6 IP) use-case driven project that built a prototype system dedicated to intelligent access services to multimedia professional archives advanced solutions for indexing, searching and accessing large scale of non previously (or partly) annotated multimedia content novel contributions in cross-media (audio/speech, video, image, text) indexing, content enrichment, and interactive retrieval methods 6 6
7 7 7
8 8 8
9 9 9
10 10 10
11 11 11
12 12 12
13 Search log data Examine users' information searching behaviour Unobtrusive / Naturalistic settings Broad range of user-system interactions / Significant time periods Large amounts of data / Sizable number of users No qualitative user aspects Context, situation, decision process, user satisfaction remain implicit Benefits Understand system usage Improve user experience and system effectiveness Exploitation Core ranking / Automatic query expansion / Ad matching / User modelling Web caching Query assistance Dynamic query suggestions as you type Query recommendations Media Search cluster meeting 13 13
14 Overview Motivation Search log analysis Semantic search log analysis method Study on professional image search log data Query recommendation Exploitation of clickthrough data Image annotation Image search Conclusions 14 14
15 Search log analysis Logged data timestamp, sessionid, userid, query, clicks Level of analysis Term Query Session Analysis of user behavioural patterns Query submission (formulation) Query modification Syntactic level (term-based) Semantic level Media Search cluster meeting 15 15
16 Term-based query modification analysis Addition = specification University of Copenhagen University of Copenhagen ranking Elimination = generalisation University of Copenhagen ranking University of Copenhagen Substitution = reformulation University of Copenhagen University of Aarhus Lexical variation Media Search cluster meeting 16Copenhagen University of Copenhagen Universities of 16
17 Term-based query modification analysis: limitations Reformulations Posthuma tour france Posthuma tour 2008 Vanessa Williams Serena Williams Undetermined University of Copenhagen Royal School of Library Information Science University of Copenhagen Denmark No semantics! Can we add a semantic dimension? How? Can we exploit the Linked Open Data? 17 17
18 Linked Open Data Entity = URI RDF triple = encode what is predicated about specific entities Subject: Beckham Predicate: Object: 'David Beckham' 18
19 Linked Open Data Entity = URI RDF triple = encode what is predicated about specific entities Subject: Beckham Subject: Predicate: Predicate: Object: David Beckham Object: `soccer player 19
20 Linked Open Data Entity = URI RDF triple = encode what is predicated about specific entities Subject: Beckham Subject: Predicate: Subject: Predicate: Object: David Beckham Predicate: Object: `soccer player Object: 20
21 Linked Open Data Entity = URI RDF triple = encode what is predicated about specific entities Subject: Beckham Subject: Predicate: Subject: Predicate: Object: David Beckham Predicate: Subject: Object: `soccer player Object: Predicate: Object: 21
22 Linked Open Data cloud 22 > > sources sources billion billiontriples triples 22
23 Semantic search log analysis method Input: List of search sessions (queries, query pairs) RDF triples from Linked Open Data sources Output: Query types + relative frequencies Query modification patterns + support & confidence values 1. Map queries from logs to entities in RDF triples (rdfs:label) 2. Determine types of entities and count occurrence frequencies 3. Determine semantic relations between entities of query pairs 4. Abstract semantic relations semantic patterns 5. Count occurrence frequencies 6. Rank semantic patterns based on their support and confidence 23 23
24 Semantic search log analysis method 24 24
25 Semantic search log analysis method 25 25
26 Semantic search log analysis method 26 26
27 Abstract semantic relations semantic patterns David Beckham Joe Cole DBPedia:David_Beckham -DBPedia:Nationalteam DBPedia:England_national_football_team DBPedia:Nationalteam- DBPedia:Joe_Cole Q1 -DBPedia:Nationalteam X DBPedia:Nationalteam- Q2 Nicolas Sarkozy Carla Bruni DBPedia:Nicolas_Sarkozy -DBPedia:spouse DBPedia:Carla_Bruni Q1 -DBPedia:spouse Q
28 Rank semantic patterns Which patterns are the most important? The ones that occur with higher frequency? What if these patterns are not informative and simply occur too often in the linked data? Compute expected frequencies of patterns Compute frequency of patterns between random queries Support = relative frequency Support_session Confidence = Support_session + Support_random Media Search cluster meeting 28 28
29 Overview Motivation Search log analysis Semantic search log analysis method Study on professional image search log data Query recommendation Exploitation of clickthrough data Image annotation Image search Conclusions 29 29
30 Professional image search logs analysis European news agency Commercial picture portal Millions of photographic images Professional users Search log data 10 months ~ 1 million queries / 0.5 million sessions Linked Open Data sources (22 million RDF triples) DBpedia WordNet Cornetto Getty geographical names Media Search cluster Getty Art and Architecture thesaurus meeting 30 30
31 Search log statistics (October 2008 July 2009) 31 31
32 Query frequency distribution 32 32
33 Query types Found matching URI for 79% of all queries Identified type for 68% of matched queries (about half of all queries) 33 33
34 Query types conceptual queries specific queries 34 34
35 Query types DBpedia:Person 35 35
36 Query modification patterns 24% query pairs classified using the semantic analysis 36 36
37 Query modification patterns identity relation 37 37
38 Query modification patterns partner of a person 38 38
39 Query modification patterns common property 39 39
40 Query modification patterns same type e.g., tennis players, townships,
41 Query modification patterns close relation e.g., prince and princess 41 41
42 Query modification classes Sibling relations: 19% Q1 -R X R- Q2 e.g., common property, WordNet hyponyms Direct few-to-few relations: 10% e.g., spouse Other relations: 71% 42 42
43 Term-based query modification analysis 43 43
44 Term-based vs. semantic query modification analysis 25% query pairs classified using the term-based analysis 24% query pairs classified using the semantic analysis complementary approaches 44 44
45 Accuracy of the method Semantic search log analysis method: 1. Match the query to linked data entities 2. Determine query types 3. Identify query modification patterns Accuracy of query modification pattern identification 100 query pairs randomly selected 4 judges identified the most prominent relation for 25 query pairs each (ground truth) 3 raters assessed the patterns identified by the system against the ground truth System choice classified as incorrect, approximately correct, correct Agreement among raters: 0.69 Fleiss kappa Agreement between system and ground truth: 0.61 (lenient mapping) System moderatelymedia successful Search cluster meeting 45 45
46 Overview Motivation Search log analysis Semantic search log analysis method Study on professional image search log data Query recommendation Exploitation of clickthrough data Image annotation Image search Conclusions 46 46
47 Query recommendation: approaches Existing approaches Document-based methods Search log-based methods: co-occurring queries Not previously submitted queries? Infrequent queries? Ontology-based methods Which links to select? Combinations of the above Based on semantic patterns Given: A query mapped to concept(s) Semantic patterns ranked by their support Apply patterns to concepts Suggestions ranked by their support Media Search clustervalue meeting 47 Ties broken by occurrence frequency in logs 47
48 Query recommendation: experiments Applied approach Baseline: search log-based method Top-10 co-occurring queries in the same session If suggestions < 10, then add suggestions based on semantic patterns Datasets 1,105, 766 queries 332,809 sessions 80% of sessions used for training (417,633 query pairs) 20% of sessions used for testing (64,767 query pairs) 44 semantic patterns 48 48
49 Query recommendation: results Log-based statistics Log-based statistics + Semantic patterns All queries Success rate % Coverage % * Queries that occur 5 times or less (36% of queries) Success rate % * Coverage % * Coverage: # times at least one suggestion is found Success rate: # times that suggestions include ground truth 49 Ground truth = the query immediately following the user s query in a session 49
50 Semantic search log analysis V. Hollink, T. Tsikrika, and A. P. de Vries. Semantic Search Log Analysis: a Method and a Study on Professional Image Search. JASIST, 62(4): , V. Hollink, T. Tsikrika, and A. P. de Vries. The semantics of query modification. In Proceedings of the 9th International Conference on Adaptivity, Personalization and Fusion of Heterogeneous Information (RIAO 2010), April 28-30, Paris, France,
51 Overview Motivation Search log analysis Semantic search log analysis method Study on professional image search log data Query recommendation Exploitation of clickthrough data Image annotation Image search Conclusions 51 51
52 Concept-based Image Annotation Aim: unambiguously describe the visual content of images Bridge Canal Red houses... Caption : Pretty Copenhagen
53 Concept-based Image Annotation Challenges when using supervised machine learning techniques: require labelled samples as training data laborious and expensive task when performed manually large number of semantic concepts poor generalisation of concept classifiers in other domains How can we automatically supplement/replace the manually annotated training samples? 53 53
54 Approach Automatically generate annotated training samples user-defined tags (e.g., Flickr) keywords extracted from Web pages where images are embedded clickthrough data collected in search logs traffic advantages: large quantities, no user intervention, available to all content owners, collective annotations (assessments) disadvantages: sparse, noisy, user queries not based on Media Search cluster meeting 54 strict visual criteria 54
55 Research questions 1) How can we build classifiers for annotating images with concepts using clickthrough data? methods for searchlog-based positive sample selection random negative sample selection 2) What is the effectiveness of these concept classifiers? experiments using data provided by BELGA news agency ~100k photographic images (with their text metadata) clickthrough data 55 55
56 Concept definition A concept is a clearly defined, non ambiguous entity represented by a short name keywords free-text short description Name traffic Concept Keywords traffic, traffic jam, cars, road, highway Description Image showing a high density of vehicles when on a road or highway
57 Positive sample selection using search logs Method exact select images clicked for queries exactly matching the concept name Methods textual similarity (based on IR language models) annotate each image with all queries for which it has been clicked apply stemming (yes/no) select images retrieved for query: (i) concept name (ii) concept keywords using retrieval model: (i) language model (LM) (ii) smoothed LM (LMS) Method clickgraph images clicked for the same query are likely to be relevant to each other 57 57
58 Reliability of clickthrough-based annotations methods varies greatly across concepts around 20% of the total number of concepts for each method reach Media 58 agreement of at least 0.8 Search cluster meeting 58
59 Building concept classifiers Positive samples for concept c: Nc,m images selected using one of the methods m exact, textual similarity (6 language modelling variants), clickgraph Negative samples for concept c: Nc',m = max( Nc,m,, Nc,m) images randomly selected Low-level features visual features FW (120-d vector) based on integrated Weibull distribution of edges (texture descriptor) compare region distributions to distributions of a set of reference images J. C. van Gemert et al. Robust Scene Categorization by Learning Image Statistics in Context. In International Workshop on Semantic Learning Applications in Multimedia, text features FT SVM classifier with RBFMedia kernelsearch cluster meeting 59 59
60 Experiments: datasets (provided by Belga news agency) Image collection 97,628 photographic images ~1,000 images manually annotated for each VITALAS concept Search logs 101 days (June October 2007) professional users 9,605 unique ('lightly' normalised) queries conversion to lower case removal of punctuation, quotes, and methods removal of names of major photo agencies 35,894 clicked images (out of the 97,628) Evaluation datasets 25 concepts Training: (manual annotations) (positive samples) (negative samples) 60 Test: 56,605 images 60
61 Experiments: results methods For visual features : combination of manual and searchlog-based training samples performs best consistently over all methods For text features : searchlog-based training samples produced by less noisy methods perform best 61 Text features outperform visual features 61
62 Experiments: results visual features manual+searchlog-based visual features: manual visual features searchlog-based methods For visual features : combination of manual and searchlog-based training samples performs best consistently over all methods For text features : searchlog-based training samples produced by less noisy methods perform best 62 Text features outperform visual features 62
63 Experiments: results text features searchlog-based text features: manual text features manual+searchlog-based visual features manual+searchlog-based visual features: manual visual features searchlog-based methods For visual features : combination of manual and searchlog-based training samples performs best consistently over all methods For text features : searchlog-based training samples produced by less noisy methods perform best 63 Text features outperform visual features 63
64 Concept: soccer manually annotated positive samples search log based annotated positive samples test set results visual features search log based training 64 View all results at: 64
65 Image annotation using clickthrough data: main findings Contribution of search-log training data in image annotation when using supervised machine learning is positive Scales to a large number of concepts Can take into account emerging concepts Available to all content owners avoid the generalisation problem 65 65
66 Image annotation using clickthrough data T. Tsikrika, C. Diou, A. P. de Vries, and A. Delopoulos. Image Annotation Using Clickthrough Data. In Proceedings of CIVR Τ. Tsikrika, C. Diou, A.P. de Vries, and A. Delopoulos. Reliability and Effectiveness of Clickthrough Data for Automatic Image Annotation. Multimedia Tools & Applications, 55(1),
67 Overview Motivation Search log analysis Semantic search log analysis method Study on professional image search log data Query recommendation Exploitation of clickthrough data Image annotation Image search Conclusions 67 67
68 Topic modelling of clickthrough data D. Morrison, T. Tsikrika, V. Hollink, A. P. de Vries, É. Bruno, S. Marchand-Maillet. Topic modelling of clickthrough data in image search. Multimedia Tools & Applications, Springer (to appear)
69 Conclusions Semantic search log analysis Implications for system design, search support, content management Query recommendation Beneficial for infrequent queries or queries entered for first time (long tail) Suggestions not occurring in logs (serendipitous discoveries) Explain relations between query and suggestions Combination of search logs with linked data Image annotation using clickthough data clickthrough data alone can lead to satisfactory effectiveness combination with manual annotations improves the effectiveness scalability in the number of concept detectors possibility to dynamically adapt the detector set Optimal sample size? Noise reduction? 69 69
70 Thank you!
Are clickthrough data reliable as image annotations?
Video & Image Indexing and Retrieval in the Large Scale Are clickthrough data reliable as image annotations? Theodora Tsikrika (CWI) Christos Diou (AUTH, CERTH-ITI) Arjen P. de Vries (CWI, TU Delft) (
More informationMedical image analysis and retrieval. Henning Müller
Medical image analysis and retrieval Henning Müller Overview My background Our laboratory Current projects Khresmoi, MANY, Promise, Chorus+, NinaPro Challenges Demonstration Conclusions 2 Personal background
More informationSemantic Search Log Analysis: A Method and a Study on Professional Image Search
Semantic Search Log Analysis: A Method and a Study on Professional Image Search Vera Hollink, Theodora Tsikrika, and Arjen P. de Vries Centrum Wiskunde en Informatica, Science Park 123, 1098 XG Amsterdam,
More informationA Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task
A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland
More informationColumbia University High-Level Feature Detection: Parts-based Concept Detectors
TRECVID 2005 Workshop Columbia University High-Level Feature Detection: Parts-based Concept Detectors Dong-Qing Zhang, Shih-Fu Chang, Winston Hsu, Lexin Xie, Eric Zavesky Digital Video and Multimedia Lab
More informationOverview of ImageCLEF Mauricio Villegas (on behalf of all organisers)
Overview of ImageCLEF 2016 Mauricio Villegas (on behalf of all organisers) ImageCLEF history Started in 2003 with a photo retrieval task 4 participants submitting results In 2009 we had 6 tasks and 65
More informationPROJECT PERIODIC REPORT
PROJECT PERIODIC REPORT Grant Agreement number: 257403 Project acronym: CUBIST Project title: Combining and Uniting Business Intelligence and Semantic Technologies Funding Scheme: STREP Date of latest
More informationMedGIFT projects in medical imaging. Henning Müller
MedGIFT projects in medical imaging Henning Müller Where we are 2 Who I am Medical informatics studies in Heidelberg, Germany (1992-1997) Exchange with Daimler Benz research, USA PhD in image processing,
More informationMultimedia Information Retrieval
Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Multimedia Information Retrieval 1. What are
More informationWikipedia Retrieval Task ImageCLEF 2011
Wikipedia Retrieval Task ImageCLEF 2011 Theodora Tsikrika University of Applied Sciences Western Switzerland, Switzerland Jana Kludas University of Geneva, Switzerland Adrian Popescu CEA LIST, France Outline
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationNUS-I2R: Learning a Combined System for Entity Linking
NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm
More informationMIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion
MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationSemantic Website Clustering
Semantic Website Clustering I-Hsuan Yang, Yu-tsun Huang, Yen-Ling Huang 1. Abstract We propose a new approach to cluster the web pages. Utilizing an iterative reinforced algorithm, the model extracts semantic
More informationTheme Identification in RDF Graphs
Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published
More informationSLIPO. Scalable Linking and Integration of Big POI data. Giorgos Giannopoulos IMIS/Athena RC
SLIPO Scalable Linking and Integration of Big POI data I n f o r m a ti o n a n d N e t w o r ki n g D a y s o n H o ri z o n 2 0 2 0 B i g Da ta Public-Priva te Partnership To p i c : I C T 14 B i g D
More informationSemantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.
Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...
More informationUsing Linked Data to Reduce Learning Latency for e-book Readers
Using Linked Data to Reduce Learning Latency for e-book Readers Julien Robinson, Johann Stan, and Myriam Ribière Alcatel-Lucent Bell Labs France, 91620 Nozay, France, Julien.Robinson@alcatel-lucent.com
More informationA Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio
A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio Haakon Lund 1, Mette Skov 2, Birger Larsen 2 and Marianne Lykke 2 1 Royal School of Library and
More informationWord Indexing Versus Conceptual Indexing in Medical Image Retrieval
Word Indexing Versus Conceptual Indexing in Medical Image Retrieval (ReDCAD participation at ImageCLEF Medical Image Retrieval 2012) Karim Gasmi, Mouna Torjmen-Khemakhem, and Maher Ben Jemaa Research unit
More informationMetadata Topic Harmonization and Semantic Search for Linked-Data-Driven Geoportals -- A Case Study Using ArcGIS Online
Metadata Topic Harmonization and Semantic Search for Linked-Data-Driven Geoportals -- A Case Study Using ArcGIS Online Yingjie Hu 1, Krzysztof Janowicz 1, Sathya Prasad 2, and Song Gao 1 1 STKO Lab, Department
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationA Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2
A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationDocument Clustering for Mediated Information Access The WebCluster Project
Document Clustering for Mediated Information Access The WebCluster Project School of Communication, Information and Library Sciences Rutgers University The original WebCluster project was conducted at
More informationQualitative Data Analysis Software. A workshop for staff & students School of Psychology Makerere University
Qualitative Data Analysis Software A workshop for staff & students School of Psychology Makerere University (PhD) January 27, 2016 Outline for the workshop CAQDAS NVivo Overview Practice 2 CAQDAS Before
More informationAutomatic Generation of Query Sessions using Text Segmentation
Automatic Generation of Query Sessions using Text Segmentation Debasis Ganguly, Johannes Leveling, and Gareth J.F. Jones CNGL, School of Computing, Dublin City University, Dublin-9, Ireland {dganguly,
More informationEFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH
EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH Andreas Walter FZI Forschungszentrum Informatik, Haid-und-Neu-Straße 10-14, 76131 Karlsruhe, Germany, awalter@fzi.de
More informationCIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task
CIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task Xiaolong Wang, Xiangying Jiang, Abhishek Kolagunda, Hagit Shatkay and Chandra Kambhamettu Department of Computer and Information
More informationPouya Kousha Fall 2018 CSE 5194 Prof. DK Panda
Pouya Kousha Fall 2018 CSE 5194 Prof. DK Panda 1 Observe novel applicability of DL techniques in Big Data Analytics. Applications of DL techniques for common Big Data Analytics problems. Semantic indexing
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationThe European Commission s science and knowledge service. Joint Research Centre
The European Commission s science and knowledge service Joint Research Centre GeoDCAT-AP The story so far Andrea Perego, Antonio Rotundo, Lieven Raes GeoDCAT-AP Webinar 6 June 2018 What is GeoDCAT-AP Geospatial
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationSemantic Annotation and Linking of Medical Educational Resources
5 th European IFMBE MBEC, Budapest, September 14-18, 2011 Semantic Annotation and Linking of Medical Educational Resources N. Dovrolis 1, T. Stefanut 2, S. Dietze 3, H.Q. Yu 3, C. Valentine 3 & E. Kaldoudi
More informationIMOTION. Heiko Schuldt, University of Basel, Switzerland
IMOTION Heiko Schuldt, University of Basel, Switzerland heiko.schuldt@unibas.ch IMOTION at a Glance Project Title Intelligent Multimodal Augmented Video Motion Retrieval System (IMOTION) Project Start
More informationEuropeana and semantic alignment of vocabularies
Europeana and semantic alignment of vocabularies Antoine Isaac Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber Europeana & Vrije Universiteit Amsterdam NKOS workshop, Berlin, Sept.
More informationOverview MULTIMEDIA INFORMATION RETRIEVAL. Search Engines. Information Retrieval. Explanation. Van Rijsbergen
MULTIMEDIA INFORMATION RETRIEVAL Arjen P. de Vries arjen@acm.org Overview Information Retrieval Text Retrieval Multimedia Retrieval Recent Developments Research Topics Centrum voor Wiskunde en Informatica
More informationAn Evaluation of Geo-Ontology Representation Languages for Supporting Web Retrieval of Geographical Information
An Evaluation of Geo-Ontology Representation Languages for Supporting Web Retrieval of Geographical Information P. Smart, A.I. Abdelmoty and C.B. Jones School of Computer Science, Cardiff University, Cardiff,
More informationCHAPTER 6 PROPOSED HYBRID MEDICAL IMAGE RETRIEVAL SYSTEM USING SEMANTIC AND VISUAL FEATURES
188 CHAPTER 6 PROPOSED HYBRID MEDICAL IMAGE RETRIEVAL SYSTEM USING SEMANTIC AND VISUAL FEATURES 6.1 INTRODUCTION Image representation schemes designed for image retrieval systems are categorized into two
More informationA model of information searching behaviour to facilitate end-user support in KOS-enhanced systems
A model of information searching behaviour to facilitate end-user support in KOS-enhanced systems Dorothee Blocks Hypermedia Research Unit School of Computing University of Glamorgan, UK NKOS workshop
More informationOntology Based Prediction of Difficult Keyword Queries
Ontology Based Prediction of Difficult Keyword Queries Lubna.C*, Kasim K Pursuing M.Tech (CSE)*, Associate Professor (CSE) MEA Engineering College, Perinthalmanna Kerala, India lubna9990@gmail.com, kasim_mlp@gmail.com
More informationCSI 4107 Image Information Retrieval
CSI 4107 Image Information Retrieval This slides are inspired by a tutorial on Medical Image Retrieval by Henning Müller and Thomas Deselaers, 2005-2006 1 Outline Introduction Content-based image retrieval
More informationTowards Linked Data and ontology development for the semantic enrichment of volunteered geo-information
AGILE Link-VGI workshop, Helsinki 14 June 2016 Towards Linked Data and ontology development for the semantic enrichment of volunteered geo-information Rob Lemmens University of Twente, Faculty of Geo-Information
More informationJoint Inference in Image Databases via Dense Correspondence. Michael Rubinstein MIT CSAIL (while interning at Microsoft Research)
Joint Inference in Image Databases via Dense Correspondence Michael Rubinstein MIT CSAIL (while interning at Microsoft Research) My work Throughout the year (and my PhD thesis): Temporal Video Analysis
More informationSupervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information
Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department of Information
More informationKnowledge-Driven Video Information Retrieval with LOD
Knowledge-Driven Video Information Retrieval with LOD Leslie F. Sikos, Ph.D., Flinders University ESAIR 15, 23 October 2015 Melbourne, VIC, Australia Knowledge-Driven Video IR Outline Video Retrieval Challenges
More informationSupervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Running Example. Mention Pair Model. Mention Pair Example
Supervised Models for Coreference Resolution [Rahman & Ng, EMNLP09] Many machine learning models for coreference resolution have been created, using not only different feature sets but also fundamentally
More informationA Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2
A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor
More informationAUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS
AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS Nilam B. Lonkar 1, Dinesh B. Hanchate 2 Student of Computer Engineering, Pune University VPKBIET, Baramati, India Computer Engineering, Pune University VPKBIET,
More informationExtracting Rankings for Spatial Keyword Queries from GPS Data
Extracting Rankings for Spatial Keyword Queries from GPS Data Ilkcan Keles Christian S. Jensen Simonas Saltenis Aalborg University Outline Introduction Motivation Problem Definition Proposed Method Overview
More informationAn Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features
An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features S.Najimun Nisha 1, Mrs.K.A.Mehar Ban 2, 1 PG Student, SVCET, Puliangudi. najimunnisha@yahoo.com 2 AP/CSE,
More informationHybrid Approach for Query Expansion using Query Log
Volume 7 No.6, July 214 www.ijais.org Hybrid Approach for Query Expansion using Query Log Lynette Lopes M.E Student, TSEC, Mumbai, India Jayant Gadge Associate Professor, TSEC, Mumbai, India ABSTRACT Web
More informationAn Efficient Methodology for Image Rich Information Retrieval
An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,
More informationSYSTEM PROFILES IN CONTENT-BASED INDEXING AND RETRIEVAL
1 SYSTEM PROFILES IN CONTENT-BASED INDEXING AND RETRIEVAL Esin Guldogan esin.guldogan@tut.fi 2 Outline Personal Media Management Text-Based Retrieval Metadata Retrieval Content-Based Retrieval System Profiling
More informationBibster A Semantics-Based Bibliographic Peer-to-Peer System
Bibster A Semantics-Based Bibliographic Peer-to-Peer System Peter Haase 1, Björn Schnizler 1, Jeen Broekstra 2, Marc Ehrig 1, Frank van Harmelen 2, Maarten Menken 2, Peter Mika 2, Michal Plechawski 3,
More informationSemantic Web. Tahani Aljehani
Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,
More informationExtracting Algorithms by Indexing and Mining Large Data Sets
Extracting Algorithms by Indexing and Mining Large Data Sets Vinod Jadhav 1, Dr.Rekha Rathore 2 P.G. Student, Department of Computer Engineering, RKDF SOE Indore, University of RGPV, Bhopal, India Associate
More informationMESH. Multimedia Semantic Syndication for Enhanced News Services. Project Overview
MESH Multimedia Semantic Syndication for Enhanced News Services Project Overview Presentation Structure 2 Project Summary Project Motivation Problem Description Work Description Expected Result The MESH
More informationDisambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity
Disambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity Tomáš Kramár, Michal Barla and Mária Bieliková Faculty of Informatics and Information Technology Slovak University
More informationHealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes
HealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes PhD Research Project Maged Nabih Kamel Boulos MBBCh, MSc (Derm & Vener), MSc (Medical Informatics) 1 Summary The application
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationAn Entity Name Systems (ENS) for the [Semantic] Web
An Entity Name Systems (ENS) for the [Semantic] Web Paolo Bouquet University of Trento (Italy) Coordinator of the FP7 OKKAM IP LDOW @ WWW2008 Beijing, 22 April 2008 An ordinary day on the [Semantic] Web
More informationover Multi Label Images
IBM Research Compact Hashing for Mixed Image Keyword Query over Multi Label Images Xianglong Liu 1, Yadong Mu 2, Bo Lang 1 and Shih Fu Chang 2 1 Beihang University, Beijing, China 2 Columbia University,
More informationA Multilingual Social Media Linguistic Corpus
A Multilingual Social Media Linguistic Corpus Luis Rei 1,2 Dunja Mladenić 1,2 Simon Krek 1 1 Artificial Intelligence Laboratory Jožef Stefan Institute 2 Jožef Stefan International Postgraduate School 4th
More informationT MULTIMEDIA RETRIEVAL SYSTEM EVALUATION
T-61.6030 MULTIMEDIA RETRIEVAL SYSTEM EVALUATION Pauli Ruonala pruonala@niksula.hut.fi 25.4.2008 Contents 1. Retrieve test material 2. Sources of retrieval errors 3. Traditional evaluation methods 4. Evaluation
More informationUsing a Medical Thesaurus to Predict Query Difficulty
Using a Medical Thesaurus to Predict Query Difficulty Florian Boudin, Jian-Yun Nie, Martin Dawes To cite this version: Florian Boudin, Jian-Yun Nie, Martin Dawes. Using a Medical Thesaurus to Predict Query
More informationEvaluation and image retrieval
Evaluation and image retrieval Henning Müller Thomas Deselaers Overview Information retrieval evaluation TREC Multimedia retrieval evaluation TRECVID, ImageEval, Benchathlon, ImageCLEF Past Future Information
More informationSemantic Annotation of Stock Photography for CBIR using MPEG-7 standards
P a g e 7 Semantic Annotation of Stock Photography for CBIR using MPEG-7 standards Balasubramani R Dr.V.Kannan Assistant Professor IT Dean Sikkim Manipal University DDE Centre for Information I Floor,
More informationAn Archiving System for Managing Evolution in the Data Web
An Archiving System for Managing Evolution in the Web Marios Meimaris *, George Papastefanatos and Christos Pateritsas * Institute for the Management of Information Systems, Research Center Athena, Greece
More informationAn Ontology Based Question Answering System on Software Test Document Domain
An Ontology Based Question Answering System on Software Test Document Domain Meltem Serhatli, Ferda N. Alpaslan Abstract Processing the data by computers and performing reasoning tasks is an important
More informationInformation mining and information retrieval : methods and applications
Information mining and information retrieval : methods and applications J. Mothe, C. Chrisment Institut de Recherche en Informatique de Toulouse Université Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationOverview of the medical task of ImageCLEF Alba G. Seco de Herrera Stefano Bromuri Roger Schaer Henning Müller
Overview of the medical task of ImageCLEF 2016 Alba G. Seco de Herrera Stefano Bromuri Roger Schaer Henning Müller Tasks in ImageCLEF 2016 Automatic image annotation Medical image classification Sub-tasks
More informationLinked Open Europeana: Semantics for the Digital Humanities
Linked Open Europeana: Semantics for the Digital Humanities Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin / School of Library and Information Science stefan.gradmann@ibi.hu-berlin.de 1 Overview
More informationOverview of Web Mining Techniques and its Application towards Web
Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous
More informationDeep Character-Level Click-Through Rate Prediction for Sponsored Search
Deep Character-Level Click-Through Rate Prediction for Sponsored Search Bora Edizel - Phd Student UPF Amin Mantrach - Criteo Research Xiao Bai - Oath This work was done at Yahoo and will be presented as
More informationMUCKE Multimedia and User Credibility Knowledge Extraction
MUCKE Multimedia and User Credibility Knowledge Extraction http://ifs.tuwien.ac.at/~mucke/ Mihai Lupu (1), Alexandru Ginsca (2) (1) Vienna University of Technology (2) CEA LIST lupu@ifs.tuwien.ac.at Chist-Era
More informationOutline. Morning program Preliminaries Semantic matching Learning to rank Entities
112 Outline Morning program Preliminaries Semantic matching Learning to rank Afternoon program Modeling user behavior Generating responses Recommender systems Industry insights Q&A 113 are polysemic Finding
More informationExploring and Using the Semantic Web
Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring
More information<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17
More informationSemantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96
ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 95-96 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity (Matching)
More informationNew Generation Open Content Delivery Networks
Open ContEnt Aware Networks New Generation Open Content Delivery Networks Yannick Le Louédec Orange Labs Workshop Future Media Distribution. November 10 th, 2011 www.ict-ocean.eu The research leading to
More informationExploiting Semantics Where We Find Them
Vrije Universiteit Amsterdam 19/06/2018 Exploiting Semantics Where We Find Them A Bottom-up Approach to the Semantic Web Prof. Dr. Christian Bizer Bizer: Exploiting Semantics Where We Find Them. VU Amsterdam,
More informationWhat you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web
What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved
More informationLab for Media Search, National University of Singapore 1
1 2 Word2Image: Towards Visual Interpretation of Words Haojie Li Introduction Motivation A picture is worth 1000 words Traditional dictionary Containing word entries accompanied by photos or drawing to
More informationWEIGHTING QUERY TERMS USING WORDNET ONTOLOGY
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk
More informationD DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi
Journal of Energy and Power Engineering 10 (2016) 405-410 doi: 10.17265/1934-8975/2016.07.004 D DAVID PUBLISHING Shirin Abbasi Computer Department, Islamic Azad University-Tehran Center Branch, Tehran
More informationSearch Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]
Search Evaluation Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Table of Content Search Engine Evaluation Metrics for relevancy Precision/recall F-measure MAP NDCG Difficulties in Evaluating
More informationOWLS-SLR An OWL-S Service Profile Matchmaker
OWLS-SLR An OWL-S Service Profile Matchmaker Quick Use Guide (v0.1) Intelligent Systems and Knowledge Processing Group Aristotle University of Thessaloniki, Greece Author: Georgios Meditskos, PhD Student
More informationInformation Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science
Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured
More informationAnnotation Component in KiWi
Annotation Component in KiWi Marek Schmidt and Pavel Smrž Faculty of Information Technology Brno University of Technology Božetěchova 2, 612 66 Brno, Czech Republic E-mail: {ischmidt,smrz}@fit.vutbr.cz
More informationEnhanced retrieval using semantic technologies:
Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,
More informationAccessing information about Linked Data vocabularies with vocab.cc
Accessing information about Linked Data vocabularies with vocab.cc Steffen Stadtmüller 1, Andreas Harth 1, and Marko Grobelnik 2 1 Institute AIFB, Karlsruhe Institute of Technology (KIT), Germany {steffen.stadtmueller,andreas.harth}@kit.edu
More informationClass 5: Attributes and Semantic Features
Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Project
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationInformation Retrieval and Knowledge Organisation
Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationMining the Web 2.0 to improve Search
Mining the Web 2.0 to improve Search Ricardo Baeza-Yates VP, Yahoo! Research Agenda The Power of Data Examples Improving Image Search (Faceted Clusters) Searching the Wikipedia (Correlator) Understanding
More information