Machine Learning Approaches to Information Retrieval
|
|
- Amberlynn Goodwin
- 6 years ago
- Views:
Transcription
1 Machine Learning Approaches to Information Retrieval Hang Li Microsoft Research Asia Joint Work with Jun Xu Yunbo Cao Guoping Hu
2 Talk Outline Introuction to Information Retrieval Search by Type Factoi Search New Learning Algorithm for Ranking Information Desk Summary
3 Introuction to Information Retrieval
4 What Is Information Retrieval? search I want to access information X uery information
5 Why Important?
6 Current Approach Information Retrieval = Ranking
7 General Moel for Ranking n ~ ~ ~ n n f f f uery or uestion ocuments information relevance scores for ranking D Q f
8 Relevance: Matching between Query an Document f
9 Probabilistic Moel n ~ ~ ~ n n r P r P r P uery or uestion ocuments relevant scores for ranking } { r r R D Q R P
10 Okapi or BM5 Robertson an Walker 994 ocuments ranking function n uery or uestion w + + tf w b k + tf w l k b avgl
11 Language Moe Ponte an Croft 998; Lafferty an Zhai 00 nl n n n n l l w w w w w w w w w = = = l w w w = ~ ~ ~ n n P P P ocument = bag of wors relevance scores for ranking
12 Classification Moel n uery or uestion ocuments relevance scores for ranking } { r r R D Q R P Logistic Regression or SVM ~ ~ ~ n n r P r P r P
13 Orinal Regression Moel Herbrich et al. 000; Joachims 00 n ~ ~ ~ n n f f f uery or uestion ocuments relevance scores for ranking } { k R D Q f R = Ranking SVM
14 General Moel for Ranking n n n f g f g f g uery or uestion ocuments scores for ranking D g D Q f
15 Page Rank Brin an Page 998 P i = α n + α P j M L j j i
16 Challenges Proximity Synonym an polysemi Quality an freshness of ocument webpage Spamming Information granularity Search nee unerstaning Evaluation Personalization Training ata collection relevance feeback click-through
17 Search by Document Type
18 Search by Document Type I want to fin ocuments relevant to X in type Y uery ocuments
19 Probabilistic Moel n uery with type ocuments ranking function } { } { t t T r r R D Q T R P D T P D Q R P D Q T R P
20 Example Manuals Search Query create an external link Document External Links External links are to Web sites external to your Web site. You can create external links by using buttons highlighting text or creating hot spots on images. For example creating a linkage to the Far Eastern Economic Review [FEER] one woul go to the FEER through the Internet. You might want to right click on the FEER logo or recent issue cover an save the image to your computer [must be save as a graphical interface file - GIF]. To make either the button or the text or the picture or all three interactive you highlight what you want to connect an then you click the hyperlink tool or the Eit-Hyperlink. Press the Worl Wie Web tab in the Create Hyperlink iaolgue box [see box below]. Whatever aress you have in the running Net browser will show up in the box or you can browse. In this case the FEER aress is there one only nee to press OK an the external hyperlink is create. All three examples to the right will link you to the FEER site. Document Creating an External Link To create an anchor that is a link to another ocument: Select to select by click an rag or by keyboar the text for the link you are creating. Click the Link button first case or select the entry "Create or change link" in the Links menu secon case. In the first case the cursor changes from an arrow to a han to let you click the target ocument. If the target ocument is isplaye in another Amaya winow click anywhere within that winow to create the link. If the target ocument is not isplaye in another Amaya winow press the F or Delete key or click a part of the ocument which cannot be a vali target. A ialog prompts you for the location of the target ocument. Type the URI of the target ocument an then Confirm to create the link. In the secon case a ialog prompts you for the location of the target ocument. If the target ocument is isplaye in another Amaya winow an you want to select it by click click the Click button then click anywhere within that winow to create the link. If the target ocument is not isplaye in another Amaya winow type the URI of the target ocument an then Confirm to create the link.
21 Manuals Search -- Experiment MS intranet ata 50 ueries from log of Microsoft Web. answers per uery from 5000 ocuments Evaluation Measure Results RR i = Rank i MRR Q i = = Q RR i Metho Type Only Relevance Only Combine Moel our approach MRR
22 Factoi Search
23 People Search Cao et al. 005 I want to fin people relate to X uery people
24 Probabilistic Moel n uery with type ocuments ranking function D Q E P P e P P e P e m e e entities
25 Example People Search Query Who knows about igital ink Document Jian Wang Ph.D. Senior Researcher & Research Manager Multimoal User Interface Group Microsoft Research Asia Dr. Jian Wang is Research Manager of the Multimoal User Interface Group at Microsoft Research Asia MSR Asia. Dr. Jian Wang's research specializations are ink an pen computing usability multimoal user interface virtual reality an human cognition. The Multimoal UI Group's current research projects inclue: avance igital ink parser igital ink annotation an representation of igital ink for Tablet PC. The group previously invente an inline input an correction user interface for Asian languages calle Moeless Input User Interface which allows Chinese users of Office XP to smoothly enter English an Chinese text without constantly switching between input language moes. Answer Jian Wang
26 People Search -- Experiment MSR Corpus 3 ueries searching in 309 ocuments 80 Caniates Evaluation Measure #{ persons which appear in both Top - 5 ranke Top 5 Precision = 5 caniates an groun truth} Results Metho Profile-base Moel Our Moel Top-5 Precision %
27 Time Search I want to fin time relate to X uery ates
28 Probabilistic Moel passages t t entities e e t n e m ranking function uery with type P E Q T e ~ f t = λ f t i i i
29 Time Search -- Example Query: Nixon visit China Document: Nixon's most significant achievement in foreign affairs may have been the establishment of irect relations with the People's Republic of China after a -year estrangement. Following a series of low-level iplomatic contacts in 970 an the lifting of U.S. trae an travel restrictions the following year the Chinese inicate that they woul welcome high-level iscussions an Nixon sent his national security aviser Henry Kissinger to China for secret talks. The thaw in relations became apparent with the ping-pong iplomacy conucte by American an Chinese table-tennis teams in reciprocal visits in Nixon's visit to China in February March 97 the first by an American presient while in office conclue with the Shanghai Communiué in which the Unite States formally recognize the one-china principle that there is only one China an that Taiwan is a part of China. Answer February March 97
30 Time Search -- Experiment MS intranet ata 00 ueries from log of containing when scheule ay an time 8.73 answers per uery from 0000 ocuments Evaluation Measure Results RR i = Rank i MRR Q i RR = = Q i Metho Best Baseline Learning MRR %
31 New Learning Algorithm for Ranking
32 Ranking Learning Given: S = { x y } X Ywhere Y = { r r } m i i i= Η= { h: X Y} hypothesis space L: Y Y R loss function Question: base on S fin h * = arg min E L h x y ERM: SRM: m * choose herm = argmin L h xi yi i= m * choose hsrm = argmin L h xi yi + λq h i=
33 Viewing Ranking as Classification Formulizing ranking problem as classification of example pairs Measuring the loss of h by inversions y y yˆ ˆ y Ly ˆ ˆ ˆ ˆ y y y = y y y y 0 otherwise
34 Viewing Ranking as Classification cont Given S = { x y } = m m i= j= m i i i fin h ERM that minimizes Ly y hx hx i j i j Euivalently fin a f h that minimizes x x z S' i i i L f x x z 0 h i i i + yi yj where S ' = xi x j z = : xi yi x j y j S yi y j
35 Ranking Moel Moel: instances are ranke by Ux
36 Simulation Experiment Low penalty Error rate of pairs NDCG curve High penalty High penalty Our approach Ranking SVM NDCG Our approach Ranking SVM r-r0 r-r r-r0 All N
37 Information Desk Li et al. 005
38
39 Features -- what is
40 Features who is Bill Gates Go What is Who is Where is homepage of Who knows about What is Who is Where is homepage of Who knows about Bill Gates US-Executive-Chairman + 45 XXXXXXX XXXXXX CHRMN & CHIEF SFTWR ARCHITECT Documents of Bill Gates8 My avice to stuents: Eucation counts Evento NET Reviewers Seattle 7/8 Novembro Anotacoes Dago.oc A Vision for Life Long Learning Year 00 Library/Vision for Eucation_Microsoft_Final.oc Bill Gates answers most freuently aske uestions. >>more Top 0 terms appearing in ocuments of Bill Gates Term Term Term Term Term Term Term Term Term Term
41 Features where is homepage of
42 Features - who knows about
43 Summary
44 Summary Information retrieval = helping people access information Currently search = ranking Matching between uery an ocument Our work Search by Document Type Factoi Search New learning algorithm for ranking Many issues to stuy
45 References Brin S. an Page T.998 The Anatomy of a Large-Scale Hypertextual Web Search Engine. In : Proceeings of the seventh international conference on Worl Wie Web. Hang Li Yunbo Cao Jun Xu Yunhua Hu Shenjie Li an Dmitriy Meyerzon A New Approach to Intranet Search Base on Information Extraction. Proc. of ACM-CIKM'05 inustry track Herbrich R. Graepel T. & Obermayer K Large Margin Rank Bounaries for Orinal Regression.. Avances in Large Margin Classifiers pp Joachims T. 00 Optimizing Search Engines Using Clickthrough Data Proceeings of the ACM Conference on Knowlege Discovery an Data Mining. Lafferty J. an Zhai C. 00. Document Language Moels Query Moels an Risk Minimization for Information Retrieval. SIGIR Ponte J. M. an Croft W. B A language moeling approach to information retrieval. In Proceeings of ACM-SIGIR pp Robertson S. E. an Walker S. 994 Okapi at TREC 3. In Proceeings of TREC Yunbo Cao Jingjing Liu Shenghua Bao an Hang Li 005 Microsoft Research Asia MSRA at Enterprise Track of TREC 005: Expert Search. In Proceeings of TREC.
Learning to Rank: A New Technology for Text Processing
TFANT 07 Tokyo Univ. March 2, 2007 Learning to Rank: A New Technology for Text Processing Hang Li Microsoft Research Asia Talk Outline What is Learning to Rank? Ranking SVM Definition Search Ranking SVM
More informationTREC-10 Web Track Experiments at MSRA
TREC-10 Web Track Experiments at MSRA Jianfeng Gao*, Guihong Cao #, Hongzhao He #, Min Zhang ##, Jian-Yun Nie**, Stephen Walker*, Stephen Robertson* * Microsoft Research, {jfgao,sw,ser}@microsoft.com **
More informationA Few Things to Know about Machine Learning for Web Search
AIRS 2012 Tianjin, China Dec. 19, 2012 A Few Things to Know about Machine Learning for Web Search Hang Li Noah s Ark Lab Huawei Technologies Talk Outline My projects at MSRA Some conclusions from our research
More informationA Machine Learning Approach for Information Retrieval Applications. Luo Si. Department of Computer Science Purdue University
A Machine Learning Approach for Information Retrieval Applications Luo Si Department of Computer Science Purdue University Why Information Retrieval: Information Overload: Since the introduction of digital
More informationInformation Retrieval
Information Retrieval Learning to Rank Ilya Markov i.markov@uva.nl University of Amsterdam Ilya Markov i.markov@uva.nl Information Retrieval 1 Course overview Offline Data Acquisition Data Processing Data
More informationLETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval Tie-Yan Liu 1, Jun Xu 1, Tao Qin 2, Wenying Xiong 3, and Hang Li 1
LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval Tie-Yan Liu 1, Jun Xu 1, Tao Qin 2, Wenying Xiong 3, and Hang Li 1 1 Microsoft Research Asia, No.49 Zhichun Road, Haidian
More informationUniversity of Waterloo: Logistic Regression and Reciprocal Rank Fusion at the Microblog Track
University of Waterloo: Logistic Regression and Reciprocal Rank Fusion at the Microblog Track Adam Roegiest and Gordon V. Cormack David R. Cheriton School of Computer Science, University of Waterloo 1
More informationExtracting Search-Focused Key N-Grams for Relevance Ranking in Web Search
Extracting Search-Focused Key N-Grams for Relevance Ranking in Web Search Chen Wang Fudan University Shanghai, P.R.China chen.wang0517@gmail.com Hang Li Microsoft Research Asia Beijing, P.R.China hangli@microsoft.com
More informationCoupling the User Interfaces of a Multiuser Program
Coupling the User Interfaces of a Multiuser Program PRASUN DEWAN University of North Carolina at Chapel Hill RAJIV CHOUDHARY Intel Corporation We have evelope a new moel for coupling the user-interfaces
More informationCWS: : A Comparative Web Search System
CWS: : A Comparative Web Search System Jian-Tao Sun, Xuanhui Wang, Dou Shen Hua-Jun Zeng, Zheng Chen Microsoft Research Asia University of Illinois at Urbana-Champaign Hong Kong University of Science and
More informationMining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data
Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data Misha Bilenko and Ryen White presented by Matt Richardson Microsoft Research Search = Modeling User Behavior
More informationRMIT University at TREC 2006: Terabyte Track
RMIT University at TREC 2006: Terabyte Track Steven Garcia Falk Scholer Nicholas Lester Milad Shokouhi School of Computer Science and IT RMIT University, GPO Box 2476V Melbourne 3001, Australia 1 Introduction
More informationBUPT at TREC 2009: Entity Track
BUPT at TREC 2009: Entity Track Zhanyi Wang, Dongxin Liu, Weiran Xu, Guang Chen, Jun Guo Pattern Recognition and Intelligent System Lab, Beijing University of Posts and Telecommunications, Beijing, China,
More informationAdjacency Matrix Based Full-Text Indexing Models
1000-9825/2002/13(10)1933-10 2002 Journal of Software Vol.13, No.10 Ajacency Matrix Base Full-Text Inexing Moels ZHOU Shui-geng 1, HU Yun-fa 2, GUAN Ji-hong 3 1 (Department of Computer Science an Engineering,
More informationRe-ranking Documents Based on Query-Independent Document Specificity
Re-ranking Documents Based on Query-Independent Document Specificity Lei Zheng and Ingemar J. Cox Department of Computer Science University College London London, WC1E 6BT, United Kingdom lei.zheng@ucl.ac.uk,
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationFall Lecture 16: Learning-to-rank
Fall 2016 CS646: Information Retrieval Lecture 16: Learning-to-rank Jiepu Jiang University of Massachusetts Amherst 2016/11/2 Credit: some materials are from Christopher D. Manning, James Allan, and Honglin
More informationLearning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li
Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,
More informationNortheastern University in TREC 2009 Million Query Track
Northeastern University in TREC 2009 Million Query Track Evangelos Kanoulas, Keshi Dai, Virgil Pavlu, Stefan Savev, Javed Aslam Information Studies Department, University of Sheffield, Sheffield, UK College
More informationLarge Scale Chinese News Categorization. Peng Wang. Joint work with H. Zhang, B. Xu, H.W. Hao
Large Scale Chinese News Categorization --based on Improved Feature Selection Method Peng Wang Joint work with H. Zhang, B. Xu, H.W. Hao Computational-Brain Research Center Institute of Automation, Chinese
More informationNTU Approaches to Subtopic Mining and Document Ranking at NTCIR-9 Intent Task
NTU Approaches to Subtopic Mining and Document Ranking at NTCIR-9 Intent Task Chieh-Jen Wang, Yung-Wei Lin, *Ming-Feng Tsai and Hsin-Hsi Chen Department of Computer Science and Information Engineering,
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationDetecting Multilingual and Multi-Regional Query Intent in Web Search
Detecting Multilingual and Multi-Regional Query Intent in Web Search Yi Chang, Ruiqiang Zhang, Srihari Reddy Yahoo! Labs 701 First Avenue Sunnyvale, CA 94089 {yichang,ruiqiang,sriharir}@yahoo-inc.com Yan
More informationPreamble. Singly linked lists. Collaboration policy and academic integrity. Getting help
CS2110 Spring 2016 Assignment A. Linke Lists Due on the CMS by: See the CMS 1 Preamble Linke Lists This assignment begins our iscussions of structures. In this assignment, you will implement a structure
More informationEstimating Credibility of User Clicks with Mouse Movement and Eye-tracking Information
Estimating Credibility of User Clicks with Mouse Movement and Eye-tracking Information Jiaxin Mao, Yiqun Liu, Min Zhang, Shaoping Ma Department of Computer Science and Technology, Tsinghua University Background
More informationClassification and Comparative Analysis of Passage Retrieval Methods
Classification and Comparative Analysis of Passage Retrieval Methods Dr. K. Saruladha, C. Siva Sankar, G. Chezhian, L. Lebon Iniyavan, N. Kadiresan Computer Science Department, Pondicherry Engineering
More informationAn Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique
An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique 60 2 Within-Subjects Design Counter Balancing Learning Effect 1 [1 [2www.worldwidewebsize.com
More informationUniversity of Delaware at Diversity Task of Web Track 2010
University of Delaware at Diversity Task of Web Track 2010 Wei Zheng 1, Xuanhui Wang 2, and Hui Fang 1 1 Department of ECE, University of Delaware 2 Yahoo! Abstract We report our systems and experiments
More informationEffect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching
Effect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching Wolfgang Tannebaum, Parvaz Madabi and Andreas Rauber Institute of Software Technology and Interactive Systems, Vienna
More informationSTUDYING OF CLASSIFYING CHINESE SMS MESSAGES
STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2
More informationKinematic Analysis of a Family of 3R Manipulators
Kinematic Analysis of a Family of R Manipulators Maher Baili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S. 6597 1, rue e la Noë, BP 92101,
More informationActive Evaluation of Ranking Functions based on Graded Relevance (Extended Abstract)
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Active Evaluation of Ranking Functions based on Graded Relevance (Extended Abstract) Christoph Sawade sawade@cs.uni-potsdam.de
More informationLearning to Rank. from heuristics to theoretic approaches. Hongning Wang
Learning to Rank from heuristics to theoretic approaches Hongning Wang Congratulations Job Offer from Bing Core Ranking team Design the ranking module for Bing.com CS 6501: Information Retrieval 2 How
More informationAPPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM. Abdelmgeid A. Aly
International Journal "Information Technologies an Knowlege" Vol. / 2007 309 [Project MINERVAEUROPE] Project MINERVAEUROPE: Ministerial Network for Valorising Activities in igitalisation -
More informationA Study of Collection-based Features for Adapting the Balance Parameter in Pseudo Relevance Feedback
A Study of Collection-based Features for Adapting the Balance Parameter in Pseudo Relevance Feedback Ye Meng 1, Peng Zhang 1, Dawei Song 1,2, and Yuexian Hou 1 1 Tianjin Key Laboratory of Cognitive Computing
More informationImproving Web Search Ranking by Incorporating User Behavior Information
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Microsoft Research eugeneag@microsoft.com Eric Brill Microsoft Research brill@microsoft.com Susan Dumais Microsoft
More informationComment Extraction from Blog Posts and Its Applications to Opinion Mining
Comment Extraction from Blog Posts and Its Applications to Opinion Mining Huan-An Kao, Hsin-Hsi Chen Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan
More informationStudy of Network Optimization Method Based on ACL
Available online at www.scienceirect.com Proceia Engineering 5 (20) 3959 3963 Avance in Control Engineering an Information Science Stuy of Network Optimization Metho Base on ACL Liu Zhian * Department
More informationClassification and clustering methods for documents. by probabilistic latent semantic indexing model
A Short Course at amang University aipei, aiwan, R.O.C., March 7-9, 2006 Classification an clustering methos for ocuments by probabilistic latent semantic inexing moel Shigeichi Hirasawa Department of
More informationCloud Search Service Product Introduction. Issue 01 Date HUAWEI TECHNOLOGIES CO., LTD.
1.3.15 Issue 01 Date 2018-11-21 HUAWEI TECHNOLOGIES CO., LTD. Copyright Huawei Technologies Co., Lt. 2019. All rights reserve. No part of this ocument may be reprouce or transmitte in any form or by any
More informationWebSci and Learning to Rank for IR
WebSci and Learning to Rank for IR Ernesto Diaz-Aviles L3S Research Center. Hannover, Germany diaz@l3s.de Ernesto Diaz-Aviles www.l3s.de 1/16 Motivation: Information Explosion Ernesto Diaz-Aviles
More informationElement-based
Proceeings of te 12t NTCIR Conference on Evaluation of Information Access Tecnologies June 7-10 2016 Tokyo Japan Element-base Retrieval@MobileClick-2 Atsusi eyaki Tokyo Institute of Tecnology keyaki@lsc.cs.titec.ac.jp
More informationText Particles Multi-band Fusion for Robust Text Detection
Text Particles Multi-ban Fusion for Robust Text Detection Pengfei Xu, Rongrong Ji, Hongxun Yao, Xiaoshuai Sun, Tianqiang Liu, an Xianming Liu School of Computer Science an Engineering Harbin Institute
More informationLearning to Rank for Faceted Search Bridging the gap between theory and practice
Learning to Rank for Faceted Search Bridging the gap between theory and practice Agnes van Belle @ Berlin Buzzwords 2017 Job-to-person search system Generated query Match indicator Faceted search Multiple
More informationAdvanced Topics in Information Retrieval. Learning to Rank. ATIR July 14, 2016
Advanced Topics in Information Retrieval Learning to Rank Vinay Setty vsetty@mpi-inf.mpg.de Jannik Strötgen jannik.stroetgen@mpi-inf.mpg.de ATIR July 14, 2016 Before we start oral exams July 28, the full
More informationA Classification of 3R Orthogonal Manipulators by the Topology of their Workspace
A Classification of R Orthogonal Manipulators by the Topology of their Workspace Maher aili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S.
More informationA Machine Learning Approach for Improved BM25 Retrieval
A Machine Learning Approach for Improved BM25 Retrieval Krysta M. Svore and Christopher J. C. Burges Microsoft Research One Microsoft Way Redmond, WA 98052 {ksvore,cburges}@microsoft.com Microsoft Research
More informationFocused Retrieval Using Topical Language and Structure
Focused Retrieval Using Topical Language and Structure A.M. Kaptein Archives and Information Studies, University of Amsterdam Turfdraagsterpad 9, 1012 XT Amsterdam, The Netherlands a.m.kaptein@uva.nl Abstract
More informationExperiments on Related Entity Finding Track at TREC 2009 Qing Yang,Peng Jiang, Chunxia Zhang, Zhendong Niu
Experiments on Related Entity Finding Track at TREC 2009 Qing Yang,Peng Jiang, Chunxia Zhang, Zhendong Niu School of Computer, Beijing Institute of Technology { yangqing2005,jp, cxzhang, zniu}@bit.edu.cn
More informationLizhe Sun. November 17, Florida State University. Ranking in Statistics and Machine Learning. Lizhe Sun. Introduction
in in Florida State University November 17, 2017 Framework in 1. our life 2. Early work: Model Examples 3. webpage Web page search modeling Data structure Data analysis with machine learning algorithms
More informationLearning to Rank Only Using Training Data from Related Domain
Learning to Rank Only Using Training Data from Related Domain Wei Gao, Peng Cai 2, Kam-Fai Wong, and Aoying Zhou 2 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China {wgao, kfwong}@se.cuhk.edu.hk
More informationA New Approach to Query Segmentation for Relevance Ranking in Web Search
Noname manuscript No. (will be inserted by the editor) A New Approach to Query Segmentation for Relevance Ranking in Web Search Haocheng Wu Yunhua Hu Hang Li Enhong Chen Received: date / Accepted: date
More informationChapter 8. Evaluating Search Engine
Chapter 8 Evaluating Search Engine Evaluation Evaluation is key to building effective and efficient search engines Measurement usually carried out in controlled laboratory experiments Online testing can
More informationPrior Art Retrieval Using Various Patent Document Fields Contents
Prior Art Retrieval Using Various Patent Document Fields Contents Metti Zakaria Wanagiri and Mirna Adriani Fakultas Ilmu Komputer, Universitas Indonesia Depok 16424, Indonesia metti.zakaria@ui.edu, mirna@cs.ui.ac.id
More informationDocument indexing, similarities and retrieval in large scale text collections
Document indexing, similarities and retrieval in large scale text collections Eric Gaussier Univ. Grenoble Alpes - LIG Eric.Gaussier@imag.fr Eric Gaussier Document indexing, similarities & retrieval 1
More informationCAMERAS AND GRAVITY: ESTIMATING PLANAR OBJECT ORIENTATION. Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen
CAMERAS AND GRAVITY: ESTIMATING PLANAR OBJECT ORIENTATION Zhaoyin Jia, Anrew Gallagher, Tsuhan Chen School of Electrical an Computer Engineering, Cornell University ABSTRACT Photography on a mobile camera
More informationCHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING
94 CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 5.1 INTRODUCTION Expert locator addresses the task of identifying the right person with the appropriate skills and knowledge. In large organizations, it
More informationTopic-Level Random Walk through Probabilistic Model
Topic-Level Random Walk through Probabilistic Model Zi Yang, Jie Tang, Jing Zhang, Juanzi Li, and Bo Gao Department of Computer Science & Technology, Tsinghua University, China Abstract. In this paper,
More informationInformation Retrieval
Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have
More informationPromoting Ranking Diversity for Biomedical Information Retrieval based on LDA
Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA Yan Chen, Xiaoshi Yin, Zhoujun Li, Xiaohua Hu and Jimmy Huang State Key Laboratory of Software Development Environment, Beihang
More informationQuery Likelihood with Negative Query Generation
Query Likelihood with Negative Query Generation Yuanhua Lv Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801 ylv2@uiuc.edu ChengXiang Zhai Department of Computer
More informationShift-map Image Registration
Shift-map Image Registration Linus Svärm Petter Stranmark Centre for Mathematical Sciences, Lun University {linus,petter}@maths.lth.se Abstract Shift-map image processing is a new framework base on energy
More informationSupervised Reranking for Web Image Search
for Web Image Search Query: Red Wine Current Web Image Search Ranking Ranking Features http://www.telegraph.co.uk/306737/red-wineagainst-radiation.html 2 qd, 2.5.5 0.5 0 Linjun Yang and Alan Hanjalic 2
More informationSNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task
SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of Korea wakeup06@empas.com, jinchoi@snu.ac.kr
More informationCluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters
Available online at www.scienceirect.com Proceia Engineering 4 (011 ) 34 38 011 International Conference on Avances in Engineering Cluster Center Initialization Metho for K-means Algorithm Over Data Sets
More informationStructured Ranking Learning using Cumulative Distribution Networks
Structured Ranking Learning using Cumulative Distribution Networks Jim C. Huang Probabilistic and Statistical Inference Group University of Toronto Toronto, ON, Canada M5S 3G4 jim@psi.toronto.edu Brendan
More informationA Metric for Inferring User Search Goals in Search Engines
International Journal of Engineering and Technical Research (IJETR) A Metric for Inferring User Search Goals in Search Engines M. Monika, N. Rajesh, K.Rameshbabu Abstract For a broad topic, different users
More informationMultilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis
Multilevel Linear Dimensionality Reuction using Hypergraphs for Data Analysis Haw-ren Fang Department of Computer Science an Engineering University of Minnesota; Minneapolis, MN 55455 hrfang@csumneu ABSTRACT
More informationAuthor: Yunqing Xia, Zhongda Xie, Qiuge Zhang, Huiyuan Zhao, Huan Zhao Presenter: Zhongda Xie
Author: Yunqing Xia, Zhongda Xie, Qiuge Zhang, Huiyuan Zhao, Huan Zhao Presenter: Zhongda Xie Outline 1.Introduction 2.Motivation 3.Methodology 4.Experiments 5.Conclusion 6.Future Work 2 1.Introduction(1/3)
More informationMining the Web for Multimedia-based Enriching
Mining the Web for Multimedia-based Enriching Mathilde Sahuguet and Benoit Huet Eurecom, Sophia-Antipolis, France Abstract. As the amount of social media shared on the Internet grows increasingly, it becomes
More informationModern Retrieval Evaluations. Hongning Wang
Modern Retrieval Evaluations Hongning Wang CS@UVa What we have known about IR evaluations Three key elements for IR evaluation A document collection A test suite of information needs A set of relevance
More informationPh.D. in Computer Science & Technology, Tsinghua University, Beijing, China, 2007
Yiqun Liu Associate Professor & Department co-chair Department of Computer Science and Technology Email yiqunliu@tsinghua.edu.cn URL http://www.thuir.org/group/~yqliu Phone +86-10-62796672 Fax +86-10-62796672
More informationLocal Path Planning with Proximity Sensing for Robot Arm Manipulators. 1. Introduction
Local Path Planning with Proximity Sensing for Robot Arm Manipulators Ewar Cheung an Vlaimir Lumelsky Yale University, Center for Systems Science Department of Electrical Engineering New Haven, Connecticut
More informationLecture 3: Improving Ranking with
Modeling User Behavior and Interactions Lecture 3: Improving Ranking with Behavior Data 1 * &!" ) & $ & 6+ $ & 6+ + "& & 8 > " + & 2 Lecture 3 Plan $ ( ' # $ #! & $ #% #"! #! -( #", # + 4 / 0 3 21 0 /
More information0607 CAMBRIDGE INTERNATIONAL MATHEMATICS
PAPA CAMBRIDGE CAMBRIDGE INTERNATIONAL EXAMINATIONS International General Certificate of Seconary Eucation MARK SCHEME for the May/June 0 series CAMBRIDGE INTERNATIONAL MATHEMATICS /4 4 (Extene), maximum
More informationA Deep Relevance Matching Model for Ad-hoc Retrieval
A Deep Relevance Matching Model for Ad-hoc Retrieval Jiafeng Guo 1, Yixing Fan 1, Qingyao Ai 2, W. Bruce Croft 2 1 CAS Key Lab of Web Data Science and Technology, Institute of Computing Technology, Chinese
More information0607 CAMBRIDGE INTERNATIONAL MATHEMATICS
CAMBRIDGE INTERNATIONAL EXAMINATIONS International General Certificate of Seconary Eucation MARK SCHEME for the May/June 03 series 0607 CAMBRIDGE INTERNATIONAL MATHEMATICS 0607/4 Paper 4 (Extene), maximum
More informationEntity and Knowledge Base-oriented Information Retrieval
Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061
More informationResPubliQA 2010
SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first
More informationA Survey on Postive and Unlabelled Learning
A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled
More informationRishiraj Saha Roy and Niloy Ganguly IIT Kharagpur India. Monojit Choudhury and Srivatsan Laxman Microsoft Research India India
Rishiraj Saha Roy and Niloy Ganguly IIT Kharagpur India Monojit Choudhury and Srivatsan Laxman Microsoft Research India India ACM SIGIR 2012, Portland August 15, 2012 Dividing a query into individual semantic
More informationVerbose Query Reduction by Learning to Rank for Social Book Search Track
Verbose Query Reduction by Learning to Rank for Social Book Search Track Messaoud CHAA 1,2, Omar NOUALI 1, Patrice BELLOT 3 1 Research Center on Scientific and Technical Information 05 rue des 03 frères
More informationAlmost Disjunct Codes in Large Scale Multihop Wireless Network Media Access Control
Almost Disjunct Coes in Large Scale Multihop Wireless Network Meia Access Control D. Charles Engelhart Anan Sivasubramaniam Penn. State University University Park PA 682 engelhar,anan @cse.psu.eu Abstract
More informationOptimizing Search Engines using Click-through Data
Optimizing Search Engines using Click-through Data By Sameep - 100050003 Rahee - 100050028 Anil - 100050082 1 Overview Web Search Engines : Creating a good information retrieval system Previous Approaches
More informationLinear feature-based models for information retrieval
Inf Retrieval (2007) 10:257 274 DOI 10.1007/s10791-006-9019-z Linear feature-based models for information retrieval Donald Metzler W. Bruce Croft Received: 6 October 2005 / Accepted: 27 October 2006 /
More informationRanking models in Information Retrieval: A Survey
Ranking models in Information Retrieval: A Survey R.Suganya Devi Research Scholar Department of Computer Science and Engineering College of Engineering, Guindy, Chennai, Tamilnadu, India Dr D Manjula Professor
More informationExternal Query Reformulation for Text-based Image Retrieval
External Query Reformulation for Text-based Image Retrieval Jinming Min and Gareth J. F. Jones Centre for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland {jmin,gjones}@computing.dcu.ie
More informationBoolean Model. Hongning Wang
Boolean Model Hongning Wang CS@UVa Abstraction of search engine architecture Indexed corpus Crawler Ranking procedure Doc Analyzer Doc Representation Query Rep Feedback (Query) Evaluation User Indexer
More informationA Supervised Method for Multi-keyword Web Crawling on Web Forums
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,
More informationAutomatic Search Engine Evaluation with Click-through Data Analysis. Yiqun Liu State Key Lab of Intelligent Tech. & Sys Jun.
Automatic Search Engine Evaluation with Click-through Data Analysis Yiqun Liu State Key Lab of Intelligent Tech. & Sys Jun. 3th, 2007 Recent work: Using query log and click-through data analysis to: identify
More informationJames Mayfield! The Johns Hopkins University Applied Physics Laboratory The Human Language Technology Center of Excellence!
James Mayfield! The Johns Hopkins University Applied Physics Laboratory The Human Language Technology Center of Excellence! (301) 219-4649 james.mayfield@jhuapl.edu What is Information Retrieval? Evaluation
More informationInformation Retrieval I
Information Retrieval I David Hawking 30 Sep 2010 Machine Learning Summer School, ANU Session Outline Ranking documents in response to a query Measuring the quality of such rankings Case Study: Tuning
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationAdapting Ranking Functions to User Preference
Adapting Ranking Functions to User Preference Keke Chen, Ya Zhang, Zhaohui Zheng, Hongyuan Zha, Gordon Sun Yahoo! {kchen,yazhang,zhaohui,zha,gzsun}@yahoo-inc.edu Abstract Learning to rank has become a
More informationUMass at TREC 2006: Enterprise Track
UMass at TREC 2006: Enterprise Track Desislava Petkova and W. Bruce Croft Center for Intelligent Information Retrieval Department of Computer Science University of Massachusetts, Amherst, MA 01003 Abstract
More informationHierarchical Location and Topic Based Query Expansion
Hierarchical Location and Topic Based Query Expansion Shu Huang 1 Qiankun Zhao 2 Prasenjit Mitra 1 C. Lee Giles 1 Information Sciences and Technology 1 AOL Research Lab 2 Pennsylvania State University
More informationModeling Query Term Dependencies in Information Retrieval with Markov Random Fields
Modeling Query Term Dependencies in Information Retrieval with Markov Random Fields Donald Metzler metzler@cs.umass.edu W. Bruce Croft croft@cs.umass.edu Department of Computer Science, University of Massachusetts,
More informationLetter Pair Similarity Classification and URL Ranking Based on Feedback Approach
Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India
More informationSplog Detection Using Self-Similarity Analysis on Blog Temporal Dynamics. Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng
Splog Detection Using Self-Similarity Analysis on Blog Temporal Dynamics Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng NEC Laboratories America, Cupertino, CA AIRWeb Workshop 2007
More informationNatural Language Processing SoSe Question Answering. (based on the slides of Dr. Saeedeh Momtazi)
Natural Language Processing SoSe 2015 Question Answering Dr. Mariana Neves July 6th, 2015 (based on the slides of Dr. Saeedeh Momtazi) Outline 2 Introduction History QA Architecture Outline 3 Introduction
More information