Lizhe Sun. November 17, Florida State University. Ranking in Statistics and Machine Learning. Lizhe Sun. Introduction
|
|
- Britney Murphy
- 6 years ago
- Views:
Transcription
1 in in Florida State University November 17, 2017
2 Framework in 1. our life 2. Early work: Model Examples 3. webpage Web page search modeling Data structure Data analysis with machine learning algorithms
3 in Part I:
4 s in our life in Wine If I have K brands of wine of some type, like sauvignon blanc, how to rank them?
5 in sports in Tennis Who is the best tennis player in this year? ATP s WTA s
6 in Tennis players in
7 Web pages search in
8 models: overview in learning and statistical models for model: based on pairwise evaluations. Web search model: the webpages/documents according to their relevance to query.
9 in Part II: model
10 model in Let P ab denote the probability that a is preferred to b. Suppose P ab + P ba = 1 for all pairs; that is, we assume a tie cannot occur. model: Alternatively, P ab log( P ab ) = log( ) = β a β b P ba 1 P ab P ab = exp(β a )/(exp(β a ) + exp(β b )) Thus, P ab = 1 2 when β a = β b and P ab > 1 2 when β a > β b. ( ) I Residual df = (I 1) 2
11 model in Assumption: the samples of evaluation are independent, and the evaluations for different pairs are also independent. We can use logistic methods to fit the model.
12 Example 1: Major League Baseball s in Table: Results of 2011 Season for American League (Eastern Division) Baseball Teams Losing Team Winning Team Boston New York Tampa Bay Toronto Baltimore Boston New York Tampa Bay Toronto Baltimore Data source: Agresti, Alan, and Maria Kateri. Categorical data analysis
13 model in Table: Results of Fitting Bradley-Terry Model to Baseball Data Team Winning Percentage ˆβ a SE Boston New York Tampa Bay Toronto Baltimore
14 R output in
15 Example 2: Tennis player in Table: Head-to-head records of players in the ATP top 20 (update to 10/30/2017) Loser Winner Rafael Nadal Roger Federer Andy Murray Novak Djokovic Stanislas Wawrinka Rafael Nadal Roger Federer Andy Murray Novak Djokovic Stanislas Wawrinka data source:
16 R output for tennis players in
17 Model extension: home team advantage in Let Pab denote the probability that team a beats team b, when a is the home team. Consider the logistic model P ab log( 1 Pab ) = α + β a β b, when α > 0, a home field advantage exists.
18 Part III: webpage search model in Outline Problem description and model introduction Global V.S. subset Data structure Query&webpage pair and feature vector Example: to rank competition Data analysis GBRT igbrt
19 Problem description in In general, we want to rank a set of documents/webpages according to their relevance to a given query. In machine learning communities, learning to rank is a supervised learning.
20 machine learning framework in function h(x). Extract a set of feature vectors x for each query-document pair, and train a function h(x). Rank the documents/web-pages using the value of h(x). For x with a larger value in h(x), we can say that x is ranked higher.
21 Global and subset in In the aforementioned model, given a query, we will rank all the documents in the training dataset. However, in application, we do not need to rank all documents for a given query. Thus, the subset model may be more popular in application.
22 Subset model: web search example in Filtering procedure: when the search engine system takes a query, it will use a simple algorithm for initial filtering, which limits the s to an initial pool {p j } of size m (e.g., m = ). Here, {p j } is returned wed page, j = 1, 2,, m. After this initial, the system use a more complicated algorithm to reorder the s in the pool.
23 Data Structure in Table: A example of training data structure Query Documents Feature Vector Score q1 p1 x1 1 y1 1 p2 x2 1 y2 1 pm1 xm1 1 ym1 1 q2 p1 x1 2 y1 2 p2 x2 2 y2 2 pm2 xm2 2 ym2 2 q3
24 Data description in The training data can be formally represented as: {(x q j, y q j )}, where q goes from 1 to n, the number of queries, j goes from 1 to m q, the number of documents for query q. x q j R p is a p-dimensional feature vector for the pair of query q and the j th document for this query, y q j is the relevance label for x q j.
25 Data description: features in The main categories for feature x: Web graph, Document statistics, Document classifier, Query, Text match, Topical matching, Click, External references, Time. The grade y indicates the degree of relevance of this document to its corresponding query. For example, each grade can be one element in the ordinal set, {perfect, excellent, good, fair, bad} and is labeled by human editors.
26 Example: datasets in learning to rank competition in Table: Datasets released for the challenge dataset1 datasets Train Valid Test Train Valid Test Queries Documents Features Table: Distribution of relevance labels Grade Label dataset1 dataset2 Perfect % 1.89% Excellent % 7.67% Good % 28.55% Fair % 35.80% Bad % 26.09%
27 Example: datasets in learning to rank competition in Figure: The number of documents associated with each query.
28 Data analysis: overview in There are so many algorithms. Generally, current algorithms can be divided into three categories, according to different objective functions optimization: Pointwise: a regression loss or a classification loss. Pairwise: a pairwise loss function. q m q i,j,y q i >y q j l(h(x q i ) h(x q j )) Listwise: The loss function is defined over all the documents associated with query.
29 Model evaluation criteria in The Discounted Cumulative Gain (DCG) has been widely used to assess relevance in the context of search engines. A simple variation of DCG: DCG m = m G j /log(j + 1) j=1 where G j represents the weights assigned to the label of the document at position j. We also have expected reciprocal rank (ERR) and NDCG as model evaluation criteria.
30 Data analysis: learning algorithms in In this presentation, we focus on point-wise methods. Gradient Boosted Regression Trees (GBRT) is a very powerful tool to solve s search. igbrt: the initial residual of GBRT r i = y i F (x i ), F (x i ) is the estimator of RandomForests.
31 Regression vs. Classification in In practice, classification is better than regression. In classification, instead of training a function h(x i ) y i, we generate binary classification s, such as c = 1, 2, 3, 4 for y {0, 1, 2, 3, 4}. The c th classification predicts if the document is less relevant than c, i.e., y i < c. For each of these binary classification s, we train a classifier h c ( ): h c ( ) = P(rel(x i ) < c). In this example, we also define h 0 ( ) = 0 and h 5 ( ) = 1. Thus, we can combine all classifiers h 0, h 1,, h 5 to compute the probability of each class.
32 Regression vs. Classification in In our example, we compute the probability that a document x i has a relevance of r {0, 1, 2, 3, 4} And P(rel(x i ) = r) = P(rel(x i ) < r + 1) P(rel(x i ) < r) = h r+1 (x i ) h r (x i )
33 Results for data analysis in Table: Performance of GBRT, RF,iGBRT. All results are evaluated in ERR and NDCG. ERR method Regr./Class. dataset1 dataset2 GBRT R RF R igbrt R GBRT C RF C igbrt C NDCG method Regr./Class. dataset1 dataset2 GBRT R RF R igbrt R GBRT C RF C igbrt C
34 Future study in There are many important topics do not cover in this presentation. algorithms about pairwise and listwise. Other models, like recommendation system. theory about learning to rank. Online learning, etc.
35 Thank you in Reference Chapelle, Olivier, and Yi Chang. Yahoo! learning to rank challenge overview. Proceedings of the to Rank Challenge Cossock, David, and Tong Zhang. Statistical analysis of Bayes optimal subset. IEEE Transactions on Information Theory (2008): Chapelle, Olivier, Yi Chang, and T-Y. Liu. Future directions in learning to rank. Proceedings of the to Rank Challenge Li, Ping, Qiang Wu, and Christopher J. Burges. Mcrank: to rank using multiple classification and gradient boosting. Advances in neural information processing systems
36 Thank you in Mohan, Ananth, Zheng Chen, and Kilian Weinberger. Web-search with initialized gradient boosted regression trees. Proceedings of the to Rank Challenge Zheng, Zhaohui, et al. A general boosting method and its application to learning functions for web search. Advances in neural information processing systems
WebSci and Learning to Rank for IR
WebSci and Learning to Rank for IR Ernesto Diaz-Aviles L3S Research Center. Hannover, Germany diaz@l3s.de Ernesto Diaz-Aviles www.l3s.de 1/16 Motivation: Information Explosion Ernesto Diaz-Aviles
More informationAn Empirical Analysis on Point-wise Machine Learning Techniques using Regression Trees for Web-search Ranking
Washington University in St. Louis Washington University Open Scholarship All Theses and Dissertations (ETDs) January 2010 An Empirical Analysis on Point-wise Machine Learning Techniques using Regression
More informationParallel Boosted Regression Trees for Web Search Ranking
Parallel Boosted Regression Trees for Web Search Ranking Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal, Jennifer Paykin Department of Computer Science and Engineering, Washington University in St.
More informationActive Evaluation of Ranking Functions based on Graded Relevance (Extended Abstract)
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Active Evaluation of Ranking Functions based on Graded Relevance (Extended Abstract) Christoph Sawade sawade@cs.uni-potsdam.de
More informationChapter 8. Evaluating Search Engine
Chapter 8 Evaluating Search Engine Evaluation Evaluation is key to building effective and efficient search engines Measurement usually carried out in controlled laboratory experiments Online testing can
More informationCSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation"
CSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation" All slides Addison Wesley, Donald Metzler, and Anton Leuski, 2008, 2012! Evaluation" Evaluation is key to building
More informationRetrieval Evaluation. Hongning Wang
Retrieval Evaluation Hongning Wang CS@UVa What we have learned so far Indexed corpus Crawler Ranking procedure Research attention Doc Analyzer Doc Rep (Index) Query Rep Feedback (Query) Evaluation User
More informationLearning to rank, a supervised approach for ranking of documents Master Thesis in Computer Science - Algorithms, Languages and Logic KRISTOFER TAPPER
Learning to rank, a supervised approach for ranking of documents Master Thesis in Computer Science - Algorithms, Languages and Logic KRISTOFER TAPPER Chalmers University of Technology University of Gothenburg
More informationLearning to Rank: A New Technology for Text Processing
TFANT 07 Tokyo Univ. March 2, 2007 Learning to Rank: A New Technology for Text Processing Hang Li Microsoft Research Asia Talk Outline What is Learning to Rank? Ranking SVM Definition Search Ranking SVM
More informationInformation Retrieval
Information Retrieval Learning to Rank Ilya Markov i.markov@uva.nl University of Amsterdam Ilya Markov i.markov@uva.nl Information Retrieval 1 Course overview Offline Data Acquisition Data Processing Data
More informationRanking and Learning. Table of Content. Weighted scoring for ranking Learning to rank: A simple example Learning to ranking as classification.
Table of Content anking and Learning Weighted scoring for ranking Learning to rank: A simple example Learning to ranking as classification 290 UCSB, Tao Yang, 2013 Partially based on Manning, aghavan,
More informationUsing Machine Learning to Optimize Storage Systems
Using Machine Learning to Optimize Storage Systems Dr. Kiran Gunnam 1 Outline 1. Overview 2. Building Flash Models using Logistic Regression. 3. Storage Object classification 4. Storage Allocation recommendation
More informationAdvanced Topics in Information Retrieval. Learning to Rank. ATIR July 14, 2016
Advanced Topics in Information Retrieval Learning to Rank Vinay Setty vsetty@mpi-inf.mpg.de Jannik Strötgen jannik.stroetgen@mpi-inf.mpg.de ATIR July 14, 2016 Before we start oral exams July 28, the full
More informationSearch Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]
Search Evaluation Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Table of Content Search Engine Evaluation Metrics for relevancy Precision/recall F-measure MAP NDCG Difficulties in Evaluating
More informationLearning to Rank. Tie-Yan Liu. Microsoft Research Asia CCIR 2011, Jinan,
Learning to Rank Tie-Yan Liu Microsoft Research Asia CCIR 2011, Jinan, 2011.10 History of Web Search Search engines powered by link analysis Traditional text retrieval engines 2011/10/22 Tie-Yan Liu @
More informationMulti-Task Learning for Boosting with Application to Web Search Ranking
Multi-Task Learning for Boosting with Application to Web Search Ranking Olivier Chapelle Yahoo! Labs Sunnyvale, CA chap@yahoo-inc.com Kilian Weinberger Washington University Saint Louis, MO kilian@wustl.edu
More informationLearning to Rank. from heuristics to theoretic approaches. Hongning Wang
Learning to Rank from heuristics to theoretic approaches Hongning Wang Congratulations Job Offer from Bing Core Ranking team Design the ranking module for Bing.com CS 6501: Information Retrieval 2 How
More informationCombining SVMs with Various Feature Selection Strategies
Combining SVMs with Various Feature Selection Strategies Yi-Wei Chen and Chih-Jen Lin Department of Computer Science, National Taiwan University, Taipei 106, Taiwan Summary. This article investigates the
More informationGraph Algorithms Maximum Flow Applications
Chapter 5 Graph Algorithms Maximum Flow Applications Algorithm Theory WS 202/3 Fabian Kuhn Maximum Flow Applications Maximum flow has many applications Reducing a problem to a max flow problem can even
More informationEvaluation of Retrieval Systems
Performance Criteria Evaluation of Retrieval Systems 1 1. Expressiveness of query language Can query language capture information needs? 2. Quality of search results Relevance to users information needs
More informationArama Motoru Gelistirme Dongusu: Siralamayi Ogrenme ve Bilgiye Erisimin Degerlendirilmesi. Retrieval Effectiveness and Learning to Rank
Arama Motoru Gelistirme Dongusu: Siralamayi Ogrenme ve Bilgiye Erisimin Degerlendirilmesi etrieval Effectiveness and Learning to ank EMIE YILMAZ Professor and Turing Fellow University College London esearch
More informationEvaluating search engines CE-324: Modern Information Retrieval Sharif University of Technology
Evaluating search engines CE-324: Modern Information Retrieval Sharif University of Technology M. Soleymani Fall 2016 Most slides have been adapted from: Profs. Manning, Nayak & Raghavan (CS-276, Stanford)
More informationMulti-label classification using rule-based classifier systems
Multi-label classification using rule-based classifier systems Shabnam Nazmi (PhD candidate) Department of electrical and computer engineering North Carolina A&T state university Advisor: Dr. A. Homaifar
More informationLearning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li
Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,
More informationASCERTAINING THE RELEVANCE MODEL OF A WEB SEARCH-ENGINE BIPIN SURESH
ASCERTAINING THE RELEVANCE MODEL OF A WEB SEARCH-ENGINE BIPIN SURESH Abstract We analyze the factors contributing to the relevance of a web-page as computed by popular industry web search-engines. We also
More informationLearning to Rank for Information Retrieval. Tie-Yan Liu Lead Researcher Microsoft Research Asia
Learning to Rank for Information Retrieval Tie-Yan Liu Lead Researcher Microsoft Research Asia 4/20/2008 Tie-Yan Liu @ Tutorial at WWW 2008 1 The Speaker Tie-Yan Liu Lead Researcher, Microsoft Research
More informationTHIS LECTURE. How do we know if our results are any good? Results summaries: Evaluating a search engine. Making our good results usable to a user
EVALUATION Sec. 6.2 THIS LECTURE How do we know if our results are any good? Evaluating a search engine Benchmarks Precision and recall Results summaries: Making our good results usable to a user 2 3 EVALUATING
More informationS-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking Yi Yang * and Ming-Wei Chang # * Georgia Institute of Technology, Atlanta # Microsoft Research, Redmond Traditional
More informationImageNet Classification with Deep Convolutional Neural Networks
ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky Ilya Sutskever Geoffrey Hinton University of Toronto Canada Paper with same name to appear in NIPS 2012 Main idea Architecture
More informationEvaluation of Retrieval Systems
Performance Criteria Evaluation of Retrieval Systems. Expressiveness of query language Can query language capture information needs? 2. Quality of search results Relevance to users information needs 3.
More informationMachine Learning In Search Quality At. Painting by G.Troshkov
Machine Learning In Search Quality At Painting by G.Troshkov Russian Search Market Source: LiveInternet.ru, 2005-2009 A Yandex Overview 1997 Yandex.ru was launched 7 Search engine in the world * (# of
More informationA Survey on Postive and Unlabelled Learning
A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled
More informationInformation Retrieval
Introduction to Information Retrieval Evaluation Rank-Based Measures Binary relevance Precision@K (P@K) Mean Average Precision (MAP) Mean Reciprocal Rank (MRR) Multiple levels of relevance Normalized Discounted
More informationRanking with Query-Dependent Loss for Web Search
Ranking with Query-Dependent Loss for Web Search Jiang Bian 1, Tie-Yan Liu 2, Tao Qin 2, Hongyuan Zha 1 Georgia Institute of Technology 1 Microsoft Research Asia 2 Outline Motivation Incorporating Query
More informationSearch Engines and Learning to Rank
Search Engines and Learning to Rank Joseph (Yossi) Keshet Query processor Ranker Cache Forward index Inverted index Link analyzer Indexer Parser Web graph Crawler Representations TF-IDF To get an effective
More informationCSCI 5417 Information Retrieval Systems. Jim Martin!
CSCI 5417 Information Retrieval Systems Jim Martin! Lecture 7 9/13/2011 Today Review Efficient scoring schemes Approximate scoring Evaluating IR systems 1 Normal Cosine Scoring Speedups... Compute the
More informationScaling Up Decision Tree Ensembles
Scaling Up Decision Tree Ensembles Misha Bilenko (Microsoft) with Ron Bekkerman (LinkedIn) and John Langford (Yahoo!) http://hunch.net/~large_scale_survey Tree Ensembles: Basics Rule-based prediction is
More informationLearning to Rank for Faceted Search Bridging the gap between theory and practice
Learning to Rank for Faceted Search Bridging the gap between theory and practice Agnes van Belle @ Berlin Buzzwords 2017 Job-to-person search system Generated query Match indicator Faceted search Multiple
More informationEvaluation of Retrieval Systems
Performance Criteria Evaluation of Retrieval Systems. Expressiveness of query language Can query language capture information needs? 2. Quality of search results Relevance to users information needs 3.
More informationAdvances on the Development of Evaluation Measures. Ben Carterette Evangelos Kanoulas Emine Yilmaz
Advances on the Development of Evaluation Measures Ben Carterette Evangelos Kanoulas Emine Yilmaz Information Retrieval Systems Match information seekers with the information they seek Why is Evaluation
More informationParallel Boosted Regression Trees for Web Search Ranking
Parallel Boosted Regression Trees for Web Search Ranking Stephen Tyree swtyree@wustl.edu Kilian Q. Weinberger kilian@wustl.edu Department of Computer Science & Engineering Washington University in St.
More informationPersonalized Web Search
Personalized Web Search Dhanraj Mavilodan (dhanrajm@stanford.edu), Kapil Jaisinghani (kjaising@stanford.edu), Radhika Bansal (radhika3@stanford.edu) Abstract: With the increase in the diversity of contents
More informationLearning to Rank for Information Retrieval
Learning to Rank for Information Retrieval Tie-Yan Liu Learning to Rank for Information Retrieval Tie-Yan Liu Microsoft Research Asia Bldg #2, No. 5, Dan Ling Street Haidian District Beijing 100080 People
More informationLecture 20: Bagging, Random Forests, Boosting
Lecture 20: Bagging, Random Forests, Boosting Reading: Chapter 8 STATS 202: Data mining and analysis November 13, 2017 1 / 17 Classification and Regression trees, in a nut shell Grow the tree by recursively
More informationDocument indexing, similarities and retrieval in large scale text collections
Document indexing, similarities and retrieval in large scale text collections Eric Gaussier Univ. Grenoble Alpes - LIG Eric.Gaussier@imag.fr Eric Gaussier Document indexing, similarities & retrieval 1
More informationA Dynamic Bayesian Network Click Model for Web Search Ranking
A Dynamic Bayesian Network Click Model for Web Search Ranking Olivier Chapelle and Anne Ya Zhang Apr 22, 2009 18th International World Wide Web Conference Introduction Motivation Clicks provide valuable
More informationInformation Retrieval
Introduction to Information Retrieval CS276 Information Retrieval and Web Search Chris Manning, Pandu Nayak and Prabhakar Raghavan Evaluation 1 Situation Thanks to your stellar performance in CS276, you
More informationCSC 411 Lecture 4: Ensembles I
CSC 411 Lecture 4: Ensembles I Roger Grosse, Amir-massoud Farahmand, and Juan Carrasquilla University of Toronto UofT CSC 411: 04-Ensembles I 1 / 22 Overview We ve seen two particular classification algorithms:
More informationGradient Descent. Wed Sept 20th, James McInenrey Adapted from slides by Francisco J. R. Ruiz
Gradient Descent Wed Sept 20th, 2017 James McInenrey Adapted from slides by Francisco J. R. Ruiz Housekeeping A few clarifications of and adjustments to the course schedule: No more breaks at the midpoint
More informationDATA MINING AND MACHINE LEARNING. Lecture 6: Data preprocessing and model selection Lecturer: Simone Scardapane
DATA MINING AND MACHINE LEARNING Lecture 6: Data preprocessing and model selection Lecturer: Simone Scardapane Academic Year 2016/2017 Table of contents Data preprocessing Feature normalization Missing
More informationSession Based Click Features for Recency Ranking
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) Session Based Click Features for Recency Ranking Yoshiyuki Inagaki and Narayanan Sadagopan and Georges Dupret and Ciya
More informationarxiv: v1 [cs.ir] 19 Sep 2016
Enhancing LambdaMART Using Oblivious Trees Marek Modrý 1 and Michal Ferov 2 arxiv:1609.05610v1 [cs.ir] 19 Sep 2016 1 Seznam.cz, Radlická 3294/10, 150 00 Praha 5, Czech Republic marek.modry@firma.seznam.cz
More informationPart 7: Evaluation of IR Systems Francesco Ricci
Part 7: Evaluation of IR Systems Francesco Ricci Most of these slides comes from the course: Information Retrieval and Web Search, Christopher Manning and Prabhakar Raghavan 1 This lecture Sec. 6.2 p How
More informationStat 342 Exam 3 Fall 2014
Stat 34 Exam 3 Fall 04 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed There are questions on the following 6 pages. Do as many of them as you can
More informationEntity Linking. David Soares Batista. November 11, Disciplina de Recuperação de Informação, Instituto Superior Técnico
David Soares Batista Disciplina de Recuperação de Informação, Instituto Superior Técnico November 11, 2011 Motivation Entity-Linking is the process of associating an entity mentioned in a text to an entry,
More informationRyen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University
Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University Users are generally loyal to one engine Even when engine switching cost is low, and even when they are
More informationStructured Ranking Learning using Cumulative Distribution Networks
Structured Ranking Learning using Cumulative Distribution Networks Jim C. Huang Probabilistic and Statistical Inference Group University of Toronto Toronto, ON, Canada M5S 3G4 jim@psi.toronto.edu Brendan
More informationSearch Engines Chapter 8 Evaluating Search Engines Felix Naumann
Search Engines Chapter 8 Evaluating Search Engines 9.7.2009 Felix Naumann Evaluation 2 Evaluation is key to building effective and efficient search engines. Drives advancement of search engines When intuition
More informationOne-Pass Ranking Models for Low-Latency Product Recommendations
One-Pass Ranking Models for Low-Latency Product Recommendations Martin Saveski @msaveski MIT (Amazon Berlin) One-Pass Ranking Models for Low-Latency Product Recommendations Amazon Machine Learning Team,
More informationGradient Boosted Feature Selection. Zhixiang (Eddie) Xu, Gao Huang, Kilian Q. Weinberger, Alice X. Zheng
Gradient Boosted Feature Selection Zhixiang (Eddie) Xu, Gao Huang, Kilian Q. Weinberger, Alice X. Zheng 1 Goals of feature selection Reliably extract relevant features Identify non-linear feature dependency
More informationChapter 5 Graph Algorithms Algorithm Theory WS 2012/13 Fabian Kuhn
Chapter 5 Graph Algorithms Algorithm Theory WS 2012/13 Fabian Kuhn Graphs Extremely important concept in computer science Graph, : node (or vertex) set : edge set Simple graph: no self loops, no multiple
More information1 Training/Validation/Testing
CPSC 340 Final (Fall 2015) Name: Student Number: Please enter your information above, turn off cellphones, space yourselves out throughout the room, and wait until the official start of the exam to begin.
More informationCAMCOS Report Day. December 9 th, 2015 San Jose State University Project Theme: Classification
CAMCOS Report Day December 9 th, 2015 San Jose State University Project Theme: Classification On Classification: An Empirical Study of Existing Algorithms based on two Kaggle Competitions Team 1 Team 2
More informationThe exam is closed book, closed notes except your one-page (two-sided) cheat sheet.
CS 189 Spring 2015 Introduction to Machine Learning Final You have 2 hours 50 minutes for the exam. The exam is closed book, closed notes except your one-page (two-sided) cheat sheet. No calculators or
More informationCSE 546 Machine Learning, Autumn 2013 Homework 2
CSE 546 Machine Learning, Autumn 2013 Homework 2 Due: Monday, October 28, beginning of class 1 Boosting [30 Points] We learned about boosting in lecture and the topic is covered in Murphy 16.4. On page
More informationEvaluating generalization (validation) Harvard-MIT Division of Health Sciences and Technology HST.951J: Medical Decision Support
Evaluating generalization (validation) Harvard-MIT Division of Health Sciences and Technology HST.951J: Medical Decision Support Topics Validation of biomedical models Data-splitting Resampling Cross-validation
More informationA Comparative Analysis of Cascade Measures for Novelty and Diversity
A Comparative Analysis of Cascade Measures for Novelty and Diversity Charles Clarke, University of Waterloo Nick Craswell, Microsoft Ian Soboroff, NIST Azin Ashkan, University of Waterloo Background Measuring
More informationCPSC 340: Machine Learning and Data Mining. Non-Parametric Models Fall 2016
CPSC 340: Machine Learning and Data Mining Non-Parametric Models Fall 2016 Assignment 0: Admin 1 late day to hand it in tonight, 2 late days for Wednesday. Assignment 1 is out: Due Friday of next week.
More informationPredictive Indexing for Fast Search
Predictive Indexing for Fast Search Sharad Goel, John Langford and Alex Strehl Yahoo! Research, New York Modern Massive Data Sets (MMDS) June 25, 2008 Goel, Langford & Strehl (Yahoo! Research) Predictive
More informationAutomatic Domain Partitioning for Multi-Domain Learning
Automatic Domain Partitioning for Multi-Domain Learning Di Wang diwang@cs.cmu.edu Chenyan Xiong cx@cs.cmu.edu William Yang Wang ww@cmu.edu Abstract Multi-Domain learning (MDL) assumes that the domain labels
More informationDeep Character-Level Click-Through Rate Prediction for Sponsored Search
Deep Character-Level Click-Through Rate Prediction for Sponsored Search Bora Edizel - Phd Student UPF Amin Mantrach - Criteo Research Xiao Bai - Oath This work was done at Yahoo and will be presented as
More informationCS145: INTRODUCTION TO DATA MINING
CS145: INTRODUCTION TO DATA MINING 08: Classification Evaluation and Practical Issues Instructor: Yizhou Sun yzsun@cs.ucla.edu October 24, 2017 Learnt Prediction and Classification Methods Vector Data
More informationBig Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1
Big Data Methods Chapter 5: Machine learning Big Data Methods, Chapter 5, Slide 1 5.1 Introduction to machine learning What is machine learning? Concerned with the study and development of algorithms that
More informationPerformance Measures for Multi-Graded Relevance
Performance Measures for Multi-Graded Relevance Christian Scheel, Andreas Lommatzsch, and Sahin Albayrak Technische Universität Berlin, DAI-Labor, Germany {christian.scheel,andreas.lommatzsch,sahin.albayrak}@dai-labor.de
More informationMachine Learning / Jan 27, 2010
Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,
More informationPredictive Indexing for Fast Search
Predictive Indexing for Fast Search Sharad Goel Yahoo! Research New York, NY 10018 goel@yahoo-inc.com John Langford Yahoo! Research New York, NY 10018 jl@yahoo-inc.com Alex Strehl Yahoo! Research New York,
More informationMachine Learning for Professional Tennis Match Prediction and Betting
Machine Learning for Professional Tennis Match Prediction and Betting Andre Cornman, Grant Spellman, Daniel Wright Abstract Our project had two main objectives. First, we wanted to use historical tennis
More informationBayesian model ensembling using meta-trained recurrent neural networks
Bayesian model ensembling using meta-trained recurrent neural networks Luca Ambrogioni l.ambrogioni@donders.ru.nl Umut Güçlü u.guclu@donders.ru.nl Yağmur Güçlütürk y.gucluturk@donders.ru.nl Julia Berezutskaya
More informationA Comparing Pointwise and Listwise Objective Functions for Random Forest based Learning-to-Rank
A Comparing Pointwise and Listwise Objective Functions for Random Forest based Learning-to-Rank MUHAMMAD IBRAHIM, Monash University, Australia MARK CARMAN, Monash University, Australia Current random forest
More informationFacial Expression Classification with Random Filters Feature Extraction
Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle
More informationSemi-supervised Learning
Semi-supervised Learning Piyush Rai CS5350/6350: Machine Learning November 8, 2011 Semi-supervised Learning Supervised Learning models require labeled data Learning a reliable model usually requires plenty
More informationSupport Vector Machines: Brief Overview" November 2011 CPSC 352
Support Vector Machines: Brief Overview" Outline Microarray Example Support Vector Machines (SVMs) Software: libsvm A Baseball Example with libsvm Classifying Cancer Tissue: The ALL/AML Dataset Golub et
More informationCS 237: Probability in Computing
CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 26: Logistic Regression 2 Gradient Descent for Linear Regression Gradient Descent for Logistic Regression
More informationInformation Retrieval
Information Retrieval Lecture 7 - Evaluation in Information Retrieval Seminar für Sprachwissenschaft International Studies in Computational Linguistics Wintersemester 2007 1/ 29 Introduction Framework
More informationPredict the Likelihood of Responding to Direct Mail Campaign in Consumer Lending Industry
Predict the Likelihood of Responding to Direct Mail Campaign in Consumer Lending Industry Jincheng Cao, SCPD Jincheng@stanford.edu 1. INTRODUCTION When running a direct mail campaign, it s common practice
More informationMachine Learning. Nonparametric methods for Classification. Eric Xing , Fall Lecture 2, September 12, 2016
Machine Learning 10-701, Fall 2016 Nonparametric methods for Classification Eric Xing Lecture 2, September 12, 2016 Reading: 1 Classification Representing data: Hypothesis (classifier) 2 Clustering 3 Supervised
More information10-701/15-781, Fall 2006, Final
-7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly
More informationHomework 2. Due: March 2, 2018 at 7:00PM. p = 1 m. (x i ). i=1
Homework 2 Due: March 2, 2018 at 7:00PM Written Questions Problem 1: Estimator (5 points) Let x 1, x 2,..., x m be an i.i.d. (independent and identically distributed) sample drawn from distribution B(p)
More informationInformation Retrieval. Lecture 7 - Evaluation in Information Retrieval. Introduction. Overview. Standard test collection. Wintersemester 2007
Information Retrieval Lecture 7 - Evaluation in Information Retrieval Seminar für Sprachwissenschaft International Studies in Computational Linguistics Wintersemester 2007 1 / 29 Introduction Framework
More informationHigh Accuracy Retrieval with Multiple Nested Ranker
High Accuracy Retrieval with Multiple Nested Ranker Irina Matveeva University of Chicago 5801 S. Ellis Ave Chicago, IL 60637 matveeva@uchicago.edu Chris Burges Microsoft Research One Microsoft Way Redmond,
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationA new click model for relevance prediction in web search
A new click model for relevance prediction in web search Alexander Fishkov 1 and Sergey Nikolenko 2,3 1 St. Petersburg State Polytechnical University jetsnguns@gmail.com 2 Steklov Mathematical Institute,
More informationHandling Ties. Analysis of Ties in Input and Output Data of Rankings
Analysis of Ties in Input and Output Data of Rankings 16.7.2014 Knowledge Engineering - Seminar Sports Data Mining 1 Tied results in the input data Frequency depends on data source tie resolution policy
More informationLearning to Rank with Deep Neural Networks
Learning to Rank with Deep Neural Networks Dissertation presented by Goeric HUYBRECHTS for obtaining the Master s degree in Computer Science and Engineering Options: Artificial Intelligence Computing and
More informationLina Guzman, DIRECTV
Paper 3483-2015 Data sampling improvement by developing SMOTE technique in SAS Lina Guzman, DIRECTV ABSTRACT A common problem when developing classification models is the imbalance of classes in the classification
More informationnode2vec: Scalable Feature Learning for Networks
node2vec: Scalable Feature Learning for Networks A paper by Aditya Grover and Jure Leskovec, presented at Knowledge Discovery and Data Mining 16. 11/27/2018 Presented by: Dharvi Verma CS 848: Graph Database
More informationCPSC 340: Machine Learning and Data Mining. Non-Parametric Models Fall 2016
CPSC 340: Machine Learning and Data Mining Non-Parametric Models Fall 2016 Admin Course add/drop deadline tomorrow. Assignment 1 is due Friday. Setup your CS undergrad account ASAP to use Handin: https://www.cs.ubc.ca/getacct
More informationISyE 6416 Basic Statistical Methods - Spring 2016 Bonus Project: Big Data Analytics Final Report. Team Member Names: Xi Yang, Yi Wen, Xue Zhang
ISyE 6416 Basic Statistical Methods - Spring 2016 Bonus Project: Big Data Analytics Final Report Team Member Names: Xi Yang, Yi Wen, Xue Zhang Project Title: Improve Room Utilization Introduction Problem
More informationLearning and Evaluating Classifiers under Sample Selection Bias
Learning and Evaluating Classifiers under Sample Selection Bias Bianca Zadrozny IBM T.J. Watson Research Center, Yorktown Heights, NY 598 zadrozny@us.ibm.com Abstract Classifier learning methods commonly
More informationTutorials Case studies
1. Subject Three curves for the evaluation of supervised learning methods. Evaluation of classifiers is an important step of the supervised learning process. We want to measure the performance of the classifier.
More information