Information Retrieval Rank aggregation. Luca Bondi
|
|
- Jordan Chambers
- 5 years ago
- Views:
Transcription
1 Rank aggregation Luca Bondi
2 Motivations 2 Metasearch For a given query, combine the results from different search engines Combining ranking functions Text, links, anchor text, page title, etc. Comparing search engine qualities Multi-criteria queries (price, availability, quality of service) e.g., travel, restaurant Similarity search
3 Framework 3 Input: n candidates m voters Preferential voting Each voter gives a (partial) list of the candidates in order of preference Rank Voter 1 Voter n 2 Goal: produce a good consensus ordering of all n candidates
4 Borda s proposal (1770) 4 Election by order of merit First place 1 point Second place 2 points Candidate s score: sum of points Borda winner: lowest scoring candidate
5 Condorcet s proposal (1785) 5 Condorcet s winner: A candidate who defeats every other candidate in pair-wise majority rule election
6 Condorcet!= Borda 6 m = 10, n = A A A A A A B B B B B B B B B B C C C C C C C C C C A A A A Borda scores: A: = 18 B: = 16 Borda winner C: = 26 Condorcet s criterion: A beats both B and C in pair-wire majority A is the Condorcet winner A C B
7 Condorcet s paradox A B C B C A C A B A B C Condorcet s winner may not exist! Choose Condorcet s winner; if none, choose Borda winner Choose candidate with highest outdegree indegree in the majority voting graph
8 Kemeny s proposal 8 Axiomatic approach Define a distance between two preference orderings Obtain ordering that is least distant from individual orderings How to measure the distance between two preference orderings?
9 Metrics on permutations 9 Domain {1,2,, n} A ranking function r(i) < r(j) means that the ranking function r ranks i above j Most common rank distance functions Kendall-tau distance Spearman s footrule distance
10 Kendall-tau distance 10 Two ranking functions r, s K(r, s): number of pairs (i, j) such that r ranks (i, j) in one order and s ranks them in the opposite order Bubble-sort distance K is a metric Example AB: disagreement AC: agreement AD: disagreement BC: agreement BD: agreement CD: disagreement Number of disagreements: 3 (AB, AD, CD) K r, s = 3 r A B C D s B D A C
11 Spearman s footrule distance 11 > F r, s = < r i s i F is a metric (L A norm)?@a Example shift(a) = 2 shift(b) = 1 shift(c) = 1 shift(d) = 2 F r, s = 6 r A B C D s B D A C
12 Metrics on permutations 12 Many of the other metrics are computationally expensive (some NPhard, some not known to be polynomial-time computable, etc.) Also these two are perhaps the most natural for many applications Relationship between Kendall-tau distance and Spearman s footrule distance Diaconis-Graham inequality K r, s F r, s 2K r, s
13 Optimal aggregation 13 Given a metric d(r?,r E ) and input permutations r A,, r F Find permutation r such that r = arg min N F < d(r?, r)?@a Kemeny (Kendall) optimal aggregation: d = K NP-hard Spearman footrule optimal aggregation: d = F Can be computed in polynomial time
14 Footrule optimal aggregation 14 Optimal aggregation computed via minimum cost perfect matching Elements in the optimal rank 1 1 Positions in source rankings p F q < r p q?@a n n
15 Footrule optimal aggregation: example 15 Define all the ranking permutations For each candidate permutation Calculate the Spearman s distance between the candidate permutation and each of the input rankings Sum the distances A B C B D D C A B D C A Select as optimal the permutation with the smallest total distance to the input rankings
16 16 Footrule aggregation: Median rank approximation Given input permutations r A,, r F μ R i = median{r A i,, r F i } Order μ R to obtain a new permutation μ Example μ R A = median 1,3,4 = 3 A B C μ R B = median 2,1,3 = 2 B D D μ R C = median 3,4,1 = 3 C A B μ R D = median 4,2,2 = 2 Different possible aggregated rankings D C A μ = B, D, A, C; μ = D, B, A, C; μ = B, D, C, A; μ = D, B, C, A;
17 17 Footrule aggregation: Median rank approximation If the median ranks of the candidates are unique (i.e., form a permutation), then this permutation is a footrule optimal aggregation not the case of the previous example Even when the ranks are not unique, median rank aggregation is an approximation to Spearman footrule optimal aggregation
18 Top-k rank aggregation 18 Data attributes may be non-numeric Even if they are numeric, differences within certain ranges may not matter to the user Rankings may have ties between elements (partial rankings) Traditionally, two ways of accessing data: Random access: given an element, retrieve its score (position in the ranked list or other associated value) Sorted (sequential) access: access, one by one, the next element (together with its score) in a ranked list, starting from top
19 Top-k rank aggregation 19 Main interest in the top k elements of the aggregation Need for algorithms that quickly obtain the top results without having to read each ranking in its entirety Several algorithms developed in the literature to minimize the accesses when determining the top k elements
20 Combining opaque rankings 20 Techniques using only the position of the elements in the ranking (no other associated score) We review MedRank, An algorithm for rank aggregation based on the notion of median Input: m rankings of n elements Output: the top k elements in the aggregated ranking 1. Use sequential accesses in each ranking, one element at a time, until there are k elements that occur in more than m/2 rankings 2. These are the top k elements
21 MedRank: example 21 Hotels by price Ibis Etap Novotel Mercure Hilton Sheraton Crillon Hotels by rating Crillon Novotel Sheraton Hilton Ibis Ritz Lutetia Top 3 hotels Novotel Hilton Ibis Strategy: Make one sequential access at a time in each ranking Look for hotels that appear in both rankings NB: price and rating are opaque, only the position matters
22 Combining ranking with an objective function 22 Several studies consider rankings where the objects, besides the position, also include a score given in the [0, 1] interval Score gives the degree of matching between the object and the given criterion Example query: names of albums by the Beatles whose cover color is red Being an album by the Beatles has a crisp truth value (0 or 1) Redness of the cover has a fuzzy truth value in [0, 1]
23 Fagin s algorithm for monotone queries 23 Input: a monotone query combining rankings r A,, r F Output: the top k <object, score> pairs 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query 2. For each extracted object, compute its overall score by making random accesses wherever needed 3. Among these, output the k objects with the best overall score Complexity is sub-linear in the number n of objects Proportional to the square root of n when combining two rankings
24 24 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Query: hotels with best price and rating Strategy: Make one sequential access at a time in each ranking Look for hotels that appear in both rankings NB: price and rating are used to compute the overall score
25 25 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query
26 26 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query
27 27 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query
28 28 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query
29 29 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score 1. Extract the same number of objects by sequential accesses in each ranking until there are at least k objects that match the query
30 30 Fagin s algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 Ibis: \.^_`\.a _ = 0.81; Crillon: 0.825; Etap: 0.805; Novotel:0.875; Sheraton: 0.8; Mercure: 0.725; Hilton: For each extracted object, compute its overall score by making random accesses wherever needed 3. Among these, output the k objects with the best overall score
31 31 Fagin s threshold algorithm for monotone queries Input: a monotone query combining rankings r A,, r F Output: the top k <object, score> pairs 1. Init: R = empty set, th = + 2. For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score, add the object to R 3. Let th = threshold value obtained aggregating the last score observed in each ranking 4. If more than k objects have been evaluated, and the minimum of the score of the documents in R is greater than or equal to th, return R 5. Redo step 2
32 32 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score th = + 1. Init: R = empty set, th = +
33 33 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Ibis.81 th = + 2. For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
34 34 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Crillon.825 Ibis.81 th = + 2. For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
35 35 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Crillon.825 Ibis.81 th = \.^_`\.^ _ = Let th = threshold value obtained aggregating the last score observed in each ranking 4. If more than k objects have been evaluated, and the minimum of the score of the documents in R is greater than or equal to th, return R 5. Redo step 2
36 36 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Crillon.825 Ibis.81 Etap.805 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
37 37 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
38 38 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = \.^A`\.^ _ = Let th = threshold value obtained aggregating the last score observed in each ranking 4. If more than k objects have been evaluated, and the minimum of the score of the documents in R is greater than or equal to th, return R 5. Redo step 2
39 39 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
40 40 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
41 41 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = \.fg`\.f _ = Let th = threshold value obtained aggregating the last score observed in each ranking 4. If more than k objects have been evaluated, and the minimum of the score of the documents in R is greater than or equal to th, return R 5. Redo step 2
42 42 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
43 43 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = For i = 1:m a) Extract the next object from r? by sequential accesses b) For each extracted object, compute its overall score by making random accesses wherever needed c) If R contains less than k objects or the minimum score of the objects in R is smaller then the new object score add the object to R
44 44 Fagin s threshold algorithm for monotone queries: example Hotels Cheapness Hotels Rating Ibis.92 Crillon.9 Etap.91 Novotel.9 Novotel.85 Sheraton.8 Mercure.85 Hilton.7 Hilton.825 Ibis.7 Sheraton.8 Etap.7 Crillon.75 Mercure.6 Top 3 Score Novotel.875 Crillon.825 Ibis.81 th = \.fg`\.a _ = Let th = threshold value obtained aggregating the last score observed in each ranking 4. If more than k objects have been evaluated, and the minimum of the score of the documents in R is greater than or equal to th, return R 5. Redo step 2
Consensus Answers for Queries over Probabilistic Databases. Jian Li and Amol Deshpande University of Maryland, College Park, USA
Consensus Answers for Queries over Probabilistic Databases Jian Li and Amol Deshpande University of Maryland, College Park, USA Probabilistic Databases Motivation: Increasing amounts of uncertain data
More informationSearch Engines. Provide a ranked list of documents. May provide relevance scores. May have performance information.
Search Engines Provide a ranked list of documents. May provide relevance scores. May have performance information. 3 External Metasearch Metasearch Engine Search Engine A Search Engine B Search Engine
More informationRank Aggregation for Similar Items
Rank Aggregation for Similar Items D. Sculley Department of Computer Science Tufts University, Medford, MA, USA dsculley@cs.tufts.edu Abstract The problem of combining the ranked preferences of many experts
More informationCombining Fuzzy Information - Top-k Query Algorithms. Sanjay Kulhari
Combining Fuzzy Information - Top-k Query Algorithms Sanjay Kulhari Outline Definitions Objects, Attributes and Scores Querying Fuzzy Data Top-k query algorithms Naïve Algorithm Fagin s Algorithm (FA)
More informationPreference Elicitation for Single Crossing Domain
Preference Elicitation for Single Crossing Domain joint work with Neeldhara Misra (IIT Gandhinagar) March 6, 2017 Appeared in IJCAI 2016 Motivation for Preference Elicitation One often wants to learn how
More informationa) Graph 2 and Graph 3 b) Graph 2 and Graph 4 c) Graph 1 and Graph 4 d) Graph 1 and Graph 3 e) Graph 3 and Graph 4 f) None of the above
Mathematics 105: Math as a Liberal Art. Final Exam. Name Instructor: Ramin Naimi Spring 2008 Close book. Closed notes. No Calculators. NO CELL PHONES! Please turn off your cell phones and put them away.
More informationVoting. Xiaoyue Zhang
Voting Xiaoyue Zhang Preference Ballot Ordered list of candidates Assuming no ties Preference schedule = many preference ballots Alice's Preferences 1. Apple 2. Banana 3. Peach 4. Pear 5. Kiwi What is
More informationIntegrating rankings: Problem statement
Integrating rankings: Problem statement Each object has m grades, oneforeachofm criteria. The grade of an object for field i is x i. Normally assume 0 x i 1. Typically evaluations based on different criteria
More informationVoting Methods and Colluding Voters
. Voting Methods and Colluding Voters Christopher Hanusa Binghamton University December 3, 2007 Outline Voting Methods Plurality/Majority and refinements Ranked Pairs Borda Count Let s vote! Mathematics
More informationarxiv: v1 [cs.ma] 8 May 2018
Ordinal Approximation for Social Choice, Matching, and Facility Location Problems given Candidate Positions Elliot Anshelevich and Wennan Zhu arxiv:1805.03103v1 [cs.ma] 8 May 2018 May 9, 2018 Abstract
More informationChapter 4 Answers. Lesson 4.1. Chapter 4 Answers 1
Chapter 4 Answers Lesson 4.1 1. 2. 3. 4. Chapter 4 Answers 1 5. Answers vary. Sample answer: a. A: 2, none; B: 4, A; C: 3, B; D: 7, C; E: 5, A; F: 10, E; G: 8, A; H: 3, A; I: 4, D, F, G, and H b. c. In
More informationRANK-AWARE QUERY PROCESSING AND OPTIMIZATION. A Thesis. Submitted to the Faculty. Purdue University. Ihab F. Ilyas. In Partial Fulfillment of the
RANK-AWARE QUERY PROCESSING AND OPTIMIZATION A Thesis Submitted to the Faculty of Purdue University by Ihab F. Ilyas In Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy August
More informationData Reduction and Problem Kernels for Voting Problems
Data Reduction and Problem Kernels for Voting Problems Nadja Betzler Friedrich-Schiller-Universität Jena Dagstuhl Seminar Computational Foundations of Social Choice March 2010 Nadja Betzler (Universität
More informationAdvanced Search Techniques for Large Scale Data Analytics Pavel Zezula and Jan Sedmidubsky Masaryk University
Advanced Search Techniques for Large Scale Data Analytics Pavel Zezula and Jan Sedmidubsky Masaryk University http://disa.fi.muni.cz The Cranfield Paradigm Retrieval Performance Evaluation Evaluation Using
More informationCHAPTER IX MULTI STAGE DECISION MAKING APPROACH TO OPTIMIZE THE PRODUCT MIX IN ASSIGNMENT LEVEL UNDER FUZZY GROUP PARAMETERS
CHAPTER IX MULTI STAGE DECISION MAKING APPROACH TO OPTIMIZE THE PRODUCT MIX IN ASSIGNMENT LEVEL UNDER FUZZY GROUP PARAMETERS Introduction: Aryanezhad, M.B [2004] showed that one of the most important decisions
More informationCase-Based Label Ranking
Case-Based Label Ranking Klaus Brinker and Eyke Hüllermeier Data and Knowledge Engineering, Otto-von-Guericke-Universität Magdeburg, Germany {brinker, huellerm}@iti.cs.uni-magdeburg.de Abstract. Label
More informationarxiv: v5 [cs.ds] 4 May 2014
An Analysis of Rank Aggregation Algorithms Gattaca Lv No Institute Given arxiv:1402.5259v5 [cs.ds] 4 May 2014 Abstract. Rank aggregation is an essential approach for aggregating the preferences of multiple
More informationMATH 1340 Mathematics & Politics
MATH 1340 Mathematics & Politics Lecture 5 June 26, 2015 Slides prepared by Iian Smythe for MATH 1340, Summer 2015, at Cornell University 1 An example (Exercise 2.1 in R&U) Consider the following profile
More informationMath 130 Final Exam Study Guide. 1. Voting
1 Math 130 Final Exam Study Guide 1. Voting (a) Be able to interpret a top choice ballot, preference ballot and preference schedule (b) Given a preference schedule, be able to: i. find the winner of an
More informationPackage rankdist. April 8, 2018
Type Package Title Distance Based Ranking Models Version 1.1.3 Date 2018-04-02 Author Zhaozhi Qian Package rankdist April 8, 2018 Maintainer Zhaozhi Qian Implements distance
More informationIterative Voting Rules
Noname manuscript No. (will be inserted by the editor) Iterative Voting Rules Meir Kalech 1, Sarit Kraus 2, Gal A. Kaminka 2, Claudia V. Goldman 3 1 Information Systems Engineering, Ben-Gurion University,
More informationExact and Approximate Generic Multi-criteria Top-k Query Processing
Exact and Approximate Generic Multi-criteria Top-k Query Processing Mehdi Badr, Dan Vodislav To cite this version: Mehdi Badr, Dan Vodislav. Exact and Approximate Generic Multi-criteria Top-k Query Processing.
More informationIBM Research Report. Computing Slater Rankings Using Similarities Among Candidates
RC23748 (W0510-105) October 13, 2005 Computer Science IBM Research Report Computing Slater Rankings Using Similarities Among Candidates Vincent Conitzer* IBM Research Division Thomas J. Watson Research
More informationMining Social Network Graphs
Mining Social Network Graphs Analysis of Large Graphs: Community Detection Rafael Ferreira da Silva rafsilva@isi.edu http://rafaelsilva.com Note to other teachers and users of these slides: We would be
More informationComparison of of parallel and random approach to
Comparison of of parallel and random approach to acandidate candidatelist listininthe themultifeature multifeaturequerying Peter Gurský Peter Gurský Institute of Computer Science, Faculty of Science Institute
More informationSarah Cohen-Boulakia. Laboratoire de Recherche en Informatique CNRS UMR 8623 Université Paris-Sud, Université Paris-Saclay
Sarah Cohen-Boulakia Laboratoire de Recherche en Informatique CNRS UMR 8623 Université Paris-Sud, Université Paris-Saclay Understanding Life Sciences Progress in multiple domains: biology, chemistry, maths,
More informationmodern database systems lecture 5 : top-k retrieval
modern database systems lecture 5 : top-k retrieval Aristides Gionis Michael Mathioudakis spring 2016 announcements problem session on Monday, March 7, 2-4pm, at T2 solutions of the problems in homework
More informationQOS-BASED RANKING MODEL FOR WEB SERVICE SELECTION CONSIDERING USER REQUIREMENTS
QOS-BASED RANKING MODEL FOR WEB SERVICE SELECTION CONSIDERING USER REQUIREMENTS 1 G. VADIVELOU, 2 E. ILAVARASAN 1 Research Scholar, Dept. of CSE, Bharathiar University, Coimbatore, Tamilnadu, India 2 Professor,
More informationOrdering by weighted number of wins gives a good ranking for weighted tournaments
Ordering by weighted number of wins gives a good ranking for weighted tournaments Don Coppersmith Lisa Fleischer Atri Rudra Abstract We consider the following simple algorithm for feedback arc set problem
More informationPossibilities of Voting
Possibilities of Voting MATH 100, Survey of Mathematical Ideas J. Robert Buchanan Department of Mathematics Summer 2018 Introduction When choosing between just two alternatives, the results of voting are
More informationPractical voting rules with partial information
DOI 10.1007/s10458-010-9133-6 Practical voting rules with partial information Meir Kalech Sarit Kraus Gal A. Kaminka Claudia V. Goldman The Author(s) 2010 Abstract Voting is an essential mechanism that
More informationUnsupervised Rank Aggregation with Distance-Based Models
Unsupervised Rank Aggregation with Distance-Based Models Alexandre Klementiev, Dan Roth, and Kevin Small University of Illinois at Urbana-Champaign Motivation Consider a panel of judges Each (independently)
More informationMIDTERM EXAMINATION Networked Life (NETS 112) November 21, 2013 Prof. Michael Kearns
MIDTERM EXAMINATION Networked Life (NETS 112) November 21, 2013 Prof. Michael Kearns This is a closed-book exam. You should have no material on your desk other than the exam itself and a pencil or pen.
More informationCharacterization of Scoring Rules with Distances: Application to the Clustering of Rankings
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015) Characterization of Scoring Rules with Distances: Application to the Clustering of Rankings Paolo
More information5. DIVIDE AND CONQUER I
5. DIVIDE AND CONQUER I mergesort counting inversions closest pair of points randomized quicksort median and selection Lecture slides by Kevin Wayne Copyright 2005 Pearson-Addison Wesley Copyright 2013
More informationEfficient Aggregation of Ranked Inputs
Efficient Aggregation of Ranked Inputs Nikos Mamoulis, Kit Hung Cheng, Man Lung Yiu, and David W. Cheung Department of Computer Science University of Hong Kong Pokfulam Road Hong Kong {nikos,khcheng,mlyiu2,dcheung}@cs.hku.hk
More informationComputing Slater Rankings Using Similarities Among Candidates
Computing Slater Rankings Using Similarities Among Candidates Vincent Conitzer Carnegie Mellon University Computer Science Department 5 Forbes Avenue Pittsburgh, PA 15213, USA conitzer@cs.cmu.edu ABSTRACT
More informationData can be in the form of numbers, words, measurements, observations or even just descriptions of things.
+ What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and
More informationCase-based Label Ranking
Case-based Label Ranking Klaus Brinker 1 and Eyke Hüllermeier 1 Data and Knowledge Engineering, Otto-von-Guericke-Universität Magdeburg, Germany {brinker,huellerm}@iti.cs.uni-magdeburg.de Abstract. Label
More informationFixed-Parameter Algorithms for Kemeny Scores
Fixed-Parameter Algorithms for Kemeny Scores Nadja Betzler 1,, Michael R. Fellows 2,, Jiong Guo 1,, Rolf Niedermeier 1, and Frances A. Rosamond 2, 1 Institut für Informatik, Friedrich-Schiller-Universität
More informationSystem of Systems Architecture Generation and Evaluation using Evolutionary Algorithms
SysCon 2008 IEEE International Systems Conference Montreal, Canada, April 7 10, 2008 System of Systems Architecture Generation and Evaluation using Evolutionary Algorithms Joseph J. Simpson 1, Dr. Cihan
More informationDinner for Two, Reprise
Fuzzy Logic Toolbox Dinner for Two, Reprise In this section we provide the same two-input, one-output, three-rule tipping problem that you saw in the introduction, only in more detail. The basic structure
More informationApproximation Algorithms
Chapter 8 Approximation Algorithms Algorithm Theory WS 2016/17 Fabian Kuhn Approximation Algorithms Optimization appears everywhere in computer science We have seen many examples, e.g.: scheduling jobs
More informationarxiv: v2 [cs.db] 21 Oct 2016
Monitoring the Top-m Aggregation in a Sliding Window of Spatial Queries Farhana M. Choudhury Zhifeng Bao J. Shane Culpepper School of CSIT, RMIT University, Melbourne, Australia {farhana.choudhury,zhifeng.bao,shane.culpepper}@rmit.edu.au
More informationA COMPARATIVE STUDY OF QUALITY AND CONTENT-BASED SPATIAL POOLING STRATEGIES IN IMAGE QUALITY ASSESSMENT. Dogancan Temel and Ghassan AlRegib
A COMPARATIVE STUDY OF QUALITY AND CONTENT-BASED SPATIAL POOLING STRATEGIES IN IMAGE QUALITY ASSESSMENT Dogancan Temel and Ghassan AlRegib Center for Signal and Information Processing (CSIP) School of
More informationOn the Axiomatic Characterization of Runoff Voting Rules
On the Axiomatic Characterization of Runoff Voting Rules Rupert Freeman Markus Brill Vincent Conitzer Department of Computer Science Duke University Durham, NC, USA {rupert,brill,conitzer}@cs.duke.edu
More informationIndex. ,a-possibility efficient solution, programming, 275
Index,a-possibility efficient solution, 196 0-1 programming, 275 aggregation, 31 of fuzzy number, 55 operator, 37, 45 agreement-discordance principle, 61 agricultural economics, 205 Arrow's impossibility
More informationComputing Slater Rankings Using Similarities Among Candidates
Computing Slater Rankings Using Similarities Among Candidates Vincent Conitzer Computer Science Department Carnegie Mellon University Pittsburgh, PA 213, USA conitzer@cs.cmu.edu Abstract Voting (or rank
More informationAlternative Consensus Algorithms. Murat Osmanoglu
Alternative Consensus Algorithms Murat Osmanoglu Bitcoin Block i-1 Block i Block i+1 Hash i-2 Nonce i-1 Hash i-1 Nonce i Hash i Nonce i+1 tx tx tx tx tx tx tx tx tx tx tx tx Do you really need a Blockchain?
More informationRanking Clustered Data with Pairwise Comparisons
Ranking Clustered Data with Pairwise Comparisons Alisa Maas ajmaas@cs.wisc.edu 1. INTRODUCTION 1.1 Background Machine learning often relies heavily on being able to rank the relative fitness of instances
More informationLecture 6. Register Allocation. I. Introduction. II. Abstraction and the Problem III. Algorithm
I. Introduction Lecture 6 Register Allocation II. Abstraction and the Problem III. Algorithm Reading: Chapter 8.8.4 Before next class: Chapter 10.1-10.2 CS243: Register Allocation 1 I. Motivation Problem
More informationComparing Top-k XML Lists
Comparing Top-k XML Lists Ramakrishna Varadarajan a, Fernando Farfán b, Vagelis Hristidis c a Hewlett-Packard, Billerica, MA 01821 b Department of Computer Science and Engineering, University of Michigan,
More informationEfficient Top-k Algorithms for Fuzzy Search in String Collections
Efficient Top-k Algorithms for Fuzzy Search in String Collections Rares Vernica Chen Li Department of Computer Science University of California, Irvine First International Workshop on Keyword Search on
More informationData Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation
Data Mining Part 2. Data Understanding and Preparation 2.4 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Normalization Attribute Construction Aggregation Attribute Subset Selection Discretization
More informationPeter Gurský. Institute of Computer Science, Faculty of Science.
Towards TowardsBetter better Semantics semantics in in the the multifeature Multifeature Querying querying Peter Gurský Peter Gurský Institute of Computer Science, Faculty of Science Institute of P.J.Šafárik
More informationUNIT 8: SOLVING AND GRAPHING QUADRATICS. 8-1 Factoring to Solve Quadratic Equations. Solve each equation:
UNIT 8: SOLVING AND GRAPHING QUADRATICS 8-1 Factoring to Solve Quadratic Equations Zero Product Property For all numbers a & b Solve each equation: If: ab 0, 1. (x + 3)(x 5) = 0 Then one of these is true:
More informationRank Aggregation Methods for the Web
S 886 dvanced Topics in rtificial Intelligence: Multiagent Systems Rank ggregation Methods for the Web ynthia work Ravi Kumar Moni Naor. Sivakumar Presented by: Wanying Luo 1 Outline What is rank aggregation
More informationThe interaction of theory and practice in database research
The interaction of theory and practice in database research Ron Fagin IBM Research Almaden 1 Purpose of This Talk Encourage collaboration between theoreticians and system builders via two case studies
More informationUnderstanding Clustering Supervising the unsupervised
Understanding Clustering Supervising the unsupervised Janu Verma IBM T.J. Watson Research Center, New York http://jverma.github.io/ jverma@us.ibm.com @januverma Clustering Grouping together similar data
More informationNear Neighbor Search in High Dimensional Data (1) Dr. Anwar Alhenshiri
Near Neighbor Search in High Dimensional Data (1) Dr. Anwar Alhenshiri Scene Completion Problem The Bare Data Approach High Dimensional Data Many real-world problems Web Search and Text Mining Billions
More informationClustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search
Informal goal Clustering Given set of objects and measure of similarity between them, group similar objects together What mean by similar? What is good grouping? Computation time / quality tradeoff 1 2
More informationA Complexity-of-Strategic-Behavior Comparison between Schulze s Rule and Ranked Pairs
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence A Complexity-of-Strategic-Behavior Comparison between Schulze s ule and anked Pairs David C. Parkes School of Engineering and
More informationExtracting Rankings for Spatial Keyword Queries from GPS Data
Extracting Rankings for Spatial Keyword Queries from GPS Data Ilkcan Keles Christian S. Jensen Simonas Saltenis Aalborg University Outline Introduction Motivation Problem Definition Proposed Method Overview
More informationCondorcet Consistent Bundling with Social Choice
Condorcet Consistent Bundling with Social Choice Shreyas Sekar Rensselaer Polytechnic Institute Computer Science Department sekars@rpi.edu Sujoy Sikdar Rensselaer Polytechnic Institute Computer Science
More information(Refer Slide Time: 00:02:00)
Computer Graphics Prof. Sukhendu Das Dept. of Computer Science and Engineering Indian Institute of Technology, Madras Lecture - 18 Polyfill - Scan Conversion of a Polygon Today we will discuss the concepts
More informationGeometric Modeling. Mesh Decimation. Mesh Decimation. Applications. Copyright 2010 Gotsman, Pauly Page 1. Oversampled 3D scan data
Applications Oversampled 3D scan data ~150k triangles ~80k triangles 2 Copyright 2010 Gotsman, Pauly Page 1 Applications Overtessellation: E.g. iso-surface extraction 3 Applications Multi-resolution hierarchies
More informationPractical Algorithms for Computing STV and Other Multi-Round Voting Rules
Practical Algorithms for Computing STV and Other Multi-Round Voting Rules Chunheng Jiang jiangc4@rpi.edu Lirong Xia xial@cs.rpi.edu Sujoy Sikdar sikdas@rpi.edu Zhibing Zhao zhaoz6@rpi.edu Hejun Wang wangj38@rpi.edu
More informationRANKINGS AND MULTI-CRITERIA EVALUATION OF ALTERNATIVES: USAGE OF OBJECTIVE DATA, SUBJECTIVE MEASUREMENTS AND EXPERT JUDGMENT
RANKINGS AND MULTI-CRITERIA EVALUATION OF ALTERNATIVES: USAGE OF OBJECTIVE DATA, SUBJECTIVE MEASUREMENTS AND EXPERT JUDGMENT Pavel Brusilovskiy and Robert Hernandez Risk Management, Conrail Corporation
More informationPackage RankAggreg. May 15, 2018
Type Package Title Weighted Rank Aggregation Version 0.6.5 Date 2018-05-14 Package RankAggreg May 15, 2018 Author Vasyl Pihur , Somnath Datta , Susmita Datta
More informationDATA MINING II - 1DL460
DATA MINING II - 1DL460 Spring 2013 " An second class in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt13 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationApriori Algorithm. 1 Bread, Milk 2 Bread, Diaper, Beer, Eggs 3 Milk, Diaper, Beer, Coke 4 Bread, Milk, Diaper, Beer 5 Bread, Milk, Diaper, Coke
Apriori Algorithm For a given set of transactions, the main aim of Association Rule Mining is to find rules that will predict the occurrence of an item based on the occurrences of the other items in the
More information4.3 Quadratic functions and their properties
4.3 Quadratic functions and their properties A quadratic function is a function defined as f(x) = ax + x + c, a 0 Domain: the set of all real numers x-intercepts: Solutions of ax + x + c = 0 y-intercept:
More informationExam : C : IBM InfoSphere Quality Stage v8 Examination. Title. Version : DEMO
Exam : C2090-419 Title : IBM InfoSphere Quality Stage v8 Examination Version : DEMO 1. When running Word Investigation, producing a pattern report will help you do what? A. Refine a standardization rule
More informationExam. Question: Total Points: Score:
FS 2016 Data Modelling and Databases Date: June 9, 2016 ETH Zurich Systems Group Prof. Gustavo Alonso Exam Name: Question: 1 2 3 4 5 6 7 8 9 10 11 Total Points: 15 20 15 10 10 15 10 15 10 10 20 150 Score:
More informationCMSC 451: Dynamic Programming
CMSC 41: Dynamic Programming Slides By: Carl Kingsford Department of Computer Science University of Maryland, College Park Based on Sections 6.1&6.2 of Algorithm Design by Kleinberg & Tardos. Dynamic Programming
More informationRuled Based Approach for Scheduling Flow-shop and Job-shop Problems
Ruled Based Approach for Scheduling Flow-shop and Job-shop Problems Mohammad Komaki, Shaya Sheikh, Behnam Malakooti Case Western Reserve University Systems Engineering Email: komakighorban@gmail.com Abstract
More informationA probabilistic model to resolve diversity-accuracy challenge of recommendation systems
A probabilistic model to resolve diversity-accuracy challenge of recommendation systems AMIN JAVARI MAHDI JALILI 1 Received: 17 Mar 2013 / Revised: 19 May 2014 / Accepted: 30 Jun 2014 Recommendation systems
More informationII (Sorting and) Order Statistics
II (Sorting and) Order Statistics Heapsort Quicksort Sorting in Linear Time Medians and Order Statistics 8 Sorting in Linear Time The sorting algorithms introduced thus far are comparison sorts Any comparison
More informationHKTA TANG HIN MEMORIAL SECONDARY SCHOOL SECONDARY 3 COMPUTER LITERACY. Name: ( ) Class: Date: Databases and Microsoft Access
Databases and Microsoft Access Introduction to Databases A well-designed database enables huge data storage and efficient data retrieval. Term Database Table Record Field Primary key Index Meaning A organized
More informationGeometric Approaches for Top-k Queries
Geometric Approaches for Top-k Queries [Tutorial] Kyriakos Mouratidis Singapore Management University VLDB 2017 Introduction Top-k query: shortlists top options from a set of alternatives E.g. tripadvisor.com
More informationMATH 1340 Mathematics & Politics
MTH 1340 Mathematics & Politics Lecture 4 June 25, 2015 Slides prepared by Iian Smythe for MTH 1340, Summer 2015, at Cornell University 1 Profiles and social choice functions Recall from last time: in
More informationRelational Model, Relational Algebra, and SQL
Relational Model, Relational Algebra, and SQL August 29, 2007 1 Relational Model Data model. constraints. Set of conceptual tools for describing of data, data semantics, data relationships, and data integrity
More informationSearch Engines Chapter 8 Evaluating Search Engines Felix Naumann
Search Engines Chapter 8 Evaluating Search Engines 9.7.2009 Felix Naumann Evaluation 2 Evaluation is key to building effective and efficient search engines. Drives advancement of search engines When intuition
More information3. Cluster analysis Overview
Université Laval Multivariate analysis - February 2006 1 3.1. Overview 3. Cluster analysis Clustering requires the recognition of discontinuous subsets in an environment that is sometimes discrete (as
More informationa) 1/3 area of triangle ABC b) 3.6 c) 3 d) e) Euclid s fifth postulate is equivalent to: Given a line and a point not on that line
1. Given is a right triangle with AD a perpendicular from the right angle to the hypotenuse, find the length of AD given AB = 6, BC = 10 and AC = 8. B D A C a) 7.5 b) 6.5 c) 4.8 d) e) 2. Using the figure
More informationExtending Prolog with Incomplete Fuzzy Information
Extending Prolog with Incomplete Fuzzy Information Susana Muñoz-Hernández Claudio Vaucheret Facultad de Informática Universidad Politécnica de Madrid 28660 Madrid, Spain susana@fi.upm.es vaucheret@ibap.com.ar
More informationSemantics of Ranking Queries for Probabilistic Data and Expected Ranks
Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Graham Cormode AT&T Labs Research Florham Park, NJ, USA Feifei Li Computer Science Department FSU, Tallahassee, FL, USA Ke Yi Computer
More informationClustering Algorithms for general similarity measures
Types of general clustering methods Clustering Algorithms for general similarity measures general similarity measure: specified by object X object similarity matrix 1 constructive algorithms agglomerative
More informationOnline Algorithms. Lecture Notes for CSCI 570 by David Kempe December 1, 2006
Online Algorithms Lecture Notes for CSCI 570 by David Kempe December 1, 2006 So far in class, we have only considered the traditional framework for algorithm design, as follows: we are given an input I,
More informationAchieving k-anonmity* Privacy Protection Using Generalization and Suppression
UT DALLAS Erik Jonsson School of Engineering & Computer Science Achieving k-anonmity* Privacy Protection Using Generalization and Suppression Murat Kantarcioglu Based on Sweeney 2002 paper Releasing Private
More informationGeneralization of Hierarchical Crisp Clustering Algorithms to Fuzzy Logic
Generalization of Hierarchical Crisp Clustering Algorithms to Fuzzy Logic Mathias Bank mathias.bank@uni-ulm.de Faculty for Mathematics and Economics University of Ulm Dr. Friedhelm Schwenker friedhelm.schwenker@uni-ulm.de
More information1.1 evaluating expressions 2017 ink.notebook. August 18, page 7 page 8 Unit 1 Basic Equations and Inequalities. 1.1 Order of Operations.
1.1 evaluating expressions 2017 ink.notebook page 7 page 8 Unit 1 Basic Equations and Inequalities 1.1 Order of Operations page 9 page 10 Lesson Objectives Standards 1.1 Order of Operations Press the tabs
More informationClustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York
Clustering Robert M. Haralick Computer Science, Graduate Center City University of New York Outline K-means 1 K-means 2 3 4 5 Clustering K-means The purpose of clustering is to determine the similarity
More informationA - Total Beans Input File: TotalBeansIn.txt
A - Total Beans Input File: TotalBeansIn.txt Farmer Ryan sells beanbags of various sizes at his farm stand. His friend Nora, a bean counter, has filled six bags with beans after carefully counting the
More informationSUMMER PACKET Answer Key
BELLEVUE SCHOOL DISTRICT SUMMER PACKET Answer Key FOR STUDENTS GOlNG lnto: GEOMETRY Section A1.1.A 1. s = 27 s = 2r + 7 2. a) A. f(n) = b) The number of dollars Julie will get on day 12 is $2048. If you
More informationOrdering by weighted number of wins gives a good ranking for weighted tournaments
Ordering by weighted number of wins gives a good ranking for weighted tournaments DON COPPERSMITH IDA Center for Communications Research and LISA K. FLEISCHER Dartmouth College and ATRI RURDA University
More information( ) ( ) Completing the Square. Alg 3 1 Rational Roots Solving Polynomial Equations. A Perfect Square Trinomials
Alg Completing the Square A Perfect Square Trinomials (± ) ± (± ) ± 4 4 (± ) ± 6 9 (± 4) ± 8 6 (± 5) ± 5 What is the relationship between the red term and the blue term? B. Creating perfect squares.. 6
More informationMesh Decimation. Mark Pauly
Mesh Decimation Mark Pauly Applications Oversampled 3D scan data ~150k triangles ~80k triangles Mark Pauly - ETH Zurich 280 Applications Overtessellation: E.g. iso-surface extraction Mark Pauly - ETH Zurich
More informationMotion estimation for video compression
Motion estimation for video compression Blockmatching Search strategies for block matching Block comparison speedups Hierarchical blockmatching Sub-pixel accuracy Motion estimation no. 1 Block-matching
More informationACM-ICPC Indonesia National Contest Problem A. The Best Team. Time Limit: 2s
Problem A The Best Team Time Limit: 2s ACM-ICPC 2010 is drawing near and your university want to select three out of N students to form the best team. The university however, has a limited budget, so they
More information