Data Reduction and Problem Kernels for Voting Problems

Similar documents
Parameterized Computational Complexity of Dodgson and Young Elections

Fixed-Parameter Algorithms for Kemeny Scores

On Bounded-Degree Vertex Deletion Parameterized by Treewidth

Parameterized Algorithm for Eternal Vertex Cover

Fixed-Parameter Algorithms

Discrete Applied Mathematics. On Bounded-Degree Vertex Deletion parameterized by treewidth

arxiv: v5 [cs.ds] 4 May 2014

Linear Problem Kernels for NP-Hard Problems on Planar Graphs

KERNELIZATION. Polynomial time preprocessing - data reduction

A linear kernel for planar red-blue dominating set

Kemeny Elections with Bounded Single-peaked or Single-crossing Width

A 2k-Kernelization Algorithm for Vertex Cover Based on Crown Decomposition

Fixed-Parameter Tractability Results for Full-Degree Spanning Tree and Its Dual

Practical Fixed-Parameter Algorithms for Graph-Modeled Data Clustering

nonblocker: Parameterized Algorithmics for minimum dominating set

Graph Editing to a Given Degree Sequence,

Improved Algorithms and Complexity Results for Power Domination in Graphs

Hardness of r-dominating set on graphs of diameter (r + 1)

Even Faster Parameterized Cluster Deletion and Cluster Editing

Graph-Based Data Clustering with Overlaps

Parameterized Complexity of Finding Regular Induced Subgraphs

Parameterized graph separation problems

Ranking and Drawing in Subexponential Time

Fixed-Parameter Algorithms for Cluster Vertex Deletion

W[1]-hardness. Dániel Marx 1. Hungarian Academy of Sciences (MTA SZTAKI) Budapest, Hungary

REFLECTIONS ON MULTIVARIATE ALGORITHMICS AND PROBLEM PARAMETERIZATION ROLF NIEDERMEIER

Ranking and Drawing in Subexponential Time

Effective and Efficient Data Reduction for the Subset Interconnection Design Problem

Preference Elicitation for Single Crossing Domain

Solving Dominating Set in Larger Classes of Graphs: FPT Algorithms and Polynomial Kernels

Kernelization Through Tidying A Case-Study Based on s-plex Cluster Vertex Deletion

W[1]-hardness. Dániel Marx. Recent Advances in Parameterized Complexity Tel Aviv, Israel, December 3, 2017

On the Complexity of the Highly Connected Deletion Problem

FPT Algorithms for Connected Feedback Vertex Set

Common Induced Subgraph Isomorphism Structural Parameterizations and Exact Algorithms

Safe approximation and its relation to kernelization

Isolation Concepts for Enumerating Dense Subgraphs

Hardness of Subgraph and Supergraph Problems in c-tournaments

Faster parameterized algorithms for Minimum Fill-In

Directed Feedback Vertex Set Problem is FPT

Rotation Distance is Fixed-Parameter Tractable

Faster parameterized algorithms for Minimum Fill-In

Introduction to Parameterized Complexity

Subexponential Algorithms for Partial Cover Problems

Improved Upper Bounds for Partial Vertex Cover

A Complexity Dichotomy for Finding Disjoint Solutions of Vertex Deletion Problems

Solving Hard Problems Incrementally

Linear Kernel for Planar Connected Dominating Set

Kernels for Below-Upper-Bound Parameterizations of the Hitting Set and Directed Dominating Set Problems

Partial vs. Complete Domination:

The Method of Extremal Structure on the k-maximum CUT Problem

Introduction to Graph Theory

Parameterized Complexity - an Overview

Multicut is FPT. Lyon Cedex 07, France

An Improved Exact Algorithm for Undirected Feedback Vertex Set

arxiv: v1 [cs.ds] 8 Jan 2019

Preprocessing for treewidth

Bidimensionality and Kernels

On the Min-Max 2-Cluster Editing Problem

Fixed Parameter Algorithms

Graph-Based Data Clustering with Overlaps

Feedback Arc Set in Bipartite Tournaments is NP-Complete

Exact Algorithms and Applications for Tree-Like Weighted Set Cover 1

Finding Dense Subgraphs of Sparse Graphs

Lecture 4: September 11, 2003

Call Control with k Rejections

Subexponential Algorithms for Partial Cover Problems

arxiv: v1 [cs.ma] 8 May 2018

Fast, Effective Vertex Cover Kernelization: A Tale of Two Algorithms

Improved Algorithms and Complexity Results for Power Domination in Graphs

Parameterizing the Winner Determination Problem for Combinatorial Auctions

The Parameterized Complexity of Some Minimum Label Problems

Tree Decompositions of Graphs: Saving Memory in Dynamic Programming 1

Parameterized Reductions

Parametric Duality and Kernelization: Lower Bounds and Upper Bounds on Kernel Size

Multicut in trees viewed through the eyes of vertex cover

PROBLEM SOLVING AND SEARCH IN ARTIFICIAL INTELLIGENCE

On the Parameterized Max-Leaf Problems: Digraphs and Undirected Graphs

A Refined Complexity Analysis of Finding the Most Vital Edges for Undirected Shortest Paths

Treewidth: Preprocessing and Kernelization

Exact Algorithms Lecture 7: FPT Hardness and the ETH

XLVI Pesquisa Operacional na Gestão da Segurança Pública

Convex Recoloring Revisited: Complexity and Exact Algorithms

Experiments on Data Reduction for Optimal Domination in Networks

On the Complexity of Various Parameterizations of Common Induced Subgraph Isomorphism

9. Parameter Treewidth

Fixed- Parameter Evolu2onary Algorithms

Lossy Kernelization. Daniel Lokshtanov 1 Fahad Panolan 2 M. S. Ramanujan 3 Saket Saurabh 1,2. Simon Institute, November 5th, 2015.

A Linear Kernel for the Differential of a Graph

On Structural Parameterizations for the 2-Club Problem

Iterative Voting Rules

On Exploring Temporal Graphs of Small Pathwidth

2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006

What Makes Equitable Connected Partition Easy

Finding k Disjoint Triangles in an Arbitrary Graph

Branching Algorithms

Paths, Flowers and Vertex Cover

The Dominating Set Problem in Intersection Graphs

Increasing the Minimum Degree of a Graph by Contractions

Fixed Parameter Complexity of the Weighted Max Leaf Problem

Transcription:

Data Reduction and Problem Kernels for Voting Problems Nadja Betzler Friedrich-Schiller-Universität Jena Dagstuhl Seminar Computational Foundations of Social Choice March 2010 Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 1/19

Data reduction rule Preprocessing step to solve a problem: Basic idea A data reduction rule shrinks an instance of a problem to an equivalent instance by cutting away easy parts of the original instance. We focus on polynomial-time data reduction rules for NP-hard problems. First example: Kemeny Score (or Rank Aggregation) Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 2/19

Kemeny Score Election Set of votes V, set of candidates C. A vote is a ranking (total order) over all candidates. Example: C = {a, b, c} vote 1: a > b > c vote 2: a > c > b vote 3: b > c > a How to aggregate the votes into a consensus ranking? Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 3/19

Kemeny score: KT-distance KT-distance (between two votes v and w) KT-dist(v, w) = d v,w (c, d), {c,d} C where d v,w (c, d) is 0 if v and w rank c and d in the same order, 1 otherwise. Example: v : a > b > c w : c > a > b KT-dist(v, w) = d v,w (a, b) + d v,w (a, c) + d v,w (b, c) = 0 + 1 + 1 = 2 Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 4/19

Kemeny Consensus Kemeny score of a ranking r: sum of KT-distances between r and all votes Kemeny consensus r con : a ranking that minimizes the Kemeny score v 1 : a > b > c.. KT-dist(r con, v 1 ) = 0 v 2 : a > c > b KT-dist(r con, v 2 ) = 1 because of {b, c} v 3 : b > c > a KT-dist(r con, v 3 ) = 2 because of {a, b} and {a, c} r con : a > b > c Kemeny score: 0 + 1 + 2 = 3 Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 5/19

Decision problem Kemeny Score Input: An election (V, C) and a positive integer k. Question: Is there a Kemeny consensus of (V, C) with score at most k? Kemeny Score is NP-complete (even for 4 votes) [Bartholdi et al., SCW 1989], [Dwork et al., WWW 2001] many algorithmic results and applications... Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 6/19

Simple reduction rules Condorcet property: A candidate c beating every other candidate in more than half of the votes, that is, c > 1/2 c for every candidate c c, takes the first position in every Kemeny consensus. Reduction Rule If there is a Condorcet winner in an election provided by a Kemeny Score instance, delete this candidate. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 7/19

Simple reduction rules Condorcet property: A candidate c beating every other candidate in more than half of the votes, that is, c > 1/2 c for every candidate c c, takes the first position in every Kemeny consensus. Reduction Rule If there is a Condorcet winner in an election provided by a Kemeny Score instance, delete this candidate. Reduction Rule If there is a subset C C of candidates with c > 1/2 c for every c C and every c C \ C, then replace the original instance by the two subinstances induced by C and C \ C. Note: A subset C must be found in polynomial time. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 7/19

Reduction rules using dirty candidates A candidate c is non-dirty if for every other candidate c either c 3/4 c or c 3/4 c. Lemma For a non-dirty candidate c and candidate c C \ {c}: If c 3/4 c, then c > > c in every Kemeny consensus. If c 3/4 c, then c > > c in every Kemeny consensus. Reduction Rule If there is a non-dirty candidate, then delete it and partition the instance into two subinstances accordingly. a 1 > a 2 > a 3 > c > b 1 > b 2 a i 3/4 c and c 3/4 b i a 3 > a 2 > c > a 1 > b 2 > b 1 a 1 > c > a 2 > b 2 > b 1 > a 3 in every Kemeny consensus: a 2 > a 3 > a 1 > b 1 > b 2 > c {a 1, a 2, a 3 } > c > {b 1, b 2 } Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 8/19

Reduction rules using dirty candidates A candidate c is non-dirty if for every other candidate c either c 3/4 c or c 3/4 c. Lemma For a non-dirty candidate c and candidate c C \ {c}: If c 3/4 c, then c > > c in every Kemeny consensus. If c 3/4 c, then c > > c in every Kemeny consensus. Reduction Rule If there is a non-dirty candidate, then delete it and partition the instance into two subinstances accordingly. Further rule: an extended reduction rule based on sets of non-dirty candidates..... Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 9/19

Experimental results: Meta search engines Four votes: Google, Lycos, MSN Live Search, and Yahoo! top 1000 hits each, candidates that appear in all four rankings search term #cand. time [s] structure of reduced instance solved/unsolved affirmative action 127 0.41 [27] > 41 > [59] alcoholism 115 0.21 [115] architecture 122 0.47 [36] > 12 > [30] > 17 > [27] blues 112 0.16 [74] > 9 > [29] cheese 142 0.39 [94] > 6 > [42] classical guitar 115 1.12 [6] > 7 > [50] > 35 > [17] Death+Valley 110 0.25 [15] > 7 > [30] > 8 > [50] field hockey 102 0.21 [37] > 26 > [20] > 4 > [15] gardening 106 0.19 [54] > 20 > [2] > 9 > [8] > 4 > [9] HIV 115 0.26 [62] > 5 > [7] > 20 > [21] lyme disease 153 2.61 [25] > 97 > [31] mutual funds 128 3.33 [9] > 45 > [9] > 5 > [1] > 49 > [10] rock climbing 102 0.12 [102] Shakespeare 163 0.68 [100] > 10 > [25] > 6 > [22] telecommuting 131 2.28 [9] > 109 > [13] Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 10/19

Theoretical Analysis Central question How to theoretically measure the performance of data reduction rules? For which kind of instances do the reduction rules perform well? Recall: NP-hard problems and polynomial-time data reduction rules Classical complexity theory: Reduction rules cannot always be applied, else P=NP. No provable performance guarantees for data reduction. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 11/19

Parameterized Complexity Given an NP-hard problem with input size n and a parameter k Basic idea: Confine the combinatorial explosion to k n k instead of n k Definition A problem of size n is called fixed-parameter tractable with respect to a parameter k if it can be solved exactly in f (k) n O(1) time. Parameters: # votes, # candidates, average KT-distance,... Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 12/19

Problem kernel Let L Σ N be a parameterized problem. An instance of L is denoted by (I, k). Kernelization I k data reduction rules poly( I, k) I k (I, k) L (I, k ) L k k I g(k) If g is a polynomial, we say L admits a polynomial problem kernel. Known fact: A problem is fixed-parameter tractable if and only if it admits problem kernel. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 13/19

Partial problem kernel for Kemeny Score Parameter: average KT-distance between the input votes d a := 1 n(n 1) u,v V,u v KT-dist(u, v). Known FPT-results: dynamic programming with running time O (16 da ) [Betzler, Fellows, Guo, Niedermeier, and Rosamond, AAMAS 2009] branching algorithm with running time O (5.83 da ) [Simjour, IWPEC 2009] Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 14/19

Partial problem kernel for Kemeny Score Parameter: average KT-distance between the input votes Theorem A Kemeny Score instance with average KT-distance d a can be reduced in polynomial time to an equivalent instance with less than 11 d a candidates. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 15/19

Partial problem kernel for Kemeny Score Parameter: average KT-distance between the input votes Theorem A Kemeny Score instance with average KT-distance d a can be reduced in polynomial time to an equivalent instance with less than 11 d a candidates. Idea of proof: Recall: Reduction rule which deletes all non-dirty candidates Every dirty candidate must be involved in at least one candidate pair that is not ordered according to the 3/4 -majority For an instance with n votes, every such pair contributes with at least n/4 3n/4 to the average KT-distance. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 15/19

Partial problem kernel for Kemeny Score Parameter: average KT-distance between the input votes Theorem A Kemeny Score instance with average KT-distance d a can be reduced in polynomial time to an equivalent instance with less than 11 d a candidates. Idea of proof: Recall: Reduction rule which deletes all non-dirty candidates Every dirty candidate must be involved in at least one candidate pair that is not ordered according to the 3/4 -majority For an instance with n votes, every such pair contributes with at least n/4 3n/4 to the average KT-distance. Theoretical challenge: Reduce the number of votes Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 15/19

Example for a problem kernel Possible Winner Problem Input: Given a set of partial orders (votes) on a set of candidates and a distinguished candidate c. Question: Can the partial orders be extended to linear ones such that c becomes a winner? k-approval voting: In every vote, the first k candidates get one point and the remaining candidates zero points. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 16/19

Example for a problem kernel Possible Winner Problem Input: Given a set of partial orders (votes) on a set of candidates and a distinguished candidate c. Question: Can the partial orders be extended to linear ones such that c becomes a winner? k-approval voting: In every vote, the first k candidates get one point and the remaining candidates zero points. Relevant known results: NP-hard for every fixed k {2,..., C 2} [Betzler and Dorn, MFCS 2009] NP-hard for two (or more) partial votes [Betzler et al., IJCAI 2009] Consider the combined parameter number of partial votes and number of candidates receiving zero/one point per vote. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 16/19

Possible Winner for k-approval t partial votes k 11..1 000... 00 1111... 111 0...0... t... k 0 := #candidates k Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 17/19

Possible Winner for k-approval t partial votes k k 0 := #candidates k 11..1 000... 00 1111... 111 0...0 t...... Theorem For k-approval, Possible Winner admits a polynomial-size problem kernel with respect to the parameter (t, k 0 ). More precisely, an instance can be reduced to an instance with O(t k 2 0 ) candidates and O(t2 k 2 0 ) votes. Theorem For k-approval, Possible Winner parameterized by (t, k) is fixed-parameter tractable but does not admit a polynomial-size problem kernel unless conp NP/poly. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 17/19

Conclusion In practice: Data reduction should be applied whenever possible! In theory: Parameterized algorithmics offer a framework to analyze the effectiveness of data reduction rules. Hardly any results on problem kernels for problems in the voting context, many open questions.. Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 18/19

Literature Talk is based on Exact Rank Aggregation Based on Effective Data Reduction [N. Betzler, R. Bredereck, and R. Niedermeier, manuscript] On Problem Kernels for Possible Winner Determination Under the k-approval Protocol [ N. Betzler, manuscript] General literature on problem kernels and parameterized algorithms Invitation to data reduction and problem kernelization [J. Guo and R. Niedermeier, ACM SIGACT News 2007] Kernelization: New upper and lower bound techniques [H. Bodlaender, IWPEC 2009] R. G. Downey and M. R. Fellows, Parameterized Complexity, Springer, 1999 R. Niedermeier, Invitation to Fixed-Parameter Algorithms, Oxford University Press, 2006 Nadja Betzler (Universität Jena) Data Reduction and Problem Kernels for Voting Problems 19/19