Lecture 14: Lower Bounds for Tree Resolution

Similar documents
Lecture 6: Arithmetic and Threshold Circuits

Finite Model Generation for Isabelle/HOL Using a SAT Solver

The Resolution Algorithm

8 NP-complete problem Hard problems: demo

2SAT Andreas Klappenecker

Linear Time Unit Propagation, Horn-SAT and 2-SAT

On Proof Systems Behind Efficient SAT Solvers. DoRon B. Motter and Igor L. Markov University of Michigan, Ann Arbor

8.1 Polynomial-Time Reductions

Lecture 1. 2 Motivation: Fast. Reliable. Cheap. Choose two.

6.856 Randomized Algorithms

To prove something about all Boolean expressions, we will need the following induction principle: Axiom 7.1 (Induction over Boolean expressions):

To prove something about all Boolean expressions, we will need the following induction principle: Axiom 7.1 (Induction over Boolean expressions):

NP-Complete Reductions 2

! Greed. O(n log n) interval scheduling. ! Divide-and-conquer. O(n log n) FFT. ! Dynamic programming. O(n 2 ) edit distance.

Exercises Computational Complexity

PROPOSITIONAL LOGIC (2)

ALGORITHMS EXAMINATION Department of Computer Science New York University December 17, 2007

Boolean Representations and Combinatorial Equivalence

Where Can We Draw The Line?

Chapter 10 Part 1: Reduction

Propositional Calculus: Boolean Algebra and Simplification. CS 270: Mathematical Foundations of Computer Science Jeremy Johnson

CSE 20 DISCRETE MATH. Fall

! Greed. O(n log n) interval scheduling. ! Divide-and-conquer. O(n log n) FFT. ! Dynamic programming. O(n 2 ) edit distance.

Boolean Functions (Formulas) and Propositional Logic

Polynomial SAT-Solver Algorithm Explanation

Randomness and Computation March 25, Lecture 5

Chapter 8. NP and Computational Intractability. Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

Binary Decision Diagrams

NP and computational intractability. Kleinberg and Tardos, chapter 8

Homework 1. Due Date: Wednesday 11/26/07 - at the beginning of the lecture

CMPSCI 250: Introduction to Computation. Lecture #22: Graphs, Paths, and Trees David Mix Barrington 12 March 2014

Example of a Demonstration that a Problem is NP-Complete by reduction from CNF-SAT

CSE 20 DISCRETE MATH. Winter

Reductions and Satisfiability

SAT-CNF Is N P-complete

Unit 8: Coping with NP-Completeness. Complexity classes Reducibility and NP-completeness proofs Coping with NP-complete problems. Y.-W.

1 Definition of Reduction

Question 7.11 Show how heapsort processes the input:

P Is Not Equal to NP. ScholarlyCommons. University of Pennsylvania. Jon Freeman University of Pennsylvania. October 1989

Answer Key #1 Phil 414 JL Shaheen Fall 2010

Greedy Algorithms. CLRS Chapters Introduction to greedy algorithms. Design of data-compression (Huffman) codes

The Satisfiability Problem [HMU06,Chp.10b] Satisfiability (SAT) Problem Cook s Theorem: An NP-Complete Problem Restricted SAT: CSAT, k-sat, 3SAT

9.1 Cook-Levin Theorem

CS 395T Computational Learning Theory. Scribe: Wei Tang

Horn Formulae. CS124 Course Notes 8 Spring 2018

Exact Algorithms Lecture 7: FPT Hardness and the ETH

Combinatorial Optimization

Rank Bounds and Integrality Gaps for Cutting Planes Procedures

8.1 Polynomial-Time Reductions

Notes for Chapter 12 Logic Programming. The AI War Basic Concepts of Logic Programming Prolog Review questions

Inference rule for Induction

Mathematical Logic Prof. Arindama Singh Department of Mathematics Indian Institute of Technology, Madras. Lecture - 37 Resolution Rules

Algorithms Dr. Haim Levkowitz

arxiv: v2 [cs.cc] 29 Mar 2010

NP-Completeness of 3SAT, 1-IN-3SAT and MAX 2SAT

CMPSCI 250: Introduction to Computation. Lecture #14: Induction and Recursion (Still More Induction) David Mix Barrington 14 March 2013

Non-context-Free Languages. CS215, Lecture 5 c

Lecture 2: NP-Completeness

Lecture Notes on Binary Decision Diagrams

W4231: Analysis of Algorithms

SAT Solver. CS 680 Formal Methods Jeremy Johnson

Mixed Integer Linear Programming

Indicate the option which most accurately completes the sentence.

Symbolic Methods. The finite-state case. Martin Fränzle. Carl von Ossietzky Universität FK II, Dpt. Informatik Abt.

Computer-Aided Program Design

Decision Procedures. An Algorithmic Point of View. Decision Procedures for Propositional Logic. D. Kroening O. Strichman.

It is important that you show your work. There are 134 points available on this test.

Parallel and Distributed Computing

Propositional Calculus. Math Foundations of Computer Science

NP-complete Reductions

Deductive Methods, Bounded Model Checking

SAT Solvers. Ranjit Jhala, UC San Diego. April 9, 2013

Finding a winning strategy in variations of Kayles

CS 580: Algorithm Design and Analysis

3/7/2018. CS 580: Algorithm Design and Analysis. 8.1 Polynomial-Time Reductions. Chapter 8. NP and Computational Intractability

CS 151 Complexity Theory Spring Final Solutions. L i NL i NC 2i P.

Chapter 8. NP-complete problems

would be included in is small: to be exact. Thus with probability1, the same partition n+1 n+1 would be produced regardless of whether p is in the inp

4.1 Review - the DPLL procedure

(a) (4 pts) Prove that if a and b are rational, then ab is rational. Since a and b are rational they can be written as the ratio of integers a 1

1 Minimum Cut Problem

Coq projects for type theory 2018

Integrity Constraints (Chapter 7.3) Overview. Bottom-Up. Top-Down. Integrity Constraint. Disjunctive & Negative Knowledge. Proof by Refutation

1. Suppose you are given a magic black box that somehow answers the following decision problem in polynomial time:

Homework 4 Solutions

NP-Completeness. Algorithms

Algorithms. AVL Tree

FINAL EXAM SOLUTIONS

Chapter 2 PRELIMINARIES

CS521 \ Notes for the Final Exam

Logic and Computation

NP-Hardness. We start by defining types of problem, and then move on to defining the polynomial-time reductions.

Sets. Sets. Subset, universe. Specifying sets, membership. Examples: Specifying a set using a predicate. Examples

Satisfiability (SAT) Applications. Extensions/Related Problems. An Aside: Example Proof by Machine. Annual Competitions 12/3/2008

Recursion defining an object (or function, algorithm, etc.) in terms of itself. Recursion can be used to define sequences

A Satisfiability Procedure for Quantified Boolean Formulae

1.4 Normal Forms. We define conjunctions of formulas as follows: and analogously disjunctions: Literals and Clauses

Copyright 2000, Kevin Wayne 1

Recursive Definitions Structural Induction Recursive Algorithms

Solutions. (a) Claim: A d-ary tree of height h has at most 1 + d +...

Transcription:

IAS/PCMI Summer Session 2000 Clay Mathematics Undergraduate Program Advanced Course on Computational Complexity Lecture 14: Lower Bounds for Tree Resolution David Mix Barrington and Alexis Maciel August 3, 2000 1. Resolution and Tree Resolution The Resolution proof system can be defined as a restriction of the Sequent Calculus that allows only variables to appear as formulas in the sequents and that uses only the cut rule. (This is not the usual definition of Resolution but it is equivalent to it.) What can we do with such limited proofs? One thing we cannot do is start with the axioms and build larger formulas because we cannot introduce connectives. In fact, with direct resolution proofs, we can essentially prove nothing else but the axioms. The way to make Resolution useful is to use it as a refutation system. We start with the axioms and a set S of additional sequents and try to derive the empty sequent. If we succeed, then we have proved that S is inconsistent or unsatisfiable: no truth assignment can make all of the sequents in S true. Note that a sequent of the form x 1,..., x m y 1,..., y n is equivalent to the disjunction (or clause) x 1 x m y 1 y n. Therefore, refuting a set of sequents S is equivalent to refuting the CNF formula that consists of the conjunction of all the clauses in S. This does not that mean that with Resolution we can only refute CNF formulas. We can view Resolution as a refutation system for general formulas if we first convert an arbitrary formula F to a CNF formula F C by using the idea we presented in Basic Lecture 7 for reducing CIRCUIT-SAT to 3SAT. View the formula as a circuit, introduce new variables for the output of each of the gates, and define CNF formulas that express the fact that the output of each of the gates is correct. The conjunction of all these formulas together with the variable corresponding to the output gate is satisfiable if and only if the original formula is satisfiable. Therefore, refuting F C also refutes F. In addition, the size of F C is polynomial in the size of F. So if the size of a refutation of F C is polynomial in the size of F C, then it is also polynomial in the size of F. 1

In the previous lecture, we showed that the pigeonhole principle has polynomialsize Sequent Calculus proofs. In this lecture, we will prove an exponential lower bound on the size of a certain type of Resolution proofs of the pigeonhole principle. Of course, with Resolution, what we will consider is refuting the negation of the pigeonhole principle. More precisely, PHP n m corresponds to the following set of clauses: p i1,..., p im (for i [n]) p ij, p kj (for i, k [n], i < k, j [m]) is As a simple example, consider refuting PHP 2 1. The corresponding set of clauses p 11, p 21 p 11 p 21 These can be refuted by two applications of the cut rule. In fact, PHP n 1 can always be refuted by two cuts. More generally, if n > m, then PHP n m has a refutation of size 2 O(m). This implies that if m = O(log n), then PHP n m has a refutation of size n O(1), which is polynomial in the size of PHP n m. In contrast, it is known that any Resolution refutation of PHP n m has size at least 2 m. When m = n Ω(1), the lower bound is exponential. In the next section, we will prove a similar lower bound of 2 m on the size of tree-like Resolution refutations of PHP n m. To explain what a tree-like refutation (or proof) is consider a directed graph whose nodes are the lines of the proof. For each inference that occurs in the proof, connect the conclusion to the hypotheses. Viewing the last line of the proof at the top, that graph may or may not be a tree. If it is, then we say the proof is tree-like. For example, the simple refutation of PHP 2 1 we mentioned earlier is tree-like. The restriction of resolution to tree-like refutations is called Tree Resolution. In proving the lower bound, we will make use of decision trees. These are binary trees in which each node is labeled by some variable and the two edges coming out of a node are labeled 0 and 1. The leaves of the tree are each labeled by some output value. A decision tree computes a function as follows: given an input, start at the root and follow the edge corresponding to the value of the indicated input variable. Repeat this until, eventually, a leaf is reached. The output of the function on that input is the label of that leaf. For example, a tree deciding whether the first bit of the input is equal to the last one would first query the first bit and then the last one. This tree is of depth two and 2

size four. To compute the AND function, simply query all the input bits in order. This gives a tree of depth n and size O(n). To decide the PARITY language, simply query all the bits in order and explore all the possibilities. This produces a tree of depth n and size 2 O(n). It is easy to see that the AND function cannot have smaller decisions trees since every bit must be queried. Can the PARITY language have trees of size 2 o(n)? The answer is no. If a leaf is reached before n bits have been queried, than flipping one of the bits that was not queried would make the output of the tree incorrect. Therefore, every leaf must be reached after all n bits have been queried. But then, only one input string will reach each leaf so the tree must contain 2 n leaves, one for each input string. 2. A Lower Bound for Tree Resolution In this section, we show that every tree-like Resolution refutation of PHP n m has size at least 2 m. This will be done in two steps. First, if a set of clauses is unsatisfiable, then, for every truth assignment to the variables, at least one of the clauses will evaluate to false. The search problem associated with an unsatisfiable set of clauses is to find, given a truth assignment, one of these false clauses. We will show how to transform a tree-like refutation of PHP n m into a decision tree for the associated search problem. This tree will be of the same size as the proof. Then, our second step will be to show that this tree must be of exponential size. So consider a tree-like refutation of PHP n m. The decision tree will have exactly the same structure as the proof tree. Consider any line Γ in the proof. We now show how to label the nodes and edges in the subtree rooted at that node so that this subtree solves the decision problem for all assignments that make the sequent false. To label the entire tree, apply this procedure to the node corresponding to the last line of the proof, which must be the empty sequent. Note that the empty sequent is false for every assignment, so the tree will work for every assignment. If Γ is not an axiom or a clause of PHP n m, it must have been inferred by applying the cut rule to sequents of the form Γ, x and Γ x,, where x is variable. The node corresponding to this sequent will be labeled by x. All the assignments that make Γ false make all the variables in Γ true and all those in false. If x is false for one of these assignments, then this assignment will make Γ x, false so the edge going to that node will be labeled 0. If x is true for one of these assignments, then this assignment will make Γ, x true so the edge going to 3

that node will be labeled 1. The trees rooted at these two nodes can now be labeled recursively so that they correctly solve their respective subproblems. The base case occurs when Γ is either an axiom or one of the clauses of PHP n m. In this case, we are dealing with a leaf of the tree. Note that we only care about assignments that make Γ false. If this sequent is an axiom, then there are no such assignments so we can label the corresponding leaf whatever we want. If the sequent is a clause of PHP n m, then we label the leaf with that clause. This completes the construction of the decision tree. Now we show that this tree must be of size 2 m. Each assignment to the variables describes a multivalued mapping of the n pigeons to the m holes. Among all these assignments, we will focus on those that correspond to a one-to-one mapping of a subset of m of the pigeons to the holes. We call these critical truth assignments (CTA s). We will also consider subsets of these assignments that correspond to CTA s in which particular pigeons are mapped to particular holes. For example, the set of all CTA s where pigeons 1, 2, 5 go to holes 3, 9, 1, respectively. Suppose that a decision tree T correctly solves the search problem for all CTA s that have pigeons i 1,..., i r going to holes j 1,..., j r, respectively. By induction on m r, the number of holes still free, we show that the size of the tree must be at least 2 m r. A lower bound on the size of the entire tree then follows from the case m r = m, because the entire tree solves the problem for all assignments, including all CTA s. The base case is when m r = 1, which means that there is only one hole free. Since there are still two unassigned pigeons, this tree cannot be a single leaf because we still have no way of knowing which of the remaining pigeons will go to the remaining hole and which pigeon(s) will be left out. So the tree must be of size at least 3, which is no less than 2 m r = 2. For the inductive step, suppose that m r > 1 and that a decision tree correctly solves the search problem for all CTA s that have pigeons i 1,..., i r going to holes j 1,..., j r, respectively. By the same argument used in the base case, this tree cannot simply be a leaf. The root of the tree then queries some variable x. If the CTA s under consideration either all have x true or all have x false, then follow the corresponding edge to the next node. Keep on going until you hit either a node that splits the CTA s or a leaf. Once again, that node cannot be a leaf. Now suppose that this node is labeled p ij. Since this variable splits the current set of CTA s in a nontrivial way, it must be that pigeon i is still unassigned and that hole j is still free. Consider the node that the edge labeled 1 leads to. Assign to that node the subset of the current CTA s that send pigeon i to hole j. The tree rooted 4

at that node solves the search problem for those assignments, so it must be of size at least 2 m r 1. Next, consider the node that the edge labeled 0 leads to. All the CTA s going there do not send pigeon i to hole j. Since there are at least two free holes, choose some other hole k and consider the subset of the current CTA s that send pigeon i to hole k. Assign that set of CTA s to that node. Once again, the tree rooted at that node solves the search problem for those assignments, so it must be of size at least 2 m r 1. This implies that the size of the the original tree T must be at least twice that, which is 2 m r, as desired. And this completes the proof of the lower bound. 3. Generalization to Depth-0.5 Sequent Calculus Proofs All the formulas occurring in a Resolution proof are simple variables. Let us now consider relaxing this restriction a little by allowing some less limited formulas. For example, we could allow formulas that contain a single connective. These are formulas of depth 1 so we call the corresponding proof system depth-1 Sequent Calculus. Note that depth here refers to the depth of the formulas allowed in the proof, not to the depth of the graph or tree representing the proof. Resolution could be called depth-0 Sequent Calculus. We can generalize this further and allow formulas of any constant depth d. This gives us constant-depth Sequent Calculus proofs, which are also called AC 0 proofs because polynomial-size, constant-depth formulas, when viewed as circuits, define precisely the class AC 0. We know from the previous lecture that the general Sequent Calculus can prove PHP n m for any m < n in polynomial size. We have now shown that Tree Resolution cannot refute PHP n m in subexponential size if m = n Ω(1). We also mentioned earlier that general Resolution cannot either. What about constant-depth proofs? It turns out that they cannot prove PHP n m in subexponential size if m = n n o(1). But they can prove PHP n m in quasipolynomial size if m n/2. (Quasipolynomial size means size n (log n)o(1).) In fact, depth-1 proofs in which all the connectives have fan-in at most (log n) O(1) are enough for this. We call this depth 0.5. So when m = n/2, Resolution (and Tree Resolution) cannot refute PHP n m in subexponential size but depth-0.5 proofs can do it in quasipolynomial size. What about tree-like, depth-0.5 proofs? The techniques in the proof of our lower bound 5

for Tree Resolution can be extended to show a subexponential lower bound for these proofs too. The construction of the decision tree must be modified to take into account the logical rules of the Sequent Calculus. The nodes in the decision tree will end up with labels corresponding to formulas involving (log n) O(1) variables. So the lower bound argument will have to be modified accordingly. We leave the details as an exercise. 4. Exercises 1. Use the fact that PHP n n 1 has Resolution refutations of size 2 O(n), to show that PHP n m has Resolution refutations of size polynomial in n if m = O(log n). 2. Extend the Tree Resolution lower bound argument to show that tree-like Sequent Calculus refutations of PHP n m have size at least 2 m/t if all the formulas in the refutation involve at most t variables. (t is not necessarily a constant, it can be a function of n.) Use this to show that tree-like, depth-0.5 proofs cannot refute PHP n m in subexponential size. 6