Lecture 12 March 4th

Similar documents
Bounds on the number of inference functions of a graphical model

9 Bounds for the Knapsack Problem (March 6)

FACES OF CONVEX SETS

Math 414 Lecture 2 Everyone have a laptop?

Convex hulls of spheres and convex hulls of convex polytopes lying on parallel hyperplanes

CS 473: Algorithms. Ruta Mehta. Spring University of Illinois, Urbana-Champaign. Ruta (UIUC) CS473 1 Spring / 29

Lecture 3: Art Gallery Problems and Polygon Triangulation

Chapter 4 Concepts from Geometry

Lecture 2 - Introduction to Polytopes

Simplicial Cells in Arrangements of Hyperplanes

Convex Geometry arising in Optimization

CS522: Advanced Algorithms

Lecture 6: Faces, Facets

On the Hardness of Computing Intersection, Union and Minkowski Sum of Polytopes

Lecture 5: Properties of convex sets

3 No-Wait Job Shops with Variable Processing Times

Lecture 3. Corner Polyhedron, Intersection Cuts, Maximal Lattice-Free Convex Sets. Tepper School of Business Carnegie Mellon University, Pittsburgh

Lecture 15: The subspace topology, Closed sets

On the number of distinct directions of planes determined by n points in R 3

11.1 Facility Location

Treewidth and graph minors

A PROOF OF THE LOWER BOUND CONJECTURE FOR CONVEX POLYTOPES

arxiv: v1 [cs.cc] 30 Jun 2017

MATH 890 HOMEWORK 2 DAVID MEREDITH

The Simplex Algorithm

Preferred directions for resolving the non-uniqueness of Delaunay triangulations

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Classification of Ehrhart quasi-polynomials of half-integral polygons

Line Arrangement. Chapter 6

Lecture 4: Rational IPs, Polyhedron, Decomposition Theorem

Planar Graphs. 1 Graphs and maps. 1.1 Planarity and duality

FINAL EXAM SOLUTIONS

Endomorphisms and synchronization, 2: Graphs and transformation monoids

Automorphism Groups of Cyclic Polytopes

arxiv: v2 [math.co] 24 Aug 2016

12.1 Formulation of General Perfect Matching

6 Randomized rounding of semidefinite programs

ORIE 6300 Mathematical Programming I September 2, Lecture 3

Partitions and Packings of Complete Geometric Graphs with Plane Spanning Double Stars and Paths

Geometry. Every Simplicial Polytope with at Most d + 4 Vertices Is a Quotient of a Neighborly Polytope. U. H. Kortenkamp. 1.

Polar Duality and Farkas Lemma

8 Matroid Intersection

Combinatorial Geometry & Topology arising in Game Theory and Optimization

THE DIMENSION OF POSETS WITH PLANAR COVER GRAPHS

Let and be a differentiable function. Let Then be the level surface given by

Mathematical Programming and Research Methods (Part II)

Endomorphisms and synchronization, 2: Graphs and transformation monoids. Peter J. Cameron

Ma/CS 6b Class 26: Art Galleries and Politicians

Two Upper Bounds for the Erdős Szekeres Number with Conditions

Toric Ideals of Homogeneous Phylogenetic Models

arxiv: v1 [math.co] 15 Dec 2009

Testing Isomorphism of Strongly Regular Graphs

arxiv: v1 [math.co] 27 Feb 2015

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

DM545 Linear and Integer Programming. Lecture 2. The Simplex Method. Marco Chiarandini

CPSC 536N: Randomized Algorithms Term 2. Lecture 10

Lecture 2 September 3

COMP331/557. Chapter 2: The Geometry of Linear Programming. (Bertsimas & Tsitsiklis, Chapter 2)

Pebble Sets in Convex Polygons

5. THE ISOPERIMETRIC PROBLEM

arxiv: v1 [math.co] 25 Sep 2015

MA4254: Discrete Optimization. Defeng Sun. Department of Mathematics National University of Singapore Office: S Telephone:

Cyclotomic Polytopes and Growth Series of Cyclotomic Lattices. Matthias Beck & Serkan Hoşten San Francisco State University

LATIN SQUARES AND THEIR APPLICATION TO THE FEASIBLE SET FOR ASSIGNMENT PROBLEMS

Lecture 2 Convex Sets

Linear Programming in Small Dimensions

Inapproximability of the Perimeter Defense Problem

Motion Planning. O Rourke, Chapter 8

On Evasiveness, Kneser Graphs, and Restricted Intersections: Lecture Notes

However, this is not always true! For example, this fails if both A and B are closed and unbounded (find an example).

arxiv: v2 [math.co] 23 Jan 2018

Face Width and Graph Embeddings of face-width 2 and 3

Probabilistic Graphical Models

CS 372: Computational Geometry Lecture 10 Linear Programming in Fixed Dimension

Modeling and Analysis of Hybrid Systems

Modeling and Analysis of Hybrid Systems

Mathematical and Algorithmic Foundations Linear Programming and Matchings

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY

From acute sets to centrally symmetric 2-neighborly polytopes

Crossing Families. Abstract

ACTUALLY DOING IT : an Introduction to Polyhedral Computation

A combinatorial proof of a formula for Betti numbers of a stacked polytope

Discrete Optimization 2010 Lecture 5 Min-Cost Flows & Total Unimodularity

ON CELLULAR RESOLUTION OF MONOMIAL IDEALS

Submodularity Reading Group. Matroid Polytopes, Polymatroid. M. Pawan Kumar

Polytopes Course Notes

Polyhedral Computation Today s Topic: The Double Description Algorithm. Komei Fukuda Swiss Federal Institute of Technology Zurich October 29, 2010

ON THE EMPTY CONVEX PARTITION OF A FINITE SET IN THE PLANE**

arxiv: v1 [math.co] 24 Aug 2009

INDIAN STATISTICAL INSTITUTE

Formally Self-Dual Codes Related to Type II Codes

Acute Triangulations of Polygons

Convex Sets. Pontus Giselsson

All 0-1 Polytopes are. Abstract. We study the facial structure of two important permutation polytopes

C&O 355 Lecture 16. N. Harvey

1 Definition of Reduction

What is the sum of the measures of the angles of a quadrilateral?

Voronoi diagram and Delaunay triangulation

Restricted-Orientation Convexity in Higher-Dimensional Spaces

Design and Analysis of Algorithms (VII)

Transcription:

Math 239: Discrete Mathematics for the Life Sciences Spring 2008 Lecture 12 March 4th Lecturer: Lior Pachter Scribe/ Editor: Wenjing Zheng/ Shaowei Lin 12.1 Alignment Polytopes Recall that the alignment polytope is the convex hull of all alignment summaries. In last lecture, we saw that the alignment polytope for two sequences is the Newton polytope of the statistical model for the two sequences. Here are some results that give bounds on the number of vertices. Theorem 12.1. The number of vertices of the alignment polytope in the case of two free parameters is O((n + m) 2/3 ), where n and m are the lengths of the sequences. Remark. It is conjectured that the number of vertices is Θ((n + m) 2/3 ) for any fixed alphabet. In fact, it is not even known if this is true for the case of a binary alphabet {0, 1}. Theorem 12.2. (Upper-bound for number of vertices, G. Andrews, 1967) For any fixed number d, there is a constant C d such that the number of vertices of any lattice polytope P in R d is bounded from above by ( ) C d vol(p ) d 1 d+1. Note that Theorem 12.2 implies Theorem 12.1, because in the case of two free parameters, vol(p ) = O((n + m) d(d 1)/(d+1) ) so for d = 2, the number of vertices of P is O((n + m) 2/3 ). Moreover, in the case d = 2, this theorem gives a good running time bound for the polytope propagation algorithm. Indeed, the run times for the Minkowski sum and the convex hull of unions are both linear in the number of vertices and the polytope propagation algorithm executes these two operations over a grid of size nm, so the algorithm has a time complexity of O(nm(n + m) 2/3 ). 12.2 Normal Fan We would like to decompose the parameter space of the alignment model into regions which correspond with faces of the alignment polytope. The construction is as follows: 1. Fix a polytope P R d. Given a vector w R d, let face w (P ) = {x P x w y w, y P }. Note that face w (P ) is a polytope because it is the intersection of P with the hyperplane x w = min{x w : x P }. We call such a polytope a face of P. 12-1

2. For F a face of polytope P, define the normal cone of P at F to be N P (F ) = {w R d face w (P ) = F }. 3. Finally, the normal fan of P is the collection of all the normal cones of P. N(P ) = {N P (F ) F a face of P }. Note that N(P ) partitions R d into cones which are in bijection with the faces of P. As an example, suppose we have the polytope For each face, we have a normal cone, displayed here: The collection of these cones gives the normal fan of the polytope. 12-2

12.3 Graham Scan The Graham scan is an algorithm that computes the convex hull of a set of points in R 2. In particular, we can use it to compute the convex hull of the union of two polytopes in R 2 by taking the vertices of the two polytopes and the points where their boundaries intersect as input for the algorithm. The following implementation of the Graham scan takes a set S of n distinct points and returns a sequence T of points on the boundary of the convex hull. 1. Pick a point p 0 S lying on the boundary of the convex hull of S. This can be done by finding a point which minimizes a linear functional, e.g. the y-coordinate. 2. Sort the remaining points, arranging them clockwise according to their radial angle from p 0. Label them p 1,..., p n 1, p n = p 0. This sorting has complexity O(n log n). 3. If S has only one or two points, then return T = (p 0 ) or T = (p 0, p 1 ) respectively. Otherwise, initialize T = (p 0, p 1 ) and i = 2. 4. Let x and y be the last and second-to-last points in T respectively. If we make a right turn going from y to x to p i, append p i to the end of T and increase i by 1. If we make a left turn, then delete x from T. Repeat this step if i < n. 5. Output T. 12.4 Inference Functions In earlier lectures, we studied some graphical models with hidden variables, such as the hidden Markov model. In those situations, given an observation, it would be useful to find the most likely hidden data that caused this observation. Given fixed parameters, an inference function is a map that assigns a most likely hidden data point to each observation. We call this most likely hidden data an explanation for the observation. For example, in the case of sequence alignments, fixing the parameters M, X and G gives an inference function Φ M,X,G : {A, C, G, T} n+m A n,m which maps each pair of sequences of length n and m to an alignment maximizing the score mm + xx + gg. Here, A n,m is the set of global alignments for the two sequences. We order the elements of A n,m to break ties when more than one alignment maximizes the above score. In general, consider a graphical model f which has the k observable variables with the alphabet Σ and l hidden variables with the alphabet Σ. Then, the inference function corresponding to given parameters θ is of the form Φ θ : (Σ ) k Σ l. 12-3

An interesting question is the following: how many different maps Φ θ are there as the parameters θ are varied? For sequence alignments, a trivial upper bound would be ( ) 4 n+m n + m. n This bound is doubly exponential in n+m. However, as the theorem below shows, most of the functions counted in the bound are not inference functions for any values of the parameters. Theorem 12.3. The number of inference functions is O((n + m) 2 ) for the 2-parameter alignment model. In fact, it is Θ((n + m) 2 ). More generally, for a graphical model of complexity N, the bound is O(N d(d 1) ). Remark. The complexity of a graphical model will not be defined here, but it is usually bounded linearly in the number k of observable variables. Example. Consider the hidden Markov model for binary states of length 5 represented by the graph below, where the dark nodes are observed and the white nodes are hidden. There are 32 possible observations and 32 possible explanations. The trivial upper bound is thus 32 32 but the actual number of inference functions is only 5266. To see why this theorem is true, we first need to introduce some new concepts and results. Let N(P ) denote the normal fan of a polytope P. For polytopes P 1,..., P k, the common refinement of their normal fans is the collection of cones obtained as the intersection of a cone from each of the individual fans, and is denoted by N(P 1 ) N(P k ). Lemma 12.4. N(P 1 ) N(P k ) = N( k i=1p i ) Proof: Assigned as a homework. Theorem 12.5. (Gritzmann-Sturmfels) Let P 1,..., P k be polytopes in R d, and let m denote the number of nonparallel edges. Then the number of vertices of P 1 P k is bounded above by d 1 ( ) m 1 2 j j=0 12-4

Remark. Note that this bound does not depend on k. Recall that for every sequence pair σ, the alignment model with parameters θ is given by a polynomial f σ (θ) where each monomial of f σ represents the probability of some alignment of the two sequences. Then, the alignment polytope P σ is the Newton polytope of f σ and its normal fan is N(P σ ). Each vertex of P σ represents some alignment summary, so more than one alignment may be mapped to the same vertex. An optimal alignment for given parameters θ R d is an alignment mapped to a vertex on P σ that is maximal in the direction of θ = log θ. Ties between optimal alignments are broken by a predetermined order on the set of alignments. In the language of normal cones, each cone in the normal fan N(P σ ) represents a set of parameter vectors θ with the same optimal alignment. As we vary over all sequence pairs σ, for two parameter vectors θ and θ, we see that Φ θ(σ) = Φ θ (σ) if and only if θ and θ lie in the same of cone of the common refinement N(P 1 ) N(P k ) = N( k i=1p i ). Hence, we have the following inequality: number of inference functions number of vertices of σ P σ. Now, since each P σ lies in [0, n + m] d, every edge is a vector with coordinates in [ (n + m), n + m]. Therefore, the number of non-parallel edges is at most (2(n + m) + 1) d, and 12.5 Homework d 1 ( ) (2(n + m) + 1) d 1 number of vertices of σ P σ 2 j j=0 2d2 d+1 (d 1)! (n + m)d(d 1). Show that N(P 1 ) N(P k ) = N( k i=1p i ). 12-5