Convex Optimization M2

Similar documents
2. Convex sets. x 1. x 2. affine set: contains the line through any two distinct points in the set

Convex Optimization. Convex Sets. ENSAE: Optimisation 1/24

2. Convex sets. affine and convex sets. some important examples. operations that preserve convexity. generalized inequalities

Convex Sets. CSCI5254: Convex Optimization & Its Applications. subspaces, affine sets, and convex sets. operations that preserve convexity

Lecture: Convex Sets

Tutorial on Convex Optimization for Engineers

EE/ACM Applications of Convex Optimization in Signal Processing and Communications Lecture 6

Convex Optimization - Chapter 1-2. Xiangru Lian August 28, 2015

Convex Optimization. 2. Convex Sets. Prof. Ying Cui. Department of Electrical Engineering Shanghai Jiao Tong University. SJTU Ying Cui 1 / 33

Shiqian Ma, MAT-258A: Numerical Optimization 1. Chapter 2. Convex Optimization

Introduction to Convex Optimization. Prof. Daniel P. Palomar

60 2 Convex sets. {x a T x b} {x ã T x b}

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 4: Convex Sets. Instructor: Shaddin Dughmi

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets. Instructor: Shaddin Dughmi

Mathematical Programming and Research Methods (Part II)

Affine function. suppose f : R n R m is affine (f(x) =Ax + b with A R m n, b R m ) the image of a convex set under f is convex

Convex sets and convex functions

Convex sets and convex functions

Introduction to Modern Control Systems

2. Convex sets. affine and convex sets. some important examples. operations that preserve convexity. generalized inequalities

Convexity I: Sets and Functions

DM545 Linear and Integer Programming. Lecture 2. The Simplex Method. Marco Chiarandini

Lecture 2 Convex Sets

EE/AA 578: Convex Optimization

Lecture 2: August 31

Convexity: an introduction

Lecture 2: August 29, 2018

Simplex Algorithm in 1 Slide

Convex Sets (cont.) Convex Functions

MA4254: Discrete Optimization. Defeng Sun. Department of Mathematics National University of Singapore Office: S Telephone:

Convex Optimization CMU-10725

IE521 Convex Optimization Introduction

COM Optimization for Communications Summary: Convex Sets and Convex Functions

Lecture 2 - Introduction to Polytopes

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Lecture 4: Convexity

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 1: Introduction to Optimization. Instructor: Shaddin Dughmi

This lecture: Convex optimization Convex sets Convex functions Convex optimization problems Why convex optimization? Why so early in the course?

Convex Optimization MLSS 2015

Lecture 2. Topology of Sets in R n. August 27, 2008

Convex Optimization. Chapter 1 - chapter 2.2

Lecture 2: August 29, 2018

Division of the Humanities and Social Sciences. Convex Analysis and Economic Theory Winter Separation theorems

Alternating Projections

CMU-Q Lecture 9: Optimization II: Constrained,Unconstrained Optimization Convex optimization. Teacher: Gianni A. Di Caro

Chapter 4 Convex Optimization Problems

Lec13p1, ORF363/COS323

Convex Optimization Lecture 2

Lecture 5: Properties of convex sets

Algorithms for convex optimization

CS 435, 2018 Lecture 2, Date: 1 March 2018 Instructor: Nisheeth Vishnoi. Convex Programming and Efficiency

Mathematical and Algorithmic Foundations Linear Programming and Matchings

THEORY OF LINEAR AND INTEGER PROGRAMMING

15. Cutting plane and ellipsoid methods

Conic Duality. yyye

Math 5593 Linear Programming Lecture Notes

Submodularity Reading Group. Matroid Polytopes, Polymatroid. M. Pawan Kumar

Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding

Outline. CS38 Introduction to Algorithms. Linear programming 5/21/2014. Linear programming. Lecture 15 May 20, 2014

Convex Optimization: from Real-Time Embedded to Large-Scale Distributed

Lecture 5: Duality Theory

FACES OF CONVEX SETS

Optimization under uncertainty: modeling and solution methods

Chapter 15 Introduction to Linear Programming

1. Introduction. performance of numerical methods. complexity bounds. structural convex optimization. course goals and topics

Applied Lagrange Duality for Constrained Optimization

Numerical Optimization

Aspects of Convex, Nonconvex, and Geometric Optimization (Lecture 1) Suvrit Sra Massachusetts Institute of Technology

Convex Optimization. Lijun Zhang Modification of

Lecture 3: Convex sets

Convex Sets. Pontus Giselsson

A linear program is an optimization problem of the form: minimize subject to

Linear programming and duality theory

13. Cones and semidefinite constraints

EARLY INTERIOR-POINT METHODS

On the Hardness of Computing Intersection, Union and Minkowski Sum of Polytopes

A mini-introduction to convexity

Linear Optimization. Andongwisye John. November 17, Linkoping University. Andongwisye John (Linkoping University) November 17, / 25

Section Notes 5. Review of Linear Programming. Applied Math / Engineering Sciences 121. Week of October 15, 2017

Optimization for Machine Learning

ORIE 6300 Mathematical Programming I September 2, Lecture 3

Linear and Integer Programming :Algorithms in the Real World. Related Optimization Problems. How important is optimization?

However, this is not always true! For example, this fails if both A and B are closed and unbounded (find an example).

Math 273a: Optimization Linear programming

A Course in Convexity

Week 5. Convex Optimization

Lecture 2 September 3

Topological properties of convex sets

POLYHEDRAL GEOMETRY. Convex functions and sets. Mathematical Programming Niels Lauritzen Recall that a subset C R n is convex if

Convex Geometry arising in Optimization

of Convex Analysis Fundamentals Jean-Baptiste Hiriart-Urruty Claude Lemarechal Springer With 66 Figures

CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions. Instructor: Shaddin Dughmi

NATCOR Convex Optimization Linear Programming 1

The Simplex Algorithm

Programming, numerics and optimization

Convex Optimization. August 26, 2008

Research Interests Optimization:

Some Advanced Topics in Linear Programming

Chapter 4 Concepts from Geometry

Linear Optimization and Extensions: Theory and Algorithms

Transcription:

Convex Optimization M2 Lecture 1 A. d Aspremont. Convex Optimization M2. 1/49

Today Convex optimization: introduction Course organization and other gory details... Convex sets, basic definitions. A. d Aspremont. Convex Optimization M2. 2/49

Convex Optimization A. d Aspremont. Convex Optimization M2. 3/49

Convex Optimization How do we identify easy and hard problems? Convexity: why is it so important? Modeling: how do we recognize easy problems in real applications? Algorithms: how do we solve these problems in practice? A. d Aspremont. Convex Optimization M2. 4/49

Least squares (LS) minimize Ax b 2 2 A R m n, b R m are parameters; x R n is variable Complete theory (existence & uniqueness, sensitivity analysis... ) Several algorithms compute (global) solution reliably We can solve dense problems with n = 1000 vbles, m = 10000 terms By exploiting structure (e.g., sparsity) can solve far larger problems... LS is a (widely used) technology A. d Aspremont. Convex Optimization M2. 5/49

Linear program (LP) minimize c T x subject to a T i x b i, i = 1,..., m c, a i R n are parameters; x R n is variable Nearly complete theory (existence & uniqueness, sensitivity analysis... ) Several algorithms compute (global) solution reliably Can solve dense problems with n = 1000 vbles, m = 10000 constraints By exploiting structure (e.g., sparsity) can solve far larger problems... LP is a (widely used) technology A. d Aspremont. Convex Optimization M2. 6/49

Quadratic program (QP) minimize F x g 2 2 subject to a T i x b i, i = 1,..., m Combination of LS & LP Same story... QP is a technology Reliability: Programmed on chips to solve real-time problems Classic application: portfolio optimization A. d Aspremont. Convex Optimization M2. 7/49

The bad news LS, LP, and QP are exceptions Most optimization problems, even some very simple looking ones, are intractable The objective of this class is to show you how to recognize the nice ones... Many, many applications across all fields... A. d Aspremont. Convex Optimization M2. 8/49

Polynomial minimization minimize p is polynomial of degree d; x R n is variable p(x) Except for special cases (e.g., d = 2) this is a very difficult problem Even sparse problems with size n = 20, d = 10 are essentially intractable All algorithms known to solve this problem require effort exponential in n A. d Aspremont. Convex Optimization M2. 9/49

What makes a problem easy or hard? Classical view: linear is easy nonlinear is hard(er) A. d Aspremont. Convex Optimization M2. 10/49

What makes a problem easy or hard? Emerging (and correct) view:... the great watershed in optimization isn t between linearity and nonlinearity, but convexity and nonconvexity. R. Rockafellar, SIAM Review 1993 A. d Aspremont. Convex Optimization M2. 11/49

Convex optimization minimize f 0 (x) subject to f 1 (x) 0,..., f m (x) 0 x R n is optimization variable; f i : R n R are convex: f i (λx + (1 λ)y) λf i (x) + (1 λ)f i (y) for all x, y, 0 λ 1 includes LS, LP, QP, and many others like LS, LP, and QP, convex problems are fundamentally tractable A. d Aspremont. Convex Optimization M2. 12/49

Example: Stochastic LP Consider the following stochastic LP: minimize c T x subject to Prob(a T i x b i) η, i = 1,..., m coefficient vectors a i IID, N (a i, Σ i ); η is required reliability for fixed x, a T i x is N (at i x, x T Σ i x) so for η = 50%, stochastic LP reduces to LP minimize c T x subject to a T i x b i, i = 1,..., m and so is easily solved what about other values of η, e.g., η = 10%? η = 90%? A. d Aspremont. Convex Optimization M2. 13/49

Hint {x Prob(a T i x b i ) η, i = 1,..., m} η = 10% η = 50% η = 90% A. d Aspremont. Convex Optimization M2. 14/49

Convexity again stochastic LP with reliability η = 90% is convex, and very easily solved stochastic LP with reliability η = 10% is not convex, and extremely difficult moral: very difficult and very easy problems can look quite similar (to the untrained eye) A. d Aspremont. Convex Optimization M2. 15/49

Convex Optimization A brief history... The field is about 50 years old. Starts with the work of Von Neumann, Kuhn and Tucker, etc Explodes in the 60 s with the advent of relatively cheap and efficient computers... Key to all this: fast linear algebra Some of the theory developed before computers even existed... A. d Aspremont. Convex Optimization M2. 16/49

Convex optimization: history Convexity = low complexity:... In fact the great watershed in optimization isn t between linearity and nonlinearity, but convexity and nonconvexity. T. Rockafellar. True: Nemirovskii and Yudin [1979]. Very true: Karmarkar [1984]. Seriously true: convex programming, Nesterov and Nemirovskii [1994]. A. d Aspremont. Convex Optimization M2. 17/49

Standard convex complexity analysis All convex minimization problems with: a first order oracle (returning f(x) and a subgradient) can be solved in polynomial time in size and number of precision digits. Proved using the ellipsoid method by Nemirovskii and Yudin [1979]. Very slow convergence in practice. A. d Aspremont. Convex Optimization M2. 18/49

Linear Programming Simplex algorithm by Dantzig (1949): exponential worst-case complexity, very efficient in most cases. Khachiyan [1979] then used the ellipsoid method to show the polynomial complexity of LP. Karmarkar [1984] describes the first efficient polynomial time algorithm for LP, using interior point methods. A. d Aspremont. Convex Optimization M2. 19/49

From LP to structured convex programs Nesterov and Nemirovskii [1994] show that the interior point methods used for LPs can be applied to a larger class of structured convex problems. The self-concordance analysis that they introduce extends the polynomial time complexity proof for LPs. Most operations that preserve convexity also preserve self-concordance. The complexity of a certain number of elementary problems can be directly extended to a much wider class. A. d Aspremont. Convex Optimization M2. 20/49

Symmetric cone programs An important particular case: linear programming on symmetric cones minimize subject to c T x Ax b K These include the LP, second-order (Lorentz) and semidefinite cone: LP: {x R n : x 0} Second order: {(x, y) R n R : x y} Semidefinite: {X S n : X 0} Again, the class of problems that can be represented using these cones is extremely vast. A. d Aspremont. Convex Optimization M2. 21/49

Course Organization A. d Aspremont. Convex Optimization M2. 22/49

Course Plan Convex analysis & modeling Duality Algorithms: interior point methods, first order methods. Applications A. d Aspremont. Convex Optimization M2. 23/49

Grading Course website with lecture notes, homework, etc. http://di.ens.fr/~aspremon/optconvexem2.html A few homeworks, will be posted online. Email your homeworks to dm.daspremont@gmail.com you will get an automatic reply to your message if it has been received. A final exam. A. d Aspremont. Convex Optimization M2. 24/49

Short blurb Contact info on http://di.ens.fr/~aspremon Email: aspremon@ens.fr Dual PhDs: Ecole Polytechnique & Stanford University Interests: Optimization, machine learning, statistics & finance. A. d Aspremont. Convex Optimization M2. 25/49

References All lecture notes will be posted online Textbook: Convex Optimization by Lieven Vandenberghe and Stephen Boyd, available online at: http://www.stanford.edu/~boyd/cvxbook/ See also Ben-Tal and Nemirovski [2001], Lectures On Modern Convex Optimization: Analysis, Algorithms, And Engineering Applications, SIAM. http://www2.isye.gatech.edu/~nemirovs/ Nesterov [2003], Introductory Lectures on Convex Optimization, Springer. Nesterov and Nemirovskii [1994], Interior Point Polynomial Algorithms in Convex Programming, SIAM. A. d Aspremont. Convex Optimization M2. 26/49

Convex Sets A. d Aspremont. Convex Optimization M2. 27/49

Convex Sets affine and convex sets some important examples operations that preserve convexity generalized inequalities separating and supporting hyperplanes dual cones and generalized inequalities A. d Aspremont. Convex Optimization M2. 28/49

Affine set line through x 1, x 2 : all points x = θx 1 + (1 θ)x 2 (θ R) x 1 θ = 1.2 θ = 1 θ = 0.6 x 2 θ = 0 θ = 0.2 affine set: contains the line through any two distinct points in the set example: solution set of linear equations {x Ax = b} A. d Aspremont. Convex Optimization M2. 29/49

Convex set line segment between x 1 and x 2 : all points with 0 θ 1 x = θx 1 + (1 θ)x 2 convex set: contains line segment between any two points in the set x 1, x 2 C, 0 θ 1 = θx 1 + (1 θ)x 2 C examples (one convex, two nonconvex sets) A. d Aspremont. Convex Optimization M2. 30/49

Convex combination and convex hull convex combination of x 1,..., x k : any point x of the form with θ 1 + + θ k = 1, θ i 0 x = θ 1 x 1 + θ 2 x 2 + + θ k x k convex hull CoS: set of all convex combinations of points in S A. d Aspremont. Convex Optimization M2. 31/49

Convex cone conic (nonnegative) combination of x 1 and x 2 : any point of the form with θ 1 0, θ 2 0 x = θ 1 x 1 + θ 2 x 2 x 1 0 x 2 convex cone: set that contains all conic combinations of points in the set A. d Aspremont. Convex Optimization M2. 32/49

Hyperplanes and halfspaces hyperplane: set of the form {x a T x = b} (a 0) a x 0 x a T x = b halfspace: set of the form {x a T x b} (a 0) a x 0 a T x b a T x b a is the normal vector hyperplanes are affine and convex; halfspaces are convex A. d Aspremont. Convex Optimization M2. 33/49

Euclidean balls and ellipsoids (Euclidean) ball with center x c and radius r: B(x c, r) = {x x x c 2 r} = {x c + ru u 2 1} ellipsoid: set of the form {x (x x c ) T P 1 (x x c ) 1} with P S n ++ (i.e., P symmetric positive definite) x c other representation: {x c + Au u 2 1} with A square and nonsingular A. d Aspremont. Convex Optimization M2. 34/49

Norm balls and norm cones norm: a function that satisfies x 0; x = 0 if and only if x = 0 tx = t x for t R x + y x + y notation: is general (unspecified) norm; symb is particular norm norm ball with center x c and radius r: {x x x c r} 1 norm cone: {(x, t) x t} t 0.5 Euclidean norm cone is called secondorder cone 1 0 0 x 2 1 1 x 1 0 1 norm balls and cones are convex A. d Aspremont. Convex Optimization M2. 35/49

Polyhedra solution set of finitely many linear inequalities and equalities Ax b, Cx = d (A R m n, C R p n, is componentwise inequality) a 1 a2 a 5 P a 3 a 4 polyhedron is intersection of finite number of halfspaces and hyperplanes A. d Aspremont. Convex Optimization M2. 36/49

Positive semidefinite cone notation: S n is set of symmetric n n matrices S n + = {X S n X 0}: positive semidefinite n n matrices X S n + z T Xz 0 for all z S n + is a convex cone S n ++ = {X S n X 0}: positive definite n n matrices 1 example: [ x y y z ] S 2 + z 0.5 0 1 0 y 1 0 x 0.5 1 A. d Aspremont. Convex Optimization M2. 37/49

Operations that preserve convexity practical methods for establishing convexity of a set C 1. apply definition x 1, x 2 C, 0 θ 1 = θx 1 + (1 θ)x 2 C 2. show that C is obtained from simple convex sets (hyperplanes, halfspaces, norm balls,... ) by operations that preserve convexity intersection affine functions perspective function linear-fractional functions A. d Aspremont. Convex Optimization M2. 38/49

Intersection the intersection of (any number of) convex sets is convex example: S = {x R m p(t) 1 for t π/3} where p(t) = x 1 cos t + x 2 cos 2t + + x m cos mt for m = 2: 2 p(t) 1 0 1 x2 1 0 S 1 0 π/3 2π/3 π t 2 2 1 0 1 2 x 1 A. d Aspremont. Convex Optimization M2. 39/49

Affine function suppose f : R n R m is affine (f(x) = Ax + b with A R m n, b R m ) the image of a convex set under f is convex S R n convex = f(s) = {f(x) x S} convex the inverse image f 1 (C) of a convex set under f is convex C R m convex = f 1 (C) = {x R n f(x) C} convex examples scaling, translation, projection solution set of linear matrix inequality {x x 1 A 1 + + x m A m B} (with A i, B S p ) hyperbolic cone {x x T P x (c T x) 2, c T x 0} (with P S n +) A. d Aspremont. Convex Optimization M2. 40/49

Perspective and linear-fractional function perspective function P : R n+1 R n : P (x, t) = x/t, dom P = {(x, t) t > 0} images and inverse images of convex sets under perspective are convex linear-fractional function f : R n R m : f(x) = Ax + b c T x + d, dom f = {x ct x + d > 0} images and inverse images of convex sets under linear-fractional functions are convex A. d Aspremont. Convex Optimization M2. 41/49

example of a linear-fractional function f(x) = 1 x 1 + x 2 + 1 x 1 1 x2 0 C x2 0 f(c) 1 1 0 1 x 1 1 1 0 1 x 1 A. d Aspremont. Convex Optimization M2. 42/49

Generalized inequalities a convex cone K R n is a proper cone if K is closed (contains its boundary) K is solid (has nonempty interior) K is pointed (contains no line) examples nonnegative orthant K = R n + = {x R n x i 0, i = 1,..., n} positive semidefinite cone K = S n + nonnegative polynomials on [0, 1]: K = {x R n x 1 + x 2 t + x 3 t 2 + + x n t n 1 0 for t [0, 1]} A. d Aspremont. Convex Optimization M2. 43/49

generalized inequality defined by a proper cone K: x K y y x K, x K y y x int K examples componentwise inequality (K = R n +) x R n + y x i y i, i = 1,..., n matrix inequality (K = S n +) X S n + Y Y X positive semidefinite these two types are so common that we drop the subscript in K properties: many properties of K are similar to on R, e.g., x K y, u K v = x + u K y + v A. d Aspremont. Convex Optimization M2. 44/49

Minimum and minimal elements K is not in general a linear ordering: we can have x K y and y K x x S is the minimum element of S with respect to K if y S = x K y x S is a minimal element of S with respect to K if y S, y K x = y = x example (K = R 2 +) x 1 is the minimum element of S 1 x 2 is a minimal element of S 2 x 1 S 2 S 1 x 2 A. d Aspremont. Convex Optimization M2. 45/49

Separating hyperplane theorem if C and D are disjoint convex sets, then there exists a 0, b such that a T x b for x C, a T x b for x D a T x b a T x b D C a the hyperplane {x a T x = b} separates C and D strict separation requires additional assumptions (e.g., C is closed, D is a singleton) A. d Aspremont. Convex Optimization M2. 46/49

Supporting hyperplane theorem supporting hyperplane to set C at boundary point x 0 : {x a T x = a T x 0 } where a 0 and a T x a T x 0 for all x C a C x 0 supporting hyperplane theorem: if C is convex, then there exists a supporting hyperplane at every boundary point of C A. d Aspremont. Convex Optimization M2. 47/49

Dual cones and generalized inequalities dual cone of a cone K: examples K = {y y T x 0 for all x K} K = R n +: K = R n + K = S n +: K = S n + K = {(x, t) x 2 t}: K = {(x, t) x 2 t} K = {(x, t) x 1 t}: K = {(x, t) x t} first three examples are self-dual cones dual cones of proper cones are proper, hence define generalized inequalities: y K 0 y T x 0 for all x K 0 A. d Aspremont. Convex Optimization M2. 48/49

Minimum and minimal elements via dual inequalities minimum element w.r.t. K x is minimum element of S iff for all λ K 0, x is the unique minimizer of λ T z over S x S minimal element w.r.t. K if x minimizes λ T z over S for some λ K 0, then x is minimal λ 1 x 1 S x 2 λ 2 if x is a minimal element of a convex set S, then there exists a nonzero λ K 0 such that x minimizes λ T z over S A. d Aspremont. Convex Optimization M2. 49/49

* References A. Ben-Tal and A. Nemirovski. Lectures on modern convex optimization : analysis, algorithms, and engineering applications. MPS-SIAM series on optimization. Society for Industrial and Applied Mathematics : Mathematical Programming Society, Philadelphia, PA, 2001. N. K. Karmarkar. A new polynomial-time algorithm for linear programming. Combinatorica, 4:373 395, 1984. L. G. Khachiyan. A polynomial algorithm in linear programming (in Russian). Doklady Akademiia Nauk SSSR, 224:1093 1096, 1979. A. Nemirovskii and D. Yudin. Problem complexity and method efficiency in optimization. Nauka (published in English by John Wiley, Chichester, 1983), 1979. Y. Nesterov. Introductory Lectures on Convex Optimization. Springer, 2003. Y. Nesterov and A. Nemirovskii. Interior-point polynomial algorithms in convex programming. Society for Industrial and Applied Mathematics, Philadelphia, 1994. A. d Aspremont. Convex Optimization M2. 50/49