Lecture 2: August 29, 2018

Similar documents
Lecture 2: August 29, 2018

Lecture 2: August 31

Convexity I: Sets and Functions

Lecture 4: Convexity

COM Optimization for Communications Summary: Convex Sets and Convex Functions

Convex Optimization. 2. Convex Sets. Prof. Ying Cui. Department of Electrical Engineering Shanghai Jiao Tong University. SJTU Ying Cui 1 / 33

CS675: Convex and Combinatorial Optimization Fall 2014 Convex Functions. Instructor: Shaddin Dughmi

Convex sets and convex functions

Convex Sets (cont.) Convex Functions

Simplex Algorithm in 1 Slide

Convex sets and convex functions

2. Convex sets. x 1. x 2. affine set: contains the line through any two distinct points in the set

Convex Optimization. Convex Sets. ENSAE: Optimisation 1/24

2. Convex sets. affine and convex sets. some important examples. operations that preserve convexity. generalized inequalities

EE/ACM Applications of Convex Optimization in Signal Processing and Communications Lecture 6

Shiqian Ma, MAT-258A: Numerical Optimization 1. Chapter 2. Convex Optimization

Introduction to Modern Control Systems

Convex Sets. CSCI5254: Convex Optimization & Its Applications. subspaces, affine sets, and convex sets. operations that preserve convexity

Tutorial on Convex Optimization for Engineers

CS675: Convex and Combinatorial Optimization Spring 2018 Convex Sets. Instructor: Shaddin Dughmi

Lecture 5: Properties of convex sets

Convex Optimization - Chapter 1-2. Xiangru Lian August 28, 2015

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 4: Convex Sets. Instructor: Shaddin Dughmi

Convexity Theory and Gradient Methods

Affine function. suppose f : R n R m is affine (f(x) =Ax + b with A R m n, b R m ) the image of a convex set under f is convex

Lecture 2 Convex Sets

60 2 Convex sets. {x a T x b} {x ã T x b}

Lecture: Convex Sets

Lecture 2 - Introduction to Polytopes

Mathematical Programming and Research Methods (Part II)

Lecture 1: Introduction

Lecture 2. Topology of Sets in R n. August 27, 2008

2. Convex sets. affine and convex sets. some important examples. operations that preserve convexity. generalized inequalities

Convex Geometry arising in Optimization

Convexity: an introduction

Numerical Optimization

Lecture 2 September 3

This lecture: Convex optimization Convex sets Convex functions Convex optimization problems Why convex optimization? Why so early in the course?

Convex Optimization Lecture 2

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Convex Optimization M2

Lecture 15: Log Barrier Method

Combinatorial Geometry & Topology arising in Game Theory and Optimization

Lecture 18: March 23

IE 521 Convex Optimization

Introduction to Convex Optimization. Prof. Daniel P. Palomar

Math 5593 Linear Programming Lecture Notes

DM545 Linear and Integer Programming. Lecture 2. The Simplex Method. Marco Chiarandini

LECTURE 10 LECTURE OUTLINE

CMU-Q Lecture 9: Optimization II: Constrained,Unconstrained Optimization Convex optimization. Teacher: Gianni A. Di Caro

Chapter 4 Convex Optimization Problems

Lecture 5: Duality Theory

EC 521 MATHEMATICAL METHODS FOR ECONOMICS. Lecture 2: Convex Sets

Week 5. Convex Optimization

LECTURE 7 LECTURE OUTLINE. Review of hyperplane separation Nonvertical hyperplanes Convex conjugate functions Conjugacy theorem Examples

Lecture 3. Corner Polyhedron, Intersection Cuts, Maximal Lattice-Free Convex Sets. Tepper School of Business Carnegie Mellon University, Pittsburgh

Lecture 3: Convex sets

CS675: Convex and Combinatorial Optimization Spring 2018 Consequences of the Ellipsoid Algorithm. Instructor: Shaddin Dughmi

Homework 1 (a and b) Convex Sets and Convex Functions

Convex Optimization. Chapter 1 - chapter 2.2

Locally convex topological vector spaces

FAQs on Convex Optimization

Applied Lagrange Duality for Constrained Optimization

Conic Duality. yyye

Aspects of Convex, Nonconvex, and Geometric Optimization (Lecture 1) Suvrit Sra Massachusetts Institute of Technology

Convexization in Markov Chain Monte Carlo

Introduction to optimization

MTAEA Convexity and Quasiconvexity

However, this is not always true! For example, this fails if both A and B are closed and unbounded (find an example).

Lecture 19 Subgradient Methods. November 5, 2008

FACES OF CONVEX SETS

Math 414 Lecture 2 Everyone have a laptop?

Lecture 6: Faces, Facets

Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization. Author: Martin Jaggi Presenter: Zhongxing Peng

THREE LECTURES ON BASIC TOPOLOGY. 1. Basic notions.

Convex Sets. Pontus Giselsson

M3P1/M4P1 (2005) Dr M Ruzhansky Metric and Topological Spaces Summary of the course: definitions, examples, statements.

Convexity. 1 X i is convex. = b is a hyperplane in R n, and is denoted H(p, b) i.e.,

Convex Optimization MLSS 2015

Section 5 Convex Optimisation 1. W. Dai (IC) EE4.66 Data Proc. Convex Optimisation page 5-1

Convexity and Optimization

Ellipsoid II: Grötschel-Lovász-Schrijver theorems

maximize c, x subject to Ax b,

Perspective Mappings. Contents

Chapter 4 Concepts from Geometry

Convexity and Optimization

Open and Closed Sets

Characterizing Improving Directions Unconstrained Optimization

Lecture 5: Linear Classification

Lecture 2 Optimization with equality constraints

MATH3016: OPTIMIZATION

Differential Geometry: Circle Patterns (Part 1) [Discrete Conformal Mappinngs via Circle Patterns. Kharevych, Springborn and Schröder]

Lec13p1, ORF363/COS323

REVIEW OF FUZZY SETS

MATH 890 HOMEWORK 2 DAVID MEREDITH

Division of the Humanities and Social Sciences. Convex Analysis and Economic Theory Winter Separation theorems

Lecture 19: November 5

IE 5531: Engineering Optimization I

EC422 Mathematical Economics 2

EXTREME POINTS AND AFFINE EQUIVALENCE

Transcription:

10-725/36-725: Convex Optimization Fall 2018 Lecturer: Ryan Tibshirani Lecture 2: August 29, 2018 Scribes: Yingjing Lu, Adam Harley, Ruosong Wang Note: LaTeX template courtesy of UC Berkeley EECS dept. Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications. They may be distributed outside this class only with the permission of the Instructor. 2.1 Recap: Convexity and Why Definition 2.1. A convex optimization problem is of the form: Subject to: min x D f(x) (2.1) g i (x) 0, i = 1,..., m h j (x) = 0, j = 1,..., r Where f and {g i } m i=1 are all convex and {h j} r j=1 are affine. Convex optimization has special property that any local minimizer is a global minimizer. (2.2) 2.2 Convex Sets Definition 2.2. A set C R n is a convex set if for any x, y C, we have tx+(1 t)y C for all t [0, 1]. Another way to put in slide is that line segment joining any two elements lies entirely in set Definition 2.3. Convex combination of x 1,..., x k R n is any linear combination of the form θ 1 x 1 +... + θ k x k with θ i 0 and k θ 1 = 1. i=1 Definition 2.4. For a set of C R n, the convex hull conv(c) is the smallest convex set that contains those vectors. Notice that a convex hull is always convex. 2.2.1 Convex Set Examples 2.2.1.1 Some simple ones Empty set, point, line. 2-1

2-2 Lecture 2: August 29, 2018 Norm ball: {x : x r}, for given norm, radius r. Hyperplane: {x : a T x = b} for given a, b. Half space: {x : a T x b}. Affine space:{x : Ax = b} for given matrix A and vector b. Polyhedron: {x : Ax b} ( is interpreted component-wise). The set {x : Ax b, Cx = d} is also polyhedron. Simplex conv(c) for a set C R n of affinely independent points. 2.2.2 Cones and convexity Definition 2.5. A set C R n is a cone if for any x C we have tx C for all t 0. Definition 2.6. A set C R n is a convex cone if for any x 1, x 2 C we have t 1 x 1 + t 2 x 2 C for all t 1, t 2 0. Definition 2.7. Conic combination of x 1,..., x k R n is any linear combination of the form θ 1 x 1 +... + θ k x k with θ i 0. Definition 2.8. For a set of vectors x 1,..., x k R n, the conic hull is the smallest set that contains all conic combinations of x 1,..., x k. 2.2.3 Examples of convex cones Norm cone: {(x, t) : x t} for any norm. The cone is called second-order cone when the norm is the l 2 norm. Normal cone: for any set C R n and any x C define N c (x) = {g : g T x g T y, for all y C}. This is always a convex cone. Positive Semi-definite cone: S n + = {X S n : X 0}. Here we use X to denote X is a positive semidefinite matrix. 2.2.4 Key properties of convex sets Separating hyperplane theorem: two disjoint convex sets have a separating hyperplane between them. Formally, if C, D are nonempty convex sets with C D =, then there exists a, b such that C {x : A T x b} and D {x : A T x b}. If two sets do not intersect, is there always a hyperplane that strictly separates them? Consider the set {(x, y) : y 0}, and also the epigraph {(x, y) : y bx, x 0}. The second set will get infinitely close to the first, so you cannot put a hyperplane bewteen them. This means you cannot expect that disjoint sets will be strictly separated. Supporting hyperplane theorem: a boundary point of a convex set has a supporting hyperplane passing through it. Formally, if C is a nonempty convex set, and x 0 bd(c), there exists a such that C {x : a T x a T x 0 }.

Lecture 2: August 29, 2018 2-3 2.2.5 Operations preserving convexity Intersection Scaling and translation Affine images and preimages. For any given affine function of the form f(x) = Ax + b and C is a convex set, then f(c) = {f(x) : x C} is convex. Also, if D is convex, then the inverse (or pre-image) f 1 (D) = {x : f(x) D} is convex. 2.2.6 Example: Linear matrix inequality solution set Given A 1,..., A k, B S n, a linear matrix inequality for a variable x R k looks like x 1 A 1 +... + x k A k B. (2.3) Let s prove that the set C of points x that satisfy the above inequality is convex. First approach: check that if two points lie in( the set, then all in-between points lie in the set. We can check this by seeing that this is true for any v: v T B ) k i=1 (tx i + (1 t) i )A i v 0. Second approach: define S n + = {Y, Y 0} which is convex, and define f(x) = B x i A i, then which is exactly our set. f 1 (S N + ) = {x : f(x) S N + } = {x : B x i A i 0}, (2.4) 2.2.7 More operations preserving convexity Perspective images and preimages. (There is a relationship here to pinhole cameras!) Linear-fractional images and preimages: The perspective map composed with an affine function, like f(x) = (Ax + b)/(c T x + d), called a linear-fractional function, preserves convexity! So if C dom(f) is contex, then so is f(c), and if D is convex then so is f 1 (D). 2.3 Convex functions Definition 2.9. A function f : R n R is convex if dom(f) R n is convex, and f(tx + (1 t)y) tf(x) + (1 t)f(y) for t 1 and x, y dom(f). In other words, the function always lies below the line segment joining f(x) and f(y). Concave functions have the opposite inequality, and f being concave implies f is convex. Important modifiers: Strictly convex: same definition but with strict inequalities, and the function is strictly below the line segments. The way I think about this is: f has more curvature than a linear function. Linear functions are convex but not strictly convex. Strongly convex with parameter m > 0: this says that if you subtract a quadratic, it s still convex: f m/2 x 2 is convex. In other words, f is more convex/curved than a quadratic function. Note that strong convexity implies strict convexity, which implies convexity. All of this is analogous for concave functions!

2-4 Lecture 2: August 29, 2018 2.3.1 Examples of convex functions Univariate functions (e ax for any a R, x a for a 1, x a for a 0 when x R + ; x a is concave for 0 a 1 for x R +, log x is concave over R ++. affine functions a T x + b are both convex and concave quadratic functions 1 2 xt Qx + b T x + c are convex provided that Q 0 (i.e., Q is positive semidefinite). least squares, like y Ax 2, because if you expand it, it looks like a quadratic function with Q = A T A, and A T A is always positive semidefinite (since b T A T Ab 0 since z T z = zi 2 where z = Ab). Norms all of them are convex! (Three things define a norm; briefly, these are: x 0, αx = α x, and x + y x + y. These also provide convexity.) The most common l p norms are 1, 2,. The l 1 norm is good for inducing sparsity, l 2 is ubiquitous because we use it to measure distances, and l is useful for reasons we ll see later. (Note that the l 0 norm does not satisfy the triangle inequality, so it is not a norm, not convex, and not our friend.) Operator (also called spectral) and trace (also called nuclear) norms are also convex. Indicator functions. If C is convex, then the indicator function I C (x), defined as 0 inside the set and outside, is also convex. Let s check this. We have f(x) = I C (x). We need to first check the domain: dom(f) = C is convex by assumption; good. Now, for any convex combination of x, y dom(f), we need f(tx + (1 t)y) tf(x)+(1 t)f(y). On the right hand side, f(x) and f(y) are zero (since x, y are in the domain), so we have tf(x) + (1 t)f(y)) t0 + (1 t)0 = 0. On the left hand side, we know the term (tx + (1 t)y) lies inside the set (since the domain C is convex), and the indicator function gives 0 for all values in the set, so we have 0 = 0 and we are done. 2.3.2 Key properties of convex functions Epigraph characterization: a function f is convex its epigraph (which is the set of all points above the function) is a convex set. In other words, f is convex if its epigraph epi(f) = {(x, t) dom(f) R : f(x) t} is also convex. Convex sublevel sets: if f is convex, then its sublevel sets, defined by {x dom(f) : f(x) t}, are convex, for all t R. The converse is not true! First-order characterization: if f is differentiable, then f is convex dom(f) is convex and f(y) f(x) + f(x) T (y x) for all x, y dom(f). This means that in a differentiable convex function, f(x) = 0 = x minimizes f. Second-order characterization: if f is twice differentiable, then f is convex dom(f) is convex, and 2 f(x) 0 for all x dom(f). You might wonder about that inequality. Consider f(x) = x 4, which is quadratic, with second derivative zero at zero. So, it s strictly convex, but the second derivative is not strictly positive. Jensen s inequality: if f is convex, and X is a random variable on dom(f), then f(e[x]) E[f(x)]. ( k Log-sum-exp function: g(x) = log i=1 eat i x+bi ) since it smoothly approximates max i=1,...,k (a T i x + b i). for fixed a i, b i. This is often called the soft max,

Lecture 2: August 29, 2018 2-5 2.3.3 Operations preserving convexity Nonnegative linear combination: f 1,..., f m being convex implies a 1 f 1 +... + a m f m is also convex, for any a i 0. Pointwise maximization: if f s for s S are all convex, then the pointwise max of them is also convex. Note that the functions can be discrete or continuous, and the set S can even be infinite! Partial minimization: if g(x, y) is convex in x, y, and C is convex, then f(x) = min y C g(x, y) is convex. 2.3.4 Example: distances to a set Consider the max (or min) distance to a set, under an arbitrary norm, written f(x) = max y C x y, is convex (whether C is convex or not). This is because the norm is convex, and the pointwise max is also convex. As for the min distance to the set, it is convex as long as the set C is convex. 2.4 Contributions Yingjing Lu: Section 1.1 to 1.2.3 Adam Harley: Section 1.2.4 to the end. Ruosong Wang: Revisions.