Generic descent algorithm Generalization to multiple dimensions Problems of descent methods, possible improvements Fixes Local minima

Size: px

Start display at page:

Download "Generic descent algorithm Generalization to multiple dimensions Problems of descent methods, possible improvements Fixes Local minima"

Allyson Charles
6 years ago
Views:

1 1 Lecture 10: descent methods Generic descent algorithm Generalization to multiple dimensions Problems of descent methods, possible improvements Fixes Local minima Gradient descent (reminder) Minimum of a function is found by following the slope of the function f f(x) guess f(m) m x

2 2 Gradient descent (illustration) f f(x) next step guess f(m) m x Gradient descent (illustration) f f(x) guess new gradient next step f(m) m x

3 3 Gradient descent (illustration) f f(x) guess next step f(m) m x Gradient descent (illustration) f f(x) guess f(m) stop m x

4 4 Gradient descent: algorithm guess Gradient descent: algorithm Direction: downhill

5 5 Gradient descent: algorithm step Gradient descent: algorithm Now you are here

6 6 Gradient descent: algorithm Stop when close from minimum Gradient descent: algorithm guess = x direction = -f (x) step = h > 0 x:=x hf (x) f (x)~0

7 7 Example of 2D gradient: pic of the MATLAB demo Illustration of the gradient in 2D Example of 2D gradient: pic of the MATLAB demo Illustration of the gradient in 2D

8 8 Example of 2D gradient: pic of the MATLAB demo Illustration of the gradient in 2D Example of 2D gradient: pic of the MATLAB demo Definition of the gradient in 2D This is just a genaralization of the derivative in two dimensions. This can be generalized to any dimension.

9 9 Example of 2D gradient: pic of the MATLAB demo Illustration of the gradient in 2D Example of 2D gradient: pic of the MATLAB demo Gradient descent works in 2D

10 10 Generalization to multiple dimensions guess Generalization to multiple dimensions Direction: downhill

11 11 Generalization to multiple dimensions step Generalization to multiple dimensions Now you are here

12 12 Generalization to multiple dimensions Stop when close from minimum Generalization to multiple dimensions guess = x direction = -f(x) step = h > 0 x:=x h Vf (x) Vf (x)~0

13 13 Multiple dimensions Everything that you have seen with derivatives can be generalized with the gradient. For the descent method, f (x) can be replaced by In two dimensions, and by in N dimensions. Example of 2D gradient: MATLAB demo The cost to buy a portfolio is: Stock N Stock i Stock 2 Stock 1 If you want to minimize the price to buy your portfolio, you need to compute the gradient of its price:

14 14 Problem 1: choice of the step f When updating the current computation: - small steps: inefficient - large steps: potentially bad results f(x) guess f(m) stop m Too many steps: takes too long to converge x Problem 1: choice of the step f When updating the current computation: - small steps: inefficient - large steps: potentially bad results f(x) Next point (went too far) f(m) m Current point x

15 15 Problem 2: «ping pong effect» [S. Boyd, L. Vandenberghe, Convex Convex Optimization lect. Notes, Stanford Univ ] Problem 2: «ping pong effect» [S. Boyd, L. Vandenberghe, Convex Convex Optimization lect. Notes, Stanford Univ ]

16 16 Problem 2: (other norm dependent issues) [S. Boyd, L. Vandenberghe, Convex Convex Optimization lect. Notes, Stanford Univ ] Problem 3: stopping criterion Intuitive criterion: In multiple dimensions: Or equivalently Rarely used in practice. More about this in EE227A (convex optimization, Prof. L. El Ghaoui).

17 17 Fixes Several methods exist to address this problem - Line search methods, in particular - Backtracking line search - Exact line search - Normalized steepest descent - Newton steps Fundamental problem of the method: local minima Local minima: pic of the MATLAB demo The iterations of the algorithm converge to a local minimum

18 18 Local minima: pic of the MATLAB demo View of the algorithm is «myopic»

Gradient descent (illustration) Gradient descent (illustration) Gradient descent (illustration) Gradient descent (illustration)

Gradient descent (illustration) Gradient descent (illustration) Gradient descent (illustration) Gradient descent (illustration) 1 Lecture 10: descent ethods Gradient descent (reinder) Miniu o a unction is ound by ollowing the slope o the unction Generic descent algorith Generalization to ultiple diensions Probles o descent ethods,