ECE 490: Lecture Schedule
The schedule is tentative (except for the dates for the two in-class midterms) and will be updated and revised as the course progresses. Required readings from Wright and Recht will be indicated on the left.
- Tue Jan 21
- No class (❄️❄️❄️)
- Thu Jan 23
Ch. 1
- Introduction and administrivia
Data analysis and optimization: examples
- Tue Jan 28
Thu Jan 30
Ch. 2
- Foundations of smooth optimization
Taylor's theorem and some consequences
Notions of smoothness: Lipschitz continuity, Lipschitz gradients
Necessary and sufficient conditions for local minima
Convexity and strong convexity, interplay with Lipschitz gradients
- Tue Feb 4
Thu Feb 6
Sections 3.1, 3.2, 3.4, 3.5
- Descent methods
Directions of descent: definition, sufficient conditions
Steepest descent method (a.k.a. gradient method)
Convergence rates for smooth or smooth + strongly convex functions
Line-search methods: choosing the direction and the steplength
- Tue Feb 11
Sections 3.5, 3.6
Thu Feb 13
Ch. 4
- Descent methods (cont.)
Approximate line-search: weak Wolfe conditions
Mirror descent
Gradient methods using momentum
Motivation using differential equations
Heavy ball method and Nesterov's accelerated method
- Tue Feb 18
- Review and Q&A
- Thu Feb 20
- In-class midterm 1
- Tue Feb 25
Thu Feb 27
Sections 5.1-5.3
- Stochastic gradient methods
Motivation and examples
Randomized incremental gradient
Robbins-Monro scheme
Randomized Kaczmarz method
Key assumptions on stochastic gradients, with examples
- Tue Mar 4
Sections 5.4-5.5
Thu Mar 6
Sections 6.1-6.2
- Stochastic gradient methods
Convergence analysis
Implementation aspects: epochs, mini-batching, momentum
Coordinate descent
Motivation and examples from machine learning
Local Lipschitz constants
Convergence analysis of randomized coordinate descent: sampling with replacement
- Tue Mar 11
Thu Mar 13
Ch. 7
- First-order methods for constrained optimization
Directions of descent, feasible directions, normal cone
First-order necessary condition for local optimality
Euclidean projection onto a closed convex set: definition, examples, the minimum principle
Projected gradient descent: convergence analysis
Conditional gradient (Frank-Wolfe) method
- Tue Mar 18
Thu Mar 20
- No class: Spring Break
- Tue Mar 25
Thu Mar 27
Ch. 8
- Nonsmooth functions and subgradients
Subgradients and subdifferentials: motivation, definition, examples
First-order optimality condition in terms of subgradients
A supporting hyperplane interpretation
- Tue Apr 1
- Review and Q&A
- Thu Apr 4
- In-class midterm 2
- Tue Apr 8
- No class
- Thu Apr 10
Ch. 8
- Nonsmooth functions and subgradients (cont.)
Calculus of subdifferentials (additivity, composition with affine transformations, Danskin's theorem)
Convex sets and convex constrained optimization
- Tue Apr 15
Thu Apr 17
Ch. 9
- Nonsmooth optimization methods
Motivating example for nonsmooth optimization: LASSO
Direction of steepest descent: minimum-norm subgradient
Subgradient descent: rates of convergence with constant and dmininishing step lengths
Proximal gradient methods for regularized optimization
- Tue Apr 22
Thu Apr 24
Ch. 10
- Duality and algorithms
Optimization subject to linear equality constraints
Quadratic penalty, Lagrangians, weak duality
Necessary and sufficient conditions for optimality
Strong duality
Dual optimization algorithms
Augmented Lagrangian method, ADMM
Examples
- Tue Apr 29
Thu May 1
Ch. 11
- Differentiation and adjoints
Multivariable chain rule, Jacobian-vector products
Computation of gradients of nested compositions of vector-valued functions
The method of adjoints and its Lagrangian interpretation
Automatic differentiation and backpropagation
- Tue May 6
- Review and Q&A