Adaptive Algorithms for Online Optimization

Elad Hazan
IBM Almaden


Wednesday, February 27, 2008
4:30 - 5:30 PM
Terman Engineering Center, Room 453


Abstract:

In this talk we study the notion of learning in an oblivious changing environment. Existing online learning algorithms which minimize regret are shown to converge to the average of all locally optimal solutions. We propose a new performance metric, strengthening the standard metric of regret, to capture convergence to locally optimal solutions. We then describe a series of reductions which transform algorithms for minimizing (standard) regret into adaptive algorithms albeit incurring only poly-logarithmic computational overhead.

We describe applications of this technique to various well studied online problems, such as online routing, portfolio management and the tree update problem. In all cases we explain how previous algorithms perform suboptimally and how the reduction technique gives adaptive algorithms.

Our reductions combine techniques from data streaming algorithms, composition of learning algorithms and a novel use of unbiased gradient estimates.

This talk is self-contained - no prior knowledge in machine learning or optimization is required.

Joint work with C. Seshadhri, Princeton University







Operations Research Colloquia: http://or.stanford.edu/oras_seminars.html