Paper | Presenter |
Online Mirror Descent and Dual Averaging: Keeping Pace in the Dynamic Case | Quan Nguyen |
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization | Ezra MacDonald |
Logarithmic Regret Algorithms for Online Convex Optimization | Yifeng Liu |
Efficient Algorithms for Online Decision Problems | Ali Mortazavi |
Online Sparse Reinforcement Learning | Homayoun Honari |
Anytime Online-to-Batch, Optimism and Acceleration | Mica Grant-Hagen |
Online Bandit Learning Against an Adaptive Adversary: From Regret to Policy Regret | Andrea Nguyen |
As part of this course, every student will complete a solo project with a theoretical focus. The idea of the project is to select a paper or two and present the main ideas of the paper(s) to the class. There also will be a brief (3-5 page report) due on Friday, April 14th at 11:59pm. Please submit via Brightspace, where you can also find a brief description of what to submit.
To help get you started thinking about papers, here is a list of papers in online learning:
For each project, there will be a 20 to 25 minute presentation (depending on the time available) to discuss your findings, plus a few minutes for questions. These presentations will happen in the last week of lectures.
Each of you also will submit a 3-5 page report, not including references. The maximum length is flexible depending on your project. For your report, please use the LaTeX style file and template below (this is just to get the geometry right, e.g. margins, font size, line spacing):