JSM 2017 Online Program

Activity Number:	478 - Online Machine Learning for Prediction and Sequential Decision Making
Type:	Invited
Date/Time:	Wednesday, August 2, 2017 : 10:30 AM to 12:20 PM
Sponsor:	IMS
Abstract #322221	View Presentation
Title:	Regret Bounds for Adaptive Control of Linear Quadratic Systems
Author(s):	Mohamad Kazem Shirani Faradonbeh and Ambuj Tewari* and George Michailidis
Companies:	University of Michigan and University of Michigan and University of Florida
Keywords:	adaptive control ; reinforcement learning ; LQ systems ; vector autoregression ; optimism in face of uncertainty ; regret bounds
Abstract:	The study of dynamical systems with linear dynamics and quadratic costs is a classical topic in control theory. However, finite time regret bounds for these systems have only been derived recently. Unfortunately, existing regret bounds suffer from a number of deficiencies including stringent assumptions on the underlying system and undesirable (e.g., exponential) dependence on the dimension of the state space. In this talk, I will describe our recent efforts to mitigate these deficiencies. On the way to regret bounds, I will also touch upon finite time estimation error bounds for general (i.e., not necessarily stable) vector autoregressive (VAR) time series models.

Authors who are presenting talks have a * after their name.