Online Program Home
  My Program

Abstract Details

Activity Number: 478 - Online Machine Learning for Prediction and Sequential Decision Making
Type: Invited
Date/Time: Wednesday, August 2, 2017 : 10:30 AM to 12:20 PM
Sponsor: IMS
Abstract #322221 View Presentation
Title: Regret Bounds for Adaptive Control of Linear Quadratic Systems
Author(s): Mohamad Kazem Shirani Faradonbeh and Ambuj Tewari* and George Michailidis
Companies: University of Michigan and University of Michigan and University of Florida
Keywords: adaptive control ; reinforcement learning ; LQ systems ; vector autoregression ; optimism in face of uncertainty ; regret bounds
Abstract:

The study of dynamical systems with linear dynamics and quadratic costs is a classical topic in control theory. However, finite time regret bounds for these systems have only been derived recently. Unfortunately, existing regret bounds suffer from a number of deficiencies including stringent assumptions on the underlying system and undesirable (e.g., exponential) dependence on the dimension of the state space. In this talk, I will describe our recent efforts to mitigate these deficiencies. On the way to regret bounds, I will also touch upon finite time estimation error bounds for general (i.e., not necessarily stable) vector autoregressive (VAR) time series models.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association