Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 146 - Statistical Reinforcement Learning
Type: Invited
Date/Time: Tuesday, August 10, 2021 : 10:00 AM to 11:50 AM
Sponsor: Section on Statistical Computing
Abstract #316705
Title: On Efficiency in Hierarchical Reinforcement Learning
Author(s): Zheng Wen*
Companies: DeepMind
Keywords: Hierarchical Reinforcement Learning; Reinforcement Learning; Efficient Exploration; Near-Optimal Planning
Abstract:

Hierarchical Reinforcement Learning (HRL) approaches promise to provide more efficient solutions to sequential decision making problems, both in terms of statistical as well as computational efficiency. While this has been demonstrated empirically over time in a variety of tasks, theoretical results quantifying the benefits of such methods are still few and far between. In this talk, we discuss the kind of structure in a Markov decision process which gives rise to efficient HRL methods. Specifically, we formalize the intuition that HRL can exploit well repeating "subMDPs", with similar reward and transition structure. We show that, under reasonable assumptions, a model-based Thompson sampling-style HRL algorithm that exploits this structure is statistically efficient, as established through a finite-time regret bound. We also establish conditions under which planning with structure-induced options is near-optimal and computationally efficient.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program