Abstract:
|
We investigate the sample efficiency of reinforcement learning in a gamma-discounted infinite-horizon Markov decision process (MDP) with state space S and action space A, assuming access to a generative model. Despite a number of prior works tackling this problem, a complete picture of the trade-offs between sample complexity and statistical accuracy has yet to emerge. In particular, prior results suffer from a sample size barrier, in the sense that their claimed statistical guarantees hold only when the sample size exceeds at least SA/(1-gamma)^2 (up to some log factor). In this talk, we break this barrier by certifying the minimax optimality of model-based reinforcement learning as soon as the sample size exceeds the order of SA/(1-gamma). To the best of our knowledge, this work provides the first minimax-optimal guarantee under a generative model that accommodates the entire range of sample sizes (below which finding a meaningful policy is information-theoretically impossible).
|
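For readers unfamiliar with the model-based (plug-in) paradigm the abstract refers to, the following minimal Python sketch illustrates the basic idea: draw N samples per state-action pair from the generative model, form the empirical MDP, and plan in it via value iteration. This is only an illustration under simplifying assumptions (known rewards, a hypothetical toy MDP, made-up names such as `plug_in_model_based_rl` and `sampler`); the talk's guarantees concern a sharp analysis of this paradigm, not this exact code.

```python
import numpy as np

def plug_in_model_based_rl(sampler, S, A, R, gamma, N, iters=2000):
    """Plan with the empirical MDP built from N generative-model samples
    per state-action pair (so the total sample size is N * S * A).

    sampler(s, a, n) -> array of n next states drawn from P(. | s, a)
    R: (S, A) reward matrix, assumed known here for simplicity
    """
    # Step 1: estimate the transition kernel by sampling each (s, a) N times.
    P_hat = np.zeros((S, A, S))
    for s in range(S):
        for a in range(A):
            counts = np.bincount(sampler(s, a, N), minlength=S)
            P_hat[s, a] = counts / N
    # Step 2: run value iteration on the empirical MDP.
    Q = np.zeros((S, A))
    for _ in range(iters):
        V = Q.max(axis=1)             # greedy value estimate
        Q = R + gamma * (P_hat @ V)   # Bellman optimality update
    return Q.argmax(axis=1)           # greedy policy w.r.t. the empirical MDP

# Toy usage: a random 5-state, 2-action MDP queried via a generative model.
rng = np.random.default_rng(0)
S, A, gamma, N = 5, 2, 0.9, 100
P = rng.dirichlet(np.ones(S), size=(S, A))   # true (unknown) transitions
R = rng.uniform(size=(S, A))                 # known reward table
sampler = lambda s, a, n: rng.choice(S, size=n, p=P[s, a])
policy = plug_in_model_based_rl(sampler, S, A, R, gamma, N)
print(policy)
```

In this notation, the result discussed in the talk says roughly that the per-pair budget N need only be on the order of 1/(1-gamma) for the resulting policy to be epsilon-optimal with minimax-optimal accuracy, rather than the 1/(1-gamma)^2 per-pair requirement implied by prior analyses.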