Abstract:
Reinforcement learning concerns algorithms that learn optimal control policies by interacting with or observing a system. Fitted Q-iteration is a framework in which a regression method is applied iteratively to approximate the value of state-action pairs. Because the state-action value function rarely has a predictable shape, non-parametric supervised learning methods are the typical choice. This greater modeling flexibility comes at the cost of large data requirements: if only a small amount of data is available, the supervised learning method is likely to over-generalize and approximate the value function poorly. In this paper, we propose using Marginalized Transition Models to estimate the process that produces the observations and then generating additional observations from this estimated process. Our contention is that these additional observations reduce the bias produced by the regression method's over-smoothing and can yield better policies than using the original data alone. As a proof of concept, the approach is applied to a scenario mimicking medical prescription policies for a disease with sporadically appearing symptoms.
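The abstract describes the pipeline only at a high level. The following is a minimal sketch, in Python with scikit-learn, of fitted Q-iteration combined with model-based data augmentation for a finite action space. The function names fitted_q_iteration and augment_with_model, the extra-trees regressor, and the per-action Gaussian transition model are illustrative assumptions; the Gaussian model is a stand-in and not the paper's Marginalized Transition Model.

import numpy as np
from sklearn.ensemble import ExtraTreesRegressor


def fitted_q_iteration(transitions, n_actions, gamma=0.95, n_iters=30):
    """Fitted Q-iteration: repeatedly regress Bellman targets onto (state, action) pairs.

    `transitions` is a list of (state, action, reward, next_state) tuples,
    with 1-D numpy array states and integer actions in [0, n_actions).
    """
    S = np.array([t[0] for t in transitions])
    A = np.array([t[1] for t in transitions]).reshape(-1, 1)
    R = np.array([t[2] for t in transitions])
    S_next = np.array([t[3] for t in transitions])
    X = np.hstack([S, A])

    q = None
    for _ in range(n_iters):
        if q is None:
            targets = R  # first pass: Q is approximated by the immediate reward
        else:
            # Bellman backup: reward plus discounted max over actions at the next state
            next_q = np.column_stack([
                q.predict(np.hstack([S_next, np.full((len(S_next), 1), a)]))
                for a in range(n_actions)
            ])
            targets = R + gamma * next_q.max(axis=1)
        # non-parametric regressor, as the abstract suggests (extra-trees chosen here)
        q = ExtraTreesRegressor(n_estimators=50, random_state=0).fit(X, targets)
    return q


def augment_with_model(transitions, n_samples, seed=0):
    """Generate synthetic transitions from an estimated transition process.

    NOTE: a simple per-action Gaussian model of (state, state change, reward) is used
    as an illustrative stand-in; it is NOT the paper's Marginalized Transition Model.
    """
    rng = np.random.default_rng(seed)
    dim = len(transitions[0][0])
    by_action = {}
    for s, a, r, s_next in transitions:
        by_action.setdefault(a, []).append(np.concatenate([s, s_next - s, [r]]))
    synthetic, actions = [], list(by_action)
    for _ in range(n_samples):
        a = actions[rng.integers(len(actions))]
        data = np.array(by_action[a])
        draw = rng.normal(data.mean(axis=0), data.std(axis=0) + 1e-6)
        s, delta, r = draw[:dim], draw[dim:2 * dim], draw[-1]
        synthetic.append((s, int(a), float(r), s + delta))
    return transitions + synthetic

With real transitions collected from the system, one would call augment_with_model to enlarge the dataset and then run fitted_q_iteration on the combined data; the greedy policy takes, in each state, the action with the highest predicted Q-value.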