| Activity Number: | 83 
                            	- Your Invited Poster Evening Entertainment: No Longer Board | 
                    
                        | Type: | Invited | 
                    
                        | Date/Time: | Sunday, July 30, 2017 : 8:30 PM to 10:30 PM | 
                    
                        | Sponsor: | Section on Statistical Learning and Data Science | 
                
                    
                        | Abstract #323040 |  | 
                    
                        | Title: | Sufficient Markov Decision Processes with Alternating Deep Neural Networks | 
                
                
                    | Author(s): | Longshaokan Wang* and Eric Laber and Katie Witkiewitz | 
                
                    | Companies: | North Carolina State University and North Carolina State University and University of New Mexico | 
                
                    | Keywords: | Markov Decision Processes ; 
                            Dimension Reduction ; 
                            Deep Learning ; 
                            Reinforcement Learning ; 
                            Mobile Health ; 
                            Sufficient Statistics | 
                
                    | Abstract: | 
                            Advances in mobile computing technologies have made it possible to monitor and apply data-driven interventions across complex systems in real time. Markov decision processes (MDPs) are the primary model for sequential decision problems with a large or indefinite time horizon. Choosing a representation of the underlying decision process that is both Markov and low-dimensional is non-trivial. We propose a method for constructing a low-dimensional representation of the original decision process for which: 1. the MDP model holds; 2. a decision strategy that maximizes mean utility when applied to the low-dimensional representation also maximizes mean utility when applied to the original process. We use a deep neural network to define a class of potential process representations and estimate the process of lowest dimension within this class. The method is illustrated using data from a mobile study on heavy drinking and smoking among college students.   
                         | 
                
                
                    Authors who are presenting talks have a * after their name.