In March 2016, Google DeepMind's AlphaGo, a computer Go-playing program, defeated the reigning human world champion of Go 4 games to 1, a feat far more impressive than the previous victories of computer programs in chess (IBM's Deep Blue) and Jeopardy! (IBM's Watson). In December 2017, AlphaZero, a successor to AlphaGo designed for more general game playing, dominated Stockfish, the well-known computer chess program (rated far higher than any human), in a 100-game match, winning 28 games and losing none (with 72 draws). The main engine behind these programs combines machine learning approaches with a technique called Monte Carlo tree search. After reviewing the history and background of AlphaGo and AlphaZero, we trace the origins of Monte Carlo tree search (in particular, its relationship to simulation-based algorithms for Markov decision processes), discuss its present role in AlphaZero, and propose a substantial modification of the algorithm based on results from the well-established field of statistical ranking and selection. We present preliminary computational results based on simulations of an inventory control problem.