Abstract:
|
Deep Gaussian processes (DGPs) are increasingly popular as predictive models in machine learning (ML) for their non-stationary flexibility and ability to cope with abrupt regime changes in the training data. Here we explore DGPs as surrogates for computer simulation experiments. In particular, we exploit the DGP's automatic warping of the input space and full uncertainty quantification (UQ) through Bayesian posterior inference to develop active learning (AL) strategies that distribute runs non-uniformly in the input space. Building up the design sequentially allows smaller training sets, both limiting expensive evaluation of the simulator code and mitigating the cubic costs of DGP inference. To ensure that AL criteria synthesize all relevant posterior mean and variance elements, we depart from the ML preference for thrifty variational inference of DGPs and instead promote full MCMC. When training data sizes are kept small through careful acquisition, and with a parsimonious layout of latent layers, the framework can be both effective and computationally tractable. We provide an open-source implementation in the “deepgp” package on CRAN.
|