Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 199 - Innovations in Modeling Computer Experiments
Type: Contributed
Date/Time: Tuesday, August 4, 2020 : 10:00 AM to 2:00 PM
Sponsor: Section on Physical and Engineering Sciences
Abstract #312543
Title: Prediction for Distributional Outcomes in High-Performance Computing I/O Variability
Author(s): Li Xu* and Yili Hong and Layne Watson and Kirk Cameron
Companies: Virginia Tech and and Department of Computer Science, Virginia Tech and Department of Computer Science, Virginia Tech
Keywords: Computer Experiments; Gaussian Process; Reliability; Approximation Methods
Abstract:

One important research area in high-performance computing (HPC) is the management of performance variability, which is affected by complicated interactions of numerous factors, such as CPU frequency, the number of I/O threads, file size and record size. In this paper, we are interested in the I/O variability, which is measured by the I/O throughputs that varies from run to run. Given a system configuration, one way to describe the variability is to use the cumulative distribution function (cdf) for the throughputs. Prediction the cdf of throughputs for a new system configuration is often of interest, which, however, is a challenging task. To overcome this challenge, computer scientists conducted large-scale experiments and collected a mass amount of data for the distribution of variability under various system configurations. We develop a Gaussian process model to predict the cdf under a new configuration using the experimental data. We also impose a monotone constraint so that the predicted function is monotonically increasing, which is one desired property of the cdf. We evaluate the performance of the proposed method by using the experimental data. We also discuss the methodolog


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program