Abstract:
|
Quantiles for latency data are well-established and broadly used metrics for monitoring website performance. As the Google Cloud Platform (GCP) is the gateway for all Cloud users to access Google Cloud service, the page performance is critical for successful user journeys. However, the extremely large traffic volume poses a couple unique challenges when we try to conduct real-time latency quantile comparison in online A/B experiments, such as storage limitation and computation time. In this talk, we will discuss how we have developed and implemented statistical methods which significantly reduce the computational cost and accommodate different storage strategies into the GCP performance monitoring system.
|