57
0

Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation

Main:24 Pages
6 Figures
Bibliography:1 Pages
1 Tables
Appendix:25 Pages
Abstract

Standard A/B tests in online experiments face statistical power challenges when testing multiple candidates simultaneously, while adaptive experimental designs (AED) alone fall short in inferring experiment statistics such as the average treatment effect, especially with many metrics (e.g., revenue, safety) and heterogeneous variances. This paper proposes a fixed-budget multi-metric AED framework with a two-phase structure: an adaptive exploration phase to identify the best treatment, and a validation phase with an A/B test to verify the treatment's quality and infer statistics. We propose SHRVar, which generalizes sequential halving (SH) (Karnin et al., 2013) with a novel relative-variance-based sampling and an elimination strategy built on reward z-values. It achieves a provable error probability that decreases exponentially, where the exponent generalizes the complexity measure for SH (Karnin et al., 2013) and SHVar (Lalitha et al., 2023) with homogeneous and heterogeneous variances, respectively. Numerical experiments verify our analysis and demonstrate the superior performance of this new framework.

View on arXiv
@article{zhang2025_2506.03062,
  title={ Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation },
  author={ Qining Zhang and Tanner Fiez and Yi Liu and Wenyang Liu },
  journal={arXiv preprint arXiv:2506.03062},
  year={ 2025 }
}
Comments on this paper