23
35

Distributionally Robust Submodular Maximization

Abstract

Submodular functions have applications throughout machine learning, but in many settings, we do not have direct access to the underlying function ff. We focus on stochastic functions that are given as an expectation of functions over a distribution PP. In practice, we often have only a limited set of samples fif_i from PP. The standard approach indirectly optimizes ff by maximizing the sum of fif_i. However, this ignores generalization to the true (unknown) distribution. In this paper, we achieve better performance on the actual underlying function ff by directly optimizing a combination of bias and variance. Algorithmically, we accomplish this by showing how to carry out distributionally robust optimization (DRO) for submodular functions, providing efficient algorithms backed by theoretical guarantees which leverage several novel contributions to the general theory of DRO. We also show compelling empirical evidence that DRO improves generalization to the unknown stochastic submodular function.

View on arXiv
Comments on this paper