19
9

Adaptive Regularized Submodular Maximization

Shaojie Tang
Jing Yuan
Abstract

In this paper, we study the problem of maximizing the difference between an adaptive submodular (revenue) function and an non-negative modular (cost) function under the adaptive setting. The input of our problem is a set of nn items, where each item has a particular state drawn from some known prior distribution pp. The revenue function gg is defined over items and states, and the cost function cc is defined over items, i.e., each item has a fixed cost. The state of each item is unknown initially, one must select an item in order to observe its realized state. A policy π\pi specifies which item to pick next based on the observations made so far. Denote by gavg(π)g_{avg}(\pi) the expected revenue of π\pi and let cavg(π)c_{avg}(\pi) denote the expected cost of π\pi. Our objective is to identify the best policy πoargmaxπgavg(π)cavg(π)\pi^o\in \arg\max_{\pi}g_{avg}(\pi)-c_{avg}(\pi) under a kk-cardinality constraint. Since our objective function can take on both negative and positive values, the existing results of submodular maximization may not be applicable. To overcome this challenge, we develop a series of effective solutions with performance grantees. Let πo\pi^o denote the optimal policy. For the case when gg is adaptive monotone and adaptive submodular, we develop an effective policy πl\pi^l such that gavg(πl)cavg(πl)(11eϵ)gavg(πo)cavg(πo)g_{avg}(\pi^l) - c_{avg}(\pi^l) \geq (1-\frac{1}{e}-\epsilon)g_{avg}(\pi^o) - c_{avg}(\pi^o), using only O(nϵ2logϵ1)O(n\epsilon^{-2}\log \epsilon^{-1}) value oracle queries. For the case when gg is adaptive submodular, we present a randomized policy πr\pi^r such that gavg(πr)cavg(πr)1egavg(πo)cavg(πo)g_{avg}(\pi^r) - c_{avg}(\pi^r) \geq \frac{1}{e}g_{avg}(\pi^o) - c_{avg}(\pi^o).

View on arXiv
Comments on this paper