

Poster

Gradient Methods for Online DR-Submodular Maximization with Stochastic Long-Term Constraints

Guanyu Nie · Vaneet Aggarwal · Christopher Quinn

West Ballroom A-D #5802
[ Paper ] [ Poster ] [ OpenReview ]
Thu 12 Dec 11 a.m. PST — 2 p.m. PST

Abstract: In this paper, we consider the problem of online monotone DR-submodular maximization subject to long-term stochastic constraints. Specifically, at each round $t \in [T]$, after committing an action $x_t$, a random reward $f_t(x_t)$ and an unbiased gradient estimate of the point $\widetilde{\nabla} f_t(x_t)$ (semi-bandit feedback) are revealed. Meanwhile, a budget of $g_t(x_t)$, which is linear and stochastic, is consumed of its total allotted budget $B_T$. We propose a gradient ascent based algorithm that achieves $\frac{1}{2}$-regret of $O(\sqrt{T})$ with $O(T^{3/4})$ constraint violation with high probability. Moreover, when first-order full-information feedback is available, we propose an algorithm that achieves $(1-1/e)$-regret of $O(\sqrt{T})$ with $O(T^{3/4})$ constraint violation. These algorithms significantly improve over the state-of-the-art in terms of query complexity.
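The abstract describes a gradient-ascent-style update under semi-bandit feedback with a stochastic linear budget. As a rough illustration only (not the authors' actual algorithm, whose details are in the paper), the sketch below runs a generic primal-dual online gradient ascent on $[0,1]^d$: the primal variable ascends a noisy Lagrangian gradient, and a dual variable tracks accumulated budget overshoot. The reward $f(x)=\sum_i \log(1+a_i x_i)$, the cost vectors, the step sizes, and the noise model are all hypothetical choices made for the demo.

```python
import numpy as np

rng = np.random.default_rng(0)

d, T = 5, 200
B_T = 0.5 * T            # total budget over the horizon (hypothetical)
rho = B_T / T            # per-round budget rate
eta, mu = 0.1, 0.1       # primal / dual step sizes (hypothetical)

# Hypothetical stochastic linear cost g_t(x) = <c_t, x> with mean c_bar
c_bar = rng.uniform(0.2, 0.8, size=d)
# Hypothetical monotone DR-submodular reward f(x) = sum(log(1 + a*x)) on [0,1]^d
a = rng.uniform(0.5, 2.0, size=d)

def grad_f(x):
    # Exact gradient of sum(log(1 + a*x)); noise is added below to mimic
    # the unbiased semi-bandit gradient estimate from the abstract.
    return a / (1.0 + a * x)

x = np.zeros(d)   # start at the origin of the cube [0,1]^d
lam = 0.0         # dual variable for the long-term budget constraint
spent = 0.0

for t in range(T):
    c_t = c_bar + 0.05 * rng.standard_normal(d)        # stochastic cost vector
    g_hat = grad_f(x) + 0.05 * rng.standard_normal(d)  # noisy gradient estimate
    # Primal ascent on the Lagrangian, projected back onto [0,1]^d
    x = np.clip(x + eta * (g_hat - lam * c_t), 0.0, 1.0)
    spent += float(c_t @ x)
    # Dual ascent: raise the price when per-round spending exceeds the rate
    lam = max(0.0, lam + mu * (float(c_t @ x) - rho))

violation = max(0.0, spent - B_T)  # realized long-term constraint violation
```

The dual variable `lam` acts as a price on the budget: when average spending exceeds `rho` it grows and pushes the primal iterate toward cheaper actions, which is the standard mechanism by which long-term constraint violation is kept sublinear.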
