Sleeping Combinatorial Bandits

Abhishek, Kumar; Ghalme, Ganesh; Gujar, Sujit; Narahari, Yadati

Computer Science > Machine Learning

arXiv:2106.01624 (cs)

[Submitted on 3 Jun 2021]

Title:Sleeping Combinatorial Bandits

Authors:Kumar Abhishek, Ganesh Ghalme, Sujit Gujar, Yadati Narahari

View PDF

Abstract:In this paper, we study an interesting combination of sleeping and combinatorial stochastic bandits. In the mixed model studied here, at each discrete time instant, an arbitrary \emph{availability set} is generated from a fixed set of \emph{base} arms. An algorithm can select a subset of arms from the \emph{availability set} (sleeping bandits) and receive the corresponding reward along with semi-bandit feedback (combinatorial bandits).
We adapt the well-known CUCB algorithm in the sleeping combinatorial bandits setting and refer to it as \CSUCB. We prove -- under mild smoothness conditions -- that the \CSUCB\ algorithm achieves an $O(\log (T))$ instance-dependent regret guarantee. We further prove that (i) when the range of the rewards is bounded, the regret guarantee of \CSUCB\ algorithm is $O(\sqrt{T \log (T)})$ and (ii) the instance-independent regret is $O(\sqrt[3]{T^2 \log(T)})$ in a general setting. Our results are quite general and hold under general environments -- such as non-additive reward functions, volatile arm availability, a variable number of base-arms to be pulled -- arising in practical applications. We validate the proven theoretical guarantees through experiments.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2106.01624 [cs.LG]
	(or arXiv:2106.01624v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.01624

Submission history

From: Kumar Abhishek [view email]
[v1] Thu, 3 Jun 2021 06:49:44 UTC (3,076 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Kumar Abhishek
Ganesh Ghalme
Sujit Gujar
Yadati Narahari

export BibTeX citation

Computer Science > Machine Learning

Title:Sleeping Combinatorial Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sleeping Combinatorial Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators