Thompson Sampling with a Mixture Prior

Hong, Joey; Kveton, Branislav; Zaheer, Manzil; Ghavamzadeh, Mohammad; Boutilier, Craig

Computer Science > Machine Learning

arXiv:2106.05608v1 (cs)

[Submitted on 10 Jun 2021 (this version), latest version 5 Mar 2022 (v2)]

Title:Thompson Sampling with a Mixture Prior

Authors:Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier

View PDF

Abstract:We study Thompson sampling (TS) in online decision-making problems where the uncertain environment is sampled from a mixture distribution. This is relevant to multi-task settings, where a learning agent is faced with different classes of problems. We incorporate this structure in a natural way by initializing TS with a mixture prior -- dubbed MixTS -- and develop a novel, general technique for analyzing the regret of TS with such priors. We apply this technique to derive Bayes regret bounds for MixTS in both linear bandits and tabular Markov decision processes (MDPs). Our regret bounds reflect the structure of the problem and depend on the number of components and confidence width of each component of the prior. Finally, we demonstrate the empirical effectiveness of MixTS in both synthetic and real-world experiments.

Comments:	22 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2106.05608 [cs.LG]
	(or arXiv:2106.05608v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.05608

Submission history

From: Joey Hong [view email]
[v1] Thu, 10 Jun 2021 09:21:07 UTC (1,544 KB)
[v2] Sat, 5 Mar 2022 06:17:27 UTC (1,548 KB)

Computer Science > Machine Learning

Title:Thompson Sampling with a Mixture Prior

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Thompson Sampling with a Mixture Prior

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators