Stochastic Multi-Armed Bandit Task

A Stochastic Multi-Armed Bandit Task is a Multi-Armed Bandit Task that is a ...

AKA: Stochastic MAB.
See: Stochastic Task.

References

2009

(Audibert et al., 2009) ⇒ Jean-Yves Audibert, Rémi Munos, and Csaba Szepesvári. (2009). “Exploration-exploitation Tradeoff Using Variance Estimates in Multi-armed Bandits.” In: Theoretical Computer Science, 410(19). doi:10.1016/j.tcs.2009.01.016
- QUOTE: Algorithms based on upper confidence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, efficient and effective. This paper considers a variant of the basic algorithm for the stochastic, multi-armed bandit problem that takes into account the empirical variance of the different arms.
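
The idea in the quote above, an upper confidence index that also uses each arm's empirical variance, can be illustrated with a minimal sketch. This is not the paper's exact algorithm or its tuned constants: the function names `ucb_v_index` and `run_bandit`, the parameters `b`, `zeta`, and `c` with their default values, and the choice of Bernoulli rewards are all assumptions made for illustration.

```python
import math
import random


def ucb_v_index(mean, var, pulls, t, b=1.0, zeta=1.2, c=1.0):
    """Variance-aware upper confidence index for a single arm.

    Assumes rewards lie in [0, b]. The defaults for zeta and c are
    illustrative choices, not values taken from the source.
    """
    explore = zeta * math.log(t)
    # Empirical mean + variance-dependent bonus + range-dependent correction.
    return mean + math.sqrt(2.0 * var * explore / pulls) + c * 3.0 * b * explore / pulls


def run_bandit(arm_probs, horizon, seed=0):
    """Simulate a stochastic bandit with Bernoulli arms (an assumed reward model).

    Returns the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    k = len(arm_probs)
    pulls = [0] * k
    sums = [0.0] * k      # running sum of rewards per arm
    sq_sums = [0.0] * k   # running sum of squared rewards per arm

    for t in range(1, horizon + 1):
        if t <= k:
            # Initialization: pull every arm once so each index is defined.
            arm = t - 1
        else:
            indices = []
            for i in range(k):
                mean = sums[i] / pulls[i]
                # Empirical (biased) variance, clipped at zero for numerical safety.
                var = max(sq_sums[i] / pulls[i] - mean * mean, 0.0)
                indices.append(ucb_v_index(mean, var, pulls[i], t))
            arm = max(range(k), key=indices.__getitem__)

        # Draw a stochastic reward from the chosen arm's fixed distribution.
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        pulls[arm] += 1
        sums[arm] += reward
        sq_sums[arm] += reward * reward

    return pulls


if __name__ == "__main__":
    print(run_bandit([0.2, 0.8], horizon=2000))
```

In this sketch, an arm whose observed rewards vary little receives a smaller exploration bonus than an arm with the same number of pulls but noisier rewards, so low-variance suboptimal arms are abandoned sooner than they would be under a variance-agnostic bonus.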
