k-Armed Bandit Algorithm: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replacement - ". ----" to ". ----")
No edit summary
 
Line 1: Line 1:
A [[k-Armed Bandit Algorithm]] is an [[reinforcement learning algorithm]] that can by applied by a [[k-armed bandit system]] (to solve a [[k-armed bandit task]]).
A [[k-Armed Bandit Algorithm]] is an [[reinforcement learning algorithm]] that can be implemented by a [[k-armed bandit system]] (to solve a [[k-armed bandit task]]).
* <B>Context:</B>
* <B>Context:</B>
** It can range from being an [[Exact k-Armed Bandit Algorithm]] to being an [[Approximate k-Armed Bandit Algorithm]].
** It can range from being an [[Exact k-Armed Bandit Algorithm]] to being an [[Approximate k-Armed Bandit Algorithm]].
** ...
* <B>Example(s):</B>
* <B>Example(s):</B>
** a [[Poker k-Armed Bandit Algorithm]] ([[2005_MultiArmedBanditAlgorithmsandEm|Vermorel et al., 2005]]).
** a [[Poker k-Armed Bandit Algorithm]] ([[2005_MultiArmedBanditAlgorithmsandEm|Vermorel et al., 2005]]).
** a [[Weighted Majority Algorithm]] ([[1989_TheWeightedMajorityAlgorithm|Littlestone et al., 1989]]).
** a [[Weighted Majority Algorithm]] ([[1989_TheWeightedMajorityAlgorithm|Littlestone et al., 1989]]).
** an [[Upper Confidence Bound (UCB) Algorithm]], ...
** an [[Epsilon-Greedy Algorithm]], ...
** a [[Thompson Sampling Algorithm]], ...
** ...
* <B>Conter-Example(s):</B>
* <B>Conter-Example(s):</B>
** a [[Deterministic Game]].
** a [[Deterministic Game]].

Latest revision as of 06:12, 18 October 2023

A k-Armed Bandit Algorithm is an reinforcement learning algorithm that can be implemented by a k-armed bandit system (to solve a k-armed bandit task).



References

2015

2008

2005