Weighted Policy Learner (WPL) Algorithm

Revision as of 21:13, 9 May 2024 by Gmelli (talk | contribs) (Text replacement - "tems]]" to "tem]]s")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

A Weighted Policy Learner (WPL) Algorithm is a Multi-Agent Reinforcement Learning (MARL) Algorithm that enables agents to converge to a Nash Equilibrium assuming each agent is oblivious to other agents and receives only one type of feedback.



References

2008

2007