Weighted Policy Learner (WPL) Algorithm

From GM-RKB

A Weighted Policy Learner (WPL) Algorithm is a Multi-Agent Reinforcement Learning (MARL) Algorithm that enables agents to converge to a Nash Equilibrium under the assumption that each agent is oblivious to the other agents and receives only one type of feedback (its own reward).
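
The definition above can be illustrated with a minimal Python sketch. It is not taken from this page. It assumes the commonly described form of the WPL update, in which each action's estimated policy gradient is weighted by the action's current probability when the gradient is negative and by one minus that probability when it is positive. The names `wpl_update`, `project`, and `self_play`, the running-average reward estimate, the clip-and-renormalize projection, and all parameter values are illustrative assumptions.

```python
import random


def project(policy, eps):
    # Assumption: a simple clip-and-renormalize step stands in for the
    # projection onto the set of valid policies; it keeps every
    # probability strictly positive.
    clipped = [max(p, eps) for p in policy]
    total = sum(clipped)
    return [p / total for p in clipped]


def wpl_update(policy, q_values, lr=0.01, eps=1e-3):
    """One WPL-style policy step for a single agent (illustrative sketch)."""
    # Expected reward of the current mixed policy under the agent's own estimates.
    value = sum(p * q for p, q in zip(policy, q_values))
    new_policy = []
    for p, q in zip(policy, q_values):
        grad = q - value  # estimated gradient for this action
        # WPL weighting: pi(a) if the gradient is negative, else 1 - pi(a).
        weight = p if grad < 0 else 1.0 - p
        new_policy.append(p + lr * grad * weight)
    return project(new_policy, eps)


def self_play(steps=1000, lr=0.01, alpha=0.1, seed=0):
    """Two oblivious agents playing matching pennies (hypothetical demo)."""
    rng = random.Random(seed)
    # Row player's payoff; the column player receives the negative.
    payoff = [[1.0, -1.0], [-1.0, 1.0]]
    policies = [[0.8, 0.2], [0.3, 0.7]]
    q = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(steps):
        a0 = rng.choices([0, 1], weights=policies[0])[0]
        a1 = rng.choices([0, 1], weights=policies[1])[0]
        rewards = [payoff[a0][a1], -payoff[a0][a1]]
        for i, (a, r) in enumerate(zip((a0, a1), rewards)):
            # Each agent sees only its own action and its own reward.
            q[i][a] += alpha * (r - q[i][a])
            policies[i] = wpl_update(policies[i], q[i], lr)
    return policies
```

Calling `self_play()` returns the two agents' mixed policies after the given number of steps. Neither agent ever reads the other's action or reward, which mirrors the obliviousness assumption in the definition. The sketch makes no claim about how quickly, or whether, these particular settings reach the equilibrium.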


