2008 OnlineMultiagentLearningAgainst

From GM-RKB

Jump to navigation Jump to search

(Chakraborty & Stone, 2008) ⇒ Doran Chakraborty, and Peter Stone. (2008). “Online Multiagent Learning Against Memory Bounded Adversaries.” In: Proceedings of the 2008th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I. ISBN:3-540-87478-X, 978-3-540-87478-2 doi:10.1007/978-3-540-87479-9_32

Subject Headings: Multi-Agent Learning Algorithm, LoE-AIM Algorithm.

Notes

Cited By

Quotes

Keywords

Nash Equilibrium, Multiagent System, Markov Decision Process, Targetable Pair, International Joint Conference.

Abstract

The traditional agenda in Multiagent Learning (MAL) has been to develop learners that guarantee convergence to an equilibrium in self-play or that converge to playing the best response against an opponent using one of a fixed set of known targeted strategies. This paper introduces an algorithm called Learn or Exploit for Adversary Induced Markov Decision Process (LoE-AIM) that targets optimality against any learning opponent that can be treated as a memory bounded adversary. LoE-AIM makes no prior assumptions about the opponent and is tailored to optimally exploit any adversary which induces a Markov decision process in the state space of joint histories. LoE-AIM either explores and gathers new information about the opponent or converges to the best response to the partially learned opponent strategy in repeated play. We further extend LoE-AIM to account for online repeated interactions against the same adversary with plays against other adversaries interleaved in between. LoE-AIM-repeated stores learned knowledge about an adversary, identifies the adversary in case of repeated interaction, and reuses the stored knowledge about the behavior of the adversary to enhance learning in the current epoch of play. LoE-AIM and LoE-AIM-repeated are fully implemented, with results demonstrating their superiority over other existing MAL algorithms.

References

;

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2008 OnlineMultiagentLearningAgainst	Peter Stone Doran Chakraborty			Online Multiagent Learning Against Memory Bounded Adversaries				10.1007/978-3-540-87479-9_32		2008

Retrieved from "http://www.gabormelli.com/RKB/index.php?title=2008_OnlineMultiagentLearningAgainst&oldid=845177"

Facts

... more about "2008 OnlineMultiagentLearningAgainst"

Doran Chakraborty + and Peter Stone +

10.1007/978-3-540-87479-9_32 +

Online Multiagent Learning Against Memory Bounded Adversaries +

2008 +