Sparse cooperative Q-learning: extended abstract
Jelle R. Kok and Nikos Vlassis. Sparse cooperative Q-learning: extended abstract. In Proceedings of the 16th Belgian-Dutch Conference on Artifical Intelligence, pp. 361–362, Groningen, The Netherlands, October 2004.
Download
Abstract
Learning in multiagent systems suffers from the fact that both the state and the action space scale exponentially with the number of agents. In this paper we are interested in using Q-learning to learn the coordinated actions of a group of cooperative agents, using a sparse representation of the joint state-action space of the agents. We first examine a compact representation in which the agents need to explicitly coordinate their actions only in a predefined set of states. Next, we use a coordination-graph approach in which we represent the Q-values by value rules that specify the coordination dependencies of the agents at particular states. We show how Q-learning can be efficiently applied to learn a coordinated policy for the agents in the above framework. We demonstrate the proposed method on the predator-prey domain, and we compare it with other related multiagent Q-learning methods.
BibTeX Entry
@InProceedings{Kok04bnaic, author = {Jelle R. Kok and Nikos Vlassis}, title = {Sparse cooperative {Q}-learning: extended abstract}, address = {Groningen, The Netherlands}, booktitle = {Proceedings of the 16th Belgian-Dutch Conference on Artifical Intelligence}, year = 2004, month = oct, pages = {361--362}, editor = {Rineke Verbrugge and Niels Taatgen and Lambert Schomaker}, postscript = {2004/Kok04bnaic.ps.gz}, pdf = {2004/Kok04bnaic.pdf}, abstract = { Learning in multiagent systems suffers from the fact that both the state and the action space scale exponentially with the number of agents. In this paper we are interested in using Q-learning to learn the coordinated actions of a group of cooperative agents, using a sparse representation of the joint state-action space of the agents. We first examine a compact representation in which the agents need to explicitly coordinate their actions only in a predefined set of states. Next, we use a coordination-graph approach in which we represent the Q-values by value rules that specify the coordination dependencies of the agents at particular states. We show how Q-learning can be efficiently applied to learn a coordinated policy for the agents in the above framework. We demonstrate the proposed method on the predator-prey domain, and we compare it with other related multiagent Q-learning methods. } }
Generated by bib2html.pl (written by Patrick Riley) on Tue Oct 31, 2006 19:33:42 UTC