Utile coordination: learning interdependencies among cooperative agents
Jelle R. Kok, Pieter Jan 't Hoen, Bram Bakker, and Nikos Vlassis. Utile coordination: learning interdependencies among cooperative agents. In Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG), pp. 29–36, Colchester, United Kingdom, April 2005.
Download
Abstract
We describe Utile Coordination, an algorithm that allows a multiagent system to learn where and how to coordinate. The method starts with uncoordinated learners and maintains statistics on expected returns. Coordination dependencies are dynamically added if the statistics indicate a statistically significant benefit. This results in a compact state representation because only necessary coordination is modeled. We apply our method within the framework of coordination graphs in which value rules represent the coordination dependencies between the agents for a specific context. The algorithm is first applied on a small illustrative problem, and next on a large predator-prey problem in which two predators have to capture a single prey.
BibTeX Entry
@InProceedings{Kok05cig, author = {Jelle R. Kok and Pieter Jan 't Hoen and Bram Bakker and Nikos Vlassis}, title = {Utile coordination: learning interdependencies among cooperative agents}, address = {Colchester, United Kingdom}, booktitle = {Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG)}, year = 2005, month = apr, pages = {29--36}, abstract = {We describe Utile Coordination, an algorithm that allows a multiagent system to learn where and how to coordinate. The method starts with uncoordinated learners and maintains statistics on expected returns. Coordination dependencies are dynamically added if the statistics indicate a statistically significant benefit. This results in a compact state representation because only necessary coordination is modeled. We apply our method within the framework of coordination graphs in which value rules represent the coordination dependencies between the agents for a specific context. The algorithm is first applied on a small illustrative problem, and next on a large predator-prey problem in which two predators have to capture a single prey.} }
Generated by bib2html.pl (written by Patrick Riley) on Tue Oct 31, 2006 19:33:42 UTC