jellekok.nl

Utile coordination: learning interdependencies among cooperative agents

Jelle R. Kok, Pieter Jan 't Hoen, Bram Bakker, and Nikos Vlassis. Utile coordination: learning interdependencies among cooperative agents. In Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG), pp. 29–36, Colchester, United Kingdom, April 2005.

Download

pdf [167.4kB]  ps.gz [142.4kB]  

Abstract

We describe Utile Coordination, an algorithm that allows a multiagent system to learn where and how to coordinate. The method starts with uncoordinated learners and maintains statistics on expected returns. Coordination dependencies are dynamically added if the statistics indicate a statistically significant benefit. This results in a compact state representation because only necessary coordination is modeled. We apply our method within the framework of coordination graphs in which value rules represent the coordination dependencies between the agents for a specific context. The algorithm is first applied on a small illustrative problem, and next on a large predator-prey problem in which two predators have to capture a single prey.

BibTeX Entry

@InProceedings{Kok05cig,
  author =       {Jelle R. Kok and Pieter Jan 't Hoen and Bram Bakker
                  and Nikos Vlassis},
  title =        {Utile coordination: learning interdependencies among
                  cooperative agents},
  address =      {Colchester, United Kingdom},
  booktitle =    {Proceedings of the IEEE Symposium on Computational
                  Intelligence and Games (CIG)},
  year =         2005,
  month =        apr,
  pages =        {29--36},
  abstract =     {We describe Utile Coordination, an algorithm that
                  allows a multiagent system to learn where and how to
                  coordinate. The method starts with uncoordinated
                  learners and maintains statistics on expected
                  returns. Coordination dependencies are dynamically
                  added if the statistics indicate a statistically
                  significant benefit. This results in a compact state
                  representation because only necessary coordination
                  is modeled. We apply our method within the framework
                  of coordination graphs in which value rules
                  represent the coordination dependencies between the
                  agents for a specific context. The algorithm is
                  first applied on a small illustrative problem, and
                  next on a large predator-prey problem in which two
                  predators have to capture a single prey.}
}

Generated by bib2html.pl (written by Patrick Riley) on Tue Oct 31, 2006 19:33:42 UTC