Authors
Matthijs TJ Spaan, Frans A Oliehoek, Christopher Amato
Publication date
2011/5/3
Conference
Multi-agent Sequential Decision Making in Uncertain Domains
Description
Planning under uncertainty for multiagent systems can be formalized as a decentralized partially observable Markov decision process. We advance the state of the art for optimal solution of this model, building on the Multiagent A* heuristic search method. A key insight is that we can avoid the full expansion of a search node that generates a number of children doubly exponential in the node’s depth. Instead we incrementally expand the children of a node only when a next child might have the highest heuristic value. We target a subsequent bottleneck by introducing a more memoryefficient representation for our heuristic functions. Proof is given that the resulting algorithm is correct and experiments demonstrate a significant speedup over the state of the art, allowing for optimal solutions over longer horizons for many benchmark problems.
Total citations
201120122013201420152016201720182019202020212022202341316335355122
Scholar articles
MTJ Spaan, FA Oliehoek, C Amato - Sixth Annual Workshop on Multiagent Sequential …, 2011