Faculty Bibliography 2010s

Optimizing coalition formation for tasks with dynamically evolving rewards and nondeterministic action effects

Authors

M. A. Khan; D. Turgut; L. Boloni

Abbreviated Journal Title

Auton. Agents Multi-Agent Syst.

Keywords

Optimization; Coalition formation; Agents; Factored MDPs; Automation & Control Systems; Computer Science, Artificial Intelligence

Abstract

We consider a problem domain where coalitions of agents are formed in order to execute tasks. Each task is assigned at most one coalition of agents, and the coalition can be reorganized during execution. Executing a task means bringing it to one of the desired terminal states, which might take several time steps. The state of the task evolves even if no coalition is assigned to its execution and depends nondeterministically on the cumulative actions of the agents in the coalition. Furthermore, we assume that the reward obtained for executing a task evolves in time: the more the execution of the task is delayed, the lower the reward. A representative example of this class of problems is the allocation of firefighters to fires in a disaster rescue environment. We describe a practical methodology through which a problem of this class can be encoded as a Markov Decision Process (MDP). Due to the three levels of factoring (the states, actions, and rewards are composites of the original features of the problem), the resulting MDP can be solved directly only for small problem instances. We describe two methods for parallel decomposition of the MDP: the MDP RSUA approach for random sampling and uniform allocation, and the MDP REUSE method, which reuses the lower-level MDP to allocate resources to the parallel subproblems. Through an experimental study that models the problem domain using the fire simulation components of the RoboCup Rescue simulator, we show that both methods significantly outperform heuristic approaches and that MDP REUSE provides higher overall performance than MDP RSUA.

Journal Title

Autonomous Agents and Multi-Agent Systems

Volume

Issue/Number

Publication Date

1-1-2011

Document Type

Article

DOI Link

http://dx.doi.org/10.1007/s10458-010-9134-5

Language

English

First Page

415

Last Page

438

WOS Identifier

WOS:000289680900003

ISSN

1387-2532

Recommended Citation

"Optimizing coalition formation for tasks with dynamically evolving rewards and nondeterministic action effects" (2011). Faculty Bibliography 2010s. 7070.
https://stars.library.ucf.edu/facultybib2010/7070
