Multi-agent Multi-target Path Planning in Markov Decision Processes

Nawaz, Farhad; Ornik, Melkior

doi:10.1109/TAC.2023.3286807



(Submitted on 31 May 2022 (v1), last revised 17 Jun 2023 (this version, v2))

Abstract: Missions for autonomous systems often require agents to visit multiple targets in complex operating conditions. This work considers the problem of visiting a set of targets in minimum time by a team of non-communicating agents in a Markov decision process (MDP). The single-agent problem is at least NP-complete, which follows from a reduction from the Hamiltonian path problem. We first discuss an optimal algorithm based on Bellman's optimality equation whose complexity is exponential in the number of target states. Then, we trade off optimality for time complexity by presenting a suboptimal algorithm that runs in polynomial time at each time step. We prove that the proposed algorithm generates optimal policies for certain classes of MDPs. Extending our procedure to the multi-agent case, we propose a target partitioning algorithm that approximately minimizes the expected time to visit the targets. We prove that our algorithm generates optimal partitions for clustered target scenarios. We evaluate our algorithms on random MDPs and on gridworld environments inspired by ocean dynamics. The results show that our algorithms are much faster than the optimal procedure and yield policies closer to optimal than the currently available heuristic.
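To illustrate why a Bellman-based optimal procedure scales exponentially with the number of targets, the following is a minimal sketch for the single-agent case. It is not the authors' implementation. It assumes a small tabular MDP given as nested lists `P[a][s][t]` and tracks the set of visited targets as a bitmask, so the augmented state space has `n * 2^k` entries for `n` states and `k` targets. The function name, its parameters, and the value-iteration stopping rule are all hypothetical choices made for this example.

```python
from typing import List, Sequence


def min_expected_visit_time(
    P: Sequence[Sequence[Sequence[float]]],
    targets: List[int],
    start: int,
    sweeps: int = 10_000,
    tol: float = 1e-10,
) -> float:
    """Minimum expected number of steps to visit every target at least once.

    P[a][s][t] is the probability of moving from state s to state t under
    action a (assumed layout). The agent is assumed to be able to reach all
    targets, otherwise the values grow without bound.
    """
    n = len(P[0])
    bit = {t: 1 << i for i, t in enumerate(targets)}
    full = (1 << len(targets)) - 1  # mask with every target visited

    def mark(mask: int, s: int) -> int:
        # Record a visit to s if s is a target; otherwise leave mask unchanged.
        return mask | bit.get(s, 0)

    # V[mask][s]: expected remaining steps from state s with visited set mask.
    # The table has n * 2^k entries, which is the exponential cost.
    V = [[0.0] * n for _ in range(full + 1)]

    for _ in range(sweeps):
        delta = 0.0
        for mask in range(full):  # mask == full is terminal with value 0
            for s in range(n):
                # Bellman optimality update with a unit cost per step.
                best = min(
                    1.0 + sum(
                        p * V[mark(mask, t)][t]
                        for t, p in enumerate(P_a[s])
                        if p > 0.0
                    )
                    for P_a in P
                )
                delta = max(delta, abs(best - V[mask][s]))
                V[mask][s] = best
        if delta < tol:
            break

    return V[mark(0, start)][start]


# Deterministic 4-state line: action 0 moves left, action 1 moves right.
LINE = [
    [[1, 0, 0, 0], [1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]],
    [[0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1], [0, 0, 0, 1]],
]
print(min_expected_visit_time(LINE, targets=[0, 3], start=1))
```

In this toy example the agent starts at state 1 and must visit both ends of the line; visiting the nearer end first gives the shorter tour. Each additional target doubles the number of masks, which is the motivation for the polynomial-time suboptimal procedure described in the abstract.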

Subjects:	Optimization and Control (math.OC); Systems and Control (eess.SY)
Journal reference:	IEEE Transactions on Automatic Control, VOL. 69, NO. 04, 2024 (tentative)
DOI:	10.1109/TAC.2023.3286807
Cite as:	arXiv:2205.15841 [math.OC]
	(or arXiv:2205.15841v2 [math.OC] for this version)

Submission history

From: Farhad Nawaz Savvas Sadiq Ali
[v1] Tue, 31 May 2022 14:44:21 GMT (3169kb,D)
[v2] Sat, 17 Jun 2023 16:08:29 GMT (3633kb,D)
