Federated Control in Markov Decision Processes

Jin, Hao; Peng, Yang; Zhang, Liangyu; Zhang, Zhihua

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 2405

Statistics > Machine Learning

Title: Federated Control in Markov Decision Processes

Authors: Hao Jin, Yang Peng, Liangyu Zhang, Zhihua Zhang

(Submitted on 7 May 2024)

Abstract: We study problems of federated control in Markov Decision Processes. To solve an MDP with large state space, multiple learning agents are introduced to collaboratively learn its optimal policy without communication of locally collected experience. In our settings, these agents have limited capabilities, which means they are restricted within different regions of the overall state space during the training process. In face of the difference among restricted regions, we firstly introduce concepts of leakage probabilities to understand how such heterogeneity affects the learning process, and then propose a novel communication protocol that we call Federated-Q protocol (FedQ), which periodically aggregates agents' knowledge of their restricted regions and accordingly modifies their learning problems for further training. In terms of theoretical analysis, we justify the correctness of FedQ as a communication protocol, then give a general result on sample complexity of derived algorithms FedQ-X with the RL oracle , and finally conduct a thorough study on the sample complexity of FedQ-SynQ. Specifically, FedQ-X has been shown to enjoy linear speedup in terms of sample complexity when workload is uniformly distributed among agents. Moreover, we carry out experiments in various environments to justify the efficiency of our methods.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2405.04026 [stat.ML]
	(or arXiv:2405.04026v1 [stat.ML] for this version)

Submission history

From: Hao Jin [view email]
[v1] Tue, 7 May 2024 05:59:10 GMT (3046kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2405.04026

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Federated Control in Markov Decision Processes

Submission history