Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

Zaki, Mohammani; Mohan, Avi; Gopalan, Aditya; Mannor, Shie

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2105

Change to browse by:

Computer Science > Machine Learning

Title: Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

Authors: Mohammani Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor

(Submitted on 1 May 2021)

Abstract: We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay. Modern communication systems are becoming increasingly complex, and are required to handle multiple types of traffic with widely varying characteristics such as arrival rates and service times. This, coupled with the need for rapid network deployment, render a bottom up approach of first characterizing the traffic and then devising an appropriate scheduling protocol infeasible.
In contrast, we formulate a top down approach to scheduling where, given an unknown network and a set of scheduling policies, we use a policy gradient based reinforcement learning algorithm that produces a scheduler that performs better than the available atomic policies. We derive convergence results and analyze finite time performance of the algorithm. Simulation results show that the algorithm performs well even when the arrival rates are nonstationary and can stabilize the system even when the constituent policies are unstable.

Comments:	4 pages, 5 figures, RLNQ workshop at the SIGMETRICS 2021
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2105.00210 [cs.LG]
	(or arXiv:2105.00210v1 [cs.LG] for this version)

Submission history

From: Avi Mohan [view email]
[v1] Sat, 1 May 2021 10:18:34 GMT (705kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2105.00210

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

Submission history