We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks

Abstract: Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologies, has promoted the design of Congestion Control (CC) algorithms that perform well under specific environments. The challenge of designing a generic CC algorithm that can adapt to a broad range of scenarios is still an open research question. To tackle this challenge, we propose to apply a novel Reinforcement Learning (RL) approach. Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return and models the learning process as an infinite-horizon task. We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch that researchers have encountered when applying RL to CC. We evaluated our solution on the task of file transfer and compared it to TCP Cubic. While further research is required, results have shown that MARLIN can achieve comparable results to TCP with little hyperparameter tuning, in a task significantly different from its training setting. Therefore, we believe that our work represents a promising first step toward building CC algorithms based on the maximum entropy RL framework.
Comments: 10 pages, 5 figures, AAAI 2023 workshop "Reinforcement Learning Ready for Production", accepted at NOMS 2023 - IEEE/IFIP Network Operations and Management Symposium
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI)
DOI: 10.1109/NOMS56928.2023.10154210
Cite as: arXiv:2302.01301 [cs.LG]
  (or arXiv:2302.01301v1 [cs.LG] for this version)

Submission history

From: Raffaele Galliera [view email]
[v1] Thu, 2 Feb 2023 18:27:20 GMT (1533kb)

Link back to: arXiv, form interface, contact.