Dynamic Spectrum Access using Stochastic Multi-User Bandits

Bande, Meghana; Magesh, Akshayaa; Veeravalli, Venugopal V.

(Submitted on 12 Jan 2021)

Abstract: A stochastic multi-user multi-armed bandit framework is used to develop algorithms for uncoordinated spectrum access. In contrast to prior work, it is assumed that rewards can be non-zero even under collisions, which allows the number of users to exceed the number of channels. The proposed algorithm consists of an estimation phase and an allocation phase. It is shown that if every user adopts the algorithm, the system-wide regret is order-optimal, of order $O(\log T)$ over a time horizon of duration $T$. The regret guarantees hold both when the number of users is greater than the number of channels and when it is smaller. The algorithm is extended to the dynamic case, where the number of users in the system evolves over time, and is shown to achieve sub-linear regret.
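To make the notion of system-wide regret with non-zero collision rewards concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes a hypothetical reward table `mu`, where `mu[k][n-1]` is the mean per-user reward on channel `k` when `n` users share it; the abstract does not specify this form. The function names and the brute-force search for the best allocation are illustrative choices only.

```python
import itertools
from collections import Counter


def system_reward(assignment, mu):
    """Expected total reward of one round.

    assignment[i] is the channel chosen by user i.
    mu[k][n-1] is the (assumed) mean per-user reward on channel k
    when n users occupy it, so collisions can still pay something.
    """
    counts = Counter(assignment)
    return sum(mu[k][counts[k] - 1] for k in assignment)


def best_assignment(num_users, mu):
    """Brute-force the allocation that maximizes the expected system reward."""
    channels = range(len(mu))
    return max(
        itertools.product(channels, repeat=num_users),
        key=lambda a: system_reward(a, mu),
    )


def cumulative_regret(assignments, mu):
    """Sum over rounds of (optimal system reward - achieved system reward)."""
    num_users = len(assignments[0])
    optimum = system_reward(best_assignment(num_users, mu), mu)
    return sum(optimum - system_reward(a, mu) for a in assignments)


if __name__ == "__main__":
    # Hypothetical example: 2 channels, 3 users (more users than channels).
    mu = [[0.9, 0.5, 0.2], [0.6, 0.3, 0.1]]
    played = [(0, 0, 0), (0, 0, 1), (1, 1, 0)]
    print("best allocation:", best_assignment(3, mu))
    print("regret of played sequence:", cumulative_regret(played, mu))
```

In this toy setting the best allocation places two users on one channel, which is only possible to reason about because shared channels still yield reward. A learning algorithm would not know `mu` in advance and would have to estimate it, which is the role the abstract assigns to the estimation phase.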

Subjects: Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as: arXiv:2101.04388 [cs.IT] (or arXiv:2101.04388v1 [cs.IT] for this version)

Submission history

From: Akshayaa Magesh
[v1] Tue, 12 Jan 2021 10:29:57 GMT (349kb)
