Dueling Bandits with Team Comparisons

Cohen, Lee; Schmidt-Kraepelin, Ulrike; Mansour, Yishay

(Submitted on 6 Jul 2021)

Abstract: We introduce the dueling teams problem, a new online-learning setting in which the learner observes noisy comparisons of disjoint pairs of $k$-sized teams from a universe of $n$ players. The goal of the learner is to minimize the number of duels required to identify, with high probability, a Condorcet winning team, i.e., a team which wins against any other disjoint team (with probability at least $1/2$). Noisy comparisons are linked to a total order on the teams. We formalize our model by building upon the dueling bandits setting (Yue et al., 2012) and provide several algorithms, both for stochastic and deterministic settings. For the stochastic setting, we provide a reduction to the classical dueling bandits setting, yielding an algorithm that identifies a Condorcet winning team within $\mathcal{O}((n + k \log (k)) \frac{\max(\log\log n, \log k)}{\Delta^2})$ duels, where $\Delta$ is a gap parameter. For deterministic feedback, we additionally present a gap-independent algorithm that identifies a Condorcet winning team within $\mathcal{O}(nk\log(k)+k^5)$ duels.
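
To make the feedback model concrete, the following is a minimal sketch of a simulated duel between two disjoint teams. It is not the authors' algorithm or their exact model. It assumes, purely for illustration, that each player has a hidden numeric value, that a team's strength is the sum of its players' values, and that the stronger team wins each duel with probability $1/2 + \Delta$. The names `make_duel_oracle`, `estimate_win_rate`, `values`, and `gap` are hypothetical.

```python
import random


def make_duel_oracle(values, gap, rng):
    """Build a noisy duel function over players 0..n-1.

    Assumptions (not from the paper): team strength is the sum of the
    hypothetical per-player `values`, and the stronger team wins a duel
    with probability 1/2 + gap. Setting gap = 0.5 gives deterministic feedback.
    """
    def duel(team_a, team_b):
        # The setting compares disjoint teams of the same size k.
        if len(team_a) != len(team_b):
            raise ValueError("teams must have the same size k")
        if set(team_a) & set(team_b):
            raise ValueError("teams must be disjoint")
        strength_a = sum(values[p] for p in team_a)
        strength_b = sum(values[p] for p in team_b)
        if strength_a == strength_b:
            p_a_wins = 0.5
        elif strength_a > strength_b:
            p_a_wins = 0.5 + gap
        else:
            p_a_wins = 0.5 - gap
        # True means team_a won this single noisy duel.
        return rng.random() < p_a_wins

    return duel


def estimate_win_rate(duel, team_a, team_b, trials):
    """Empirical fraction of duels won by team_a over `trials` repetitions."""
    wins = sum(duel(team_a, team_b) for _ in range(trials))
    return wins / trials


if __name__ == "__main__":
    rng = random.Random(0)
    values = [0.9, 0.8, 0.3, 0.2]  # hypothetical hidden player values
    duel = make_duel_oracle(values, gap=0.1, rng=rng)
    # Estimate how often the team {0, 1} beats the disjoint team {2, 3}.
    print(estimate_win_rate(duel, (0, 1), (2, 3), trials=2000))
```

Repeating a duel in this way shows why a smaller gap $\Delta$ demands more comparisons to separate two teams reliably, which is consistent with the $1/\Delta^2$ factor in the stochastic bound stated in the abstract.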

Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
Cite as:	arXiv:2107.02738 [cs.LG]
	(or arXiv:2107.02738v1 [cs.LG] for this version)

Submission history

From: Lee Cohen
[v1] Tue, 6 Jul 2021 17:12:17 GMT (912kb)
