Learning Multiclass Classifier Under Noisy Bandit Feedback

Agarwal, Mudit; Manwani, Naresh

doi:10.1007/978-3-030-75765-6_36

Computer Science > Machine Learning

Title: Learning Multiclass Classifier Under Noisy Bandit Feedback

Authors: Mudit Agarwal, Naresh Manwani

(Submitted on 5 Jun 2020 (v1), last revised 3 Mar 2021 (this version, v2))

Abstract: This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback based on the unbiased estimator technique. We further offer a method that can efficiently estimate the noise rates, thus providing an end-to-end framework. The proposed algorithm enjoys a mistake bound of the order of $O(\sqrt{T})$ in the high noise case and of the order of $O(T^{2/3})$ in the worst case. We show our approach's effectiveness using extensive experiments on several benchmark datasets.
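
The unbiased estimator idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a simplified noise model in which a single binary feedback bit is flipped with a known, symmetric probability `rho < 0.5`, and the names `debias_feedback`, `flip`, and `mean_debiased` are hypothetical. Under that assumption, the expected noisy bit is `rho + f * (1 - 2 * rho)` for true bit `f`, so subtracting `rho` and dividing by `1 - 2 * rho` gives an estimate whose expectation equals `f`.

```python
import random


def debias_feedback(noisy_bit: int, rho: float) -> float:
    """Unbiased estimate of the true 0/1 feedback.

    Assumes (for illustration only) that the true bit was flipped
    with a known symmetric probability rho in [0, 0.5).
    """
    if not 0.0 <= rho < 0.5:
        raise ValueError("rho must lie in [0, 0.5)")
    return (noisy_bit - rho) / (1.0 - 2.0 * rho)


def flip(true_bit: int, rho: float, rng: random.Random) -> int:
    """Return the bit flipped with probability rho, otherwise unchanged."""
    return 1 - true_bit if rng.random() < rho else true_bit


def mean_debiased(true_bit: int, rho: float, n: int, seed: int = 0) -> float:
    """Average the debiased estimate over n simulated noisy observations."""
    rng = random.Random(seed)
    total = sum(debias_feedback(flip(true_bit, rho, rng), rho) for _ in range(n))
    return total / n
```

Averaging many debiased observations, for example `mean_debiased(1, 0.2, 200_000)`, should land close to the true bit. Individual estimates fall outside `[0, 1]`, which is the usual price of unbiasedness: the variance grows as `rho` approaches 0.5.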

Comments:	17 pages, 6 figures, 1 table
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Journal reference:	Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2021
DOI:	10.1007/978-3-030-75765-6_36
Cite as:	arXiv:2006.03545 [cs.LG]
	(or arXiv:2006.03545v2 [cs.LG] for this version)

Submission history

From: Mudit Agarwal [view email]
[v1] Fri, 5 Jun 2020 16:31:05 GMT (1245kb,D)
[v2] Wed, 3 Mar 2021 16:56:12 GMT (1903kb,D)
