We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Toward Optimal Probabilistic Active Learning Using a Bayesian Approach

Abstract: Gathering labeled data to train well-performing machine learning models is one of the critical challenges in many applications. Active learning aims at reducing the labeling costs by an efficient and effective allocation of costly labeling resources. In this article, we propose a decision-theoretic selection strategy that (1) directly optimizes the gain in misclassification error, and (2) uses a Bayesian approach by introducing a conjugate prior distribution to determine the class posterior to deal with uncertainties. By reformulating existing selection strategies within our proposed model, we can explain which aspects are not covered in current state-of-the-art and why this leads to the superior performance of our approach. Extensive experiments on a large variety of datasets and different kernels validate our claims.
Comments: 11 pages, 8 figures, appendix
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
MSC classes: 68T05
ACM classes: I.2.6
Cite as: arXiv:2006.01732 [cs.LG]
  (or arXiv:2006.01732v1 [cs.LG] for this version)

Submission history

From: Daniel Kottke [view email]
[v1] Tue, 2 Jun 2020 15:59:42 GMT (1362kb,D)

Link back to: arXiv, form interface, contact.