We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: A New Perspective on Pool-Based Active Classification and False-Discovery Control

Abstract: In many scientific settings there is a need for adaptive experimental design to guide the process of identifying regions of the search space that contain as many true positives as possible subject to a low rate of false discoveries (i.e. false alarms). Such regions of the search space could differ drastically from a predicted set that minimizes 0/1 error and accurate identification could require very different sampling strategies. Like active learning for binary classification, this experimental design cannot be optimally chosen a priori, but rather the data must be taken sequentially and adaptively. However, unlike classification with 0/1 error, collecting data adaptively to find a set with high true positive rate and low false discovery rate (FDR) is not as well understood. In this paper we provide the first provably sample efficient adaptive algorithm for this problem. Along the way we highlight connections between classification, combinatorial bandits, and FDR control making contributions to each.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Published at Neurips 2019
Cite as: arXiv:2008.06555 [stat.ML]
  (or arXiv:2008.06555v1 [stat.ML] for this version)

Submission history

From: Lalit Jain [view email]
[v1] Fri, 14 Aug 2020 19:49:19 GMT (1203kb,D)

Link back to: arXiv, form interface, contact.