We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: A User-Guided Bayesian Framework for Ensemble Feature Selection in Life Science Applications (UBayFS)

Abstract: Feature selection represents a measure to reduce the complexity of high-dimensional datasets and gain insights into the systematic variation in the data. This aspect is of specific importance in domains that rely on model interpretability, such as life sciences. We propose UBayFS, an ensemble feature selection technique embedded in a Bayesian statistical framework. Our approach considers two sources of information: data and domain knowledge. We build a meta-model from an ensemble of elementary feature selectors and aggregate this information in a multinomial likelihood. The user guides UBayFS by weighting features and penalizing specific feature blocks or combinations, implemented via a Dirichlet-type prior distribution and a regularization term. In a quantitative evaluation, we demonstrate that our framework (a) allows for a balanced trade-off between user knowledge and data observations, and (b) achieves competitive performance with state-of-the-art methods.
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
Journal reference: Machine Learning (2022)
DOI: 10.1007/s10994-022-06221-9
Cite as: arXiv:2104.14787 [cs.LG]
  (or arXiv:2104.14787v3 [cs.LG] for this version)

Submission history

From: Stefan Schrunner [view email]
[v1] Fri, 30 Apr 2021 06:51:33 GMT (367kb,D)
[v2] Fri, 28 May 2021 16:21:17 GMT (519kb,D)
[v3] Sat, 11 Dec 2021 09:59:13 GMT (596kb,D)

Link back to: arXiv, form interface, contact.