We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Pruning variable selection ensembles

Abstract: In the context of variable selection, ensemble learning has gained increasing interest due to its great potential to improve selection accuracy and to reduce false discovery rate. A novel ordering-based selective ensemble learning strategy is designed in this paper to obtain smaller but more accurate ensembles. In particular, a greedy sorting strategy is proposed to rearrange the order by which the members are included into the integration process. Through stopping the fusion process early, a smaller subensemble with higher selection accuracy can be obtained. More importantly, the sequential inclusion criterion reveals the fundamental strength-diversity trade-off among ensemble members. By taking stability selection (abbreviated as StabSel) as an example, some experiments are conducted with both simulated and real-world data to examine the performance of the novel algorithm. Experimental results demonstrate that pruned StabSel generally achieves higher selection accuracy and lower false discovery rates than StabSel and several other benchmark methods.
Comments: 29 pages, 2 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
MSC classes: 62J07, 62F07
Cite as: arXiv:1704.08265 [stat.ML]
  (or arXiv:1704.08265v1 [stat.ML] for this version)

Submission history

From: Chunxia Zhang [view email]
[v1] Wed, 26 Apr 2017 18:01:10 GMT (45kb)

Link back to: arXiv, form interface, contact.