We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Learning Hierarchical Interactions at Scale: A Convex Optimization Approach

Abstract: In many learning settings, it is beneficial to augment the main features with pairwise interactions. Such interaction models can be often enhanced by performing variable selection under the so-called strong hierarchy constraint: an interaction is non-zero only if its associated main features are non-zero. Existing convex optimization based algorithms face difficulties in handling problems where the number of main features $p \sim 10^3$ (with total number of features $\sim p^2$). In this paper, we study a convex relaxation which enforces strong hierarchy and develop a highly scalable algorithm based on proximal gradient descent. We introduce novel screening rules that allow for solving the complicated proximal problem in parallel. In addition, we introduce a specialized active-set strategy with gradient screening for avoiding costly gradient computations. The framework can handle problems having dense design matrices, with $p = 50,000$ ($\sim 10^9$ interactions)---instances that are much larger than current state of the art. Experiments on real and synthetic data suggest that our toolkit hierScale outperforms the state of the art in terms of prediction and variable selection and can achieve over a 4900x speed-up.
Comments: AISTATS 2020
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); Computation (stat.CO)
Cite as: arXiv:1902.01542 [stat.ML]
  (or arXiv:1902.01542v5 [stat.ML] for this version)

Submission history

From: Hussein Hazimeh [view email]
[v1] Tue, 5 Feb 2019 04:47:54 GMT (115kb,D)
[v2] Wed, 6 Feb 2019 23:48:45 GMT (105kb,D)
[v3] Sun, 26 May 2019 03:29:03 GMT (86kb,D)
[v4] Sat, 11 Jul 2020 00:54:39 GMT (247kb,D)
[v5] Tue, 14 Jul 2020 01:13:02 GMT (247kb,D)

Link back to: arXiv, form interface, contact.