Learning Feature Nonlinearities with Non-Convex Regularized Binned Regression

Oymak, Samet; Mahdavi, Mehrdad; Chen, Jiasi

(Submitted on 20 May 2017)

Abstract: For various applications, the relations between the dependent and independent variables are highly nonlinear. Consequently, for large-scale complex problems, neural networks and regression trees are commonly preferred over linear models such as Lasso. This work proposes learning the feature nonlinearities by binning feature values and finding the best fit in each quantile using non-convex regularized linear regression. The algorithm first captures the dependence between neighboring quantiles by enforcing smoothness via piecewise-constant/linear approximation and then selects a sparse subset of good features. We prove that the proposed algorithm is statistically and computationally efficient. In particular, it achieves a linear rate of convergence while requiring a near-minimal number of samples. Evaluations on synthetic and real datasets demonstrate that the algorithm is competitive with the current state of the art and accurately learns feature nonlinearities. Finally, we explore an interesting connection between the binning stage of our algorithm and sparse Johnson-Lindenstrauss matrices.
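
To make the binning idea in the abstract concrete, the following is a minimal sketch and not the authors' algorithm: it covers only the binning stage for a single feature, cutting the values at empirical quantiles and fitting one constant per bin. For one-hot bin indicators, the per-bin mean is the ordinary least-squares fit. The smoothness constraint between neighboring quantiles, the non-convex regularization, and the sparse feature selection described in the abstract are all omitted. The function names, the choice of 20 bins, and the synthetic sine data are assumptions made purely for illustration.

```python
import bisect
import math
import random
import statistics


def quantile_edges(values, n_bins):
    """Return the n_bins - 1 interior cut points at empirical quantiles."""
    return statistics.quantiles(values, n=n_bins, method="inclusive")


def bin_index(edges, v):
    """Map a value to its bin index in 0 .. len(edges)."""
    return bisect.bisect_right(edges, v)


def fit_binned_means(xs, ys, n_bins):
    """Fit a piecewise-constant function: one level per quantile bin.

    With one-hot bin indicators as features, the least-squares
    coefficient of each bin is the mean response inside that bin.
    """
    edges = quantile_edges(xs, n_bins)
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for x, y in zip(xs, ys):
        k = bin_index(edges, x)
        sums[k] += y
        counts[k] += 1
    # Empty bins (not expected with quantile cuts) fall back to 0.0.
    levels = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    return edges, levels


def predict(edges, levels, x):
    """Evaluate the fitted piecewise-constant function at x."""
    return levels[bin_index(edges, x)]


# Synthetic example (assumed data): a nonlinear response y = sin(x) + noise.
random.seed(0)
xs = [random.uniform(-3.0, 3.0) for _ in range(2000)]
ys = [math.sin(x) + random.gauss(0.0, 0.1) for x in xs]

edges, levels = fit_binned_means(xs, ys, n_bins=20)
mse = sum((predict(edges, levels, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)
print(f"training MSE of the piecewise-constant fit: {mse:.4f}")
```

Even this unregularized version recovers the shape of a nonlinear feature response from a purely linear fit over bin indicators, which is the intuition the abstract builds on.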

Comments:	22 pages, 7 figures
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1705.07256 [cs.LG]
	(or arXiv:1705.07256v1 [cs.LG] for this version)

Submission history

From: Samet Oymak
[v1] Sat, 20 May 2017 03:46:32 GMT (411kb,D)
