We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Gradient Boosting With Piece-Wise Linear Regression Trees

Abstract: Gradient Boosted Decision Trees (GBDT) is a very successful ensemble learning algorithm widely used across a variety of applications. Recently, several variants of GBDT training algorithms and implementations have been designed and heavily optimized in some very popular open sourced toolkits including XGBoost, LightGBM and CatBoost. In this paper, we show that both the accuracy and efficiency of GBDT can be further enhanced by using more complex base learners. Specifically, we extend gradient boosting to use piecewise linear regression trees (PL Trees), instead of piecewise constant regression trees, as base learners. We show that PL Trees can accelerate convergence of GBDT and improve the accuracy. We also propose some optimization tricks to substantially reduce the training time of PL Trees, with little sacrifice of accuracy. Moreover, we propose several implementation techniques to speedup our algorithm on modern computer architectures with powerful Single Instruction Multiple Data (SIMD) parallelism. The experimental results show that GBDT with PL Trees can provide very competitive testing accuracy with comparable or less training time.
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:1802.05640 [cs.LG]
  (or arXiv:1802.05640v3 [cs.LG] for this version)

Submission history

From: Yu Shi [view email]
[v1] Thu, 15 Feb 2018 16:26:35 GMT (234kb,D)
[v2] Wed, 4 Apr 2018 12:34:04 GMT (233kb,D)
[v3] Tue, 25 Jun 2019 18:17:03 GMT (3462kb,D)

Link back to: arXiv, form interface, contact.