References & Citations
Statistics > Machine Learning
Title: Additive Models with Trend Filtering
(Submitted on 16 Feb 2017 (v1), last revised 22 Nov 2018 (this version, v4))
Abstract: We study additive models built with trend filtering, i.e., additive models whose components are each regularized by the (discrete) total variation of their $k$th (discrete) derivative, for a chosen integer $k \geq 0$. This results in $k$th degree piecewise polynomial components, (e.g., $k=0$ gives piecewise constant components, $k=1$ gives piecewise linear, $k=2$ gives piecewise quadratic, etc.). Analogous to its advantages in the univariate case, additive trend filtering has favorable theoretical and computational properties, thanks in large part to the localized nature of the (discrete) total variation regularizer that it uses. On the theory side, we derive fast error rates for additive trend filtering estimates, and show these rates are minimax optimal when the underlying function is additive and has component functions whose derivatives are of bounded variation. We also show that these rates are unattainable by additive smoothing splines (and by additive models built from linear smoothers, in general). On the computational side, as per the standard in additive models, backfitting is an appealing method for optimization, but it is particularly appealing for additive trend filtering because we can leverage a few highly efficient univariate trend filtering solvers. Going one step further, we describe a new backfitting algorithm whose iterations can be run in parallel, which (as far as we know) is the first of its kind. Lastly, we present experiments to examine the empirical performance of additive trend filtering.
Submission history
From: Veeranjaneyulu Sadhanala [view email][v1] Thu, 16 Feb 2017 16:19:56 GMT (538kb,D)
[v2] Fri, 24 Mar 2017 05:06:18 GMT (545kb,D)
[v3] Sat, 28 Apr 2018 09:10:41 GMT (566kb,D)
[v4] Thu, 22 Nov 2018 03:16:44 GMT (550kb,D)
Link back to: arXiv, form interface, contact.