Statistics > Machine Learning
Title: Data-driven confidence bands for distributed nonparametric regression
(Submitted on 13 Dec 2019 (v1), last revised 8 Jun 2020 (this version, v2))
Abstract: Gaussian Process Regression and Kernel Ridge Regression are popular nonparametric regression approaches. Unfortunately, they suffer from high computational complexity, rendering them inapplicable to modern massive datasets. To address this, a number of approximations have been suggested, some of them allowing for a distributed implementation. One of them is the divide-and-conquer approach, which splits the data into a number of partitions, obtains local estimates, and finally averages them. In this paper we suggest a novel, computationally efficient, fully data-driven algorithm that quantifies the uncertainty of this method and yields frequentist $L_2$-confidence bands. We rigorously demonstrate the validity of the algorithm. Another contribution of the paper is a minimax-optimal high-probability bound for the averaged estimator, complementing and generalizing the known risk bounds.
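The divide-and-conquer estimator described in the abstract (partition the data, fit locally, average) can be illustrated with a minimal NumPy sketch. This is not the paper's code and does not implement its confidence bands. The Gaussian kernel, the `lengthscale` and `lam` values, the synthetic sine data, and the helper names `rbf_kernel`, `krr_fit_predict` and `divide_and_conquer_krr` are all assumptions chosen for illustration.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.2):
    # Gaussian (RBF) kernel matrix between 1-D input arrays a and b.
    # The kernel choice and lengthscale are illustrative assumptions.
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def krr_fit_predict(x_train, y_train, x_test, lam=1e-3):
    # Kernel ridge regression on one partition:
    # solve (K + n * lam * I) alpha = y, then predict with K_test @ alpha.
    n = x_train.shape[0]
    K = rbf_kernel(x_train, x_train)
    alpha = np.linalg.solve(K + n * lam * np.eye(n), y_train)
    return rbf_kernel(x_test, x_train) @ alpha

def divide_and_conquer_krr(x, y, x_test, n_partitions, rng):
    # Randomly split the sample into disjoint partitions,
    # fit a local estimate on each, and average the local predictions.
    idx = rng.permutation(x.shape[0])
    parts = np.array_split(idx, n_partitions)
    local = [krr_fit_predict(x[p], y[p], x_test) for p in parts]
    return np.mean(local, axis=0)

# Synthetic example (hypothetical data): noisy observations of a sine curve.
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=2000)
y = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(2000)
x_test = np.linspace(0.05, 0.95, 50)
y_hat = divide_and_conquer_krr(x, y, x_test, n_partitions=10, rng=rng)
```

Each local fit solves a linear system of size n/P instead of n, so with P partitions the cubic cost of a direct solve drops from roughly n^3 to P * (n/P)^3, and the local fits can run on separate machines.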
Submission history
From: Valeriy Avanesov
[v1] Fri, 13 Dec 2019 20:13:55 GMT (82kb)
[v2] Mon, 8 Jun 2020 18:17:00 GMT (2932kb,D)