We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.NA

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Numerical Analysis

Title: Giga-scale Kernel Matrix Vector Multiplication on GPU

Abstract: Kernel matrix-vector multiplication (KMVM) is a foundational operation in machine learning and scientific computing. However, as KMVM tends to scale quadratically in both memory and time, applications are often limited by these computational constraints. In this paper, we propose a novel approximation procedure coined \textit{Faster-Fast and Free Memory Method} ($\fthreem$) to address these scaling issues of KMVM for tall~($10^8\sim 10^9$) and skinny~($D\leq7$) data. Extensive experiments demonstrate that $\fthreem$ has empirical \emph{linear time and memory} complexity with a relative error of order $10^{-3}$ and can compute a full KMVM for a billion points \emph{in under a minute} on a high-end GPU, leading to a significant speed-up in comparison to existing CPU methods. We demonstrate the utility of our procedure by applying it as a drop-in for the state-of-the-art GPU-based linear solver FALKON, \emph{improving speed 1.5-5.5 times} at the cost of $<1\%$ drop in accuracy. We further demonstrate competitive results on \emph{Gaussian Process regression} coupled with significant speedups on a variety of real-world datasets.
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Mathematical Software (cs.MS); Computation (stat.CO)
Cite as: arXiv:2202.01085 [math.NA]
  (or arXiv:2202.01085v3 [math.NA] for this version)

Submission history

From: Robert Hu [view email]
[v1] Wed, 2 Feb 2022 15:28:15 GMT (13525kb,D)
[v2] Wed, 25 May 2022 14:19:48 GMT (12513kb,D)
[v3] Wed, 12 Oct 2022 22:01:20 GMT (12539kb,D)

Link back to: arXiv, form interface, contact.