Linearized GMM Kernels and Normalized Random Fourier Features

Li, Ping

(Submitted on 18 May 2016 (v1), last revised 21 Feb 2017 (this version, v4))

Abstract: The method of "random Fourier features (RFF)" has become a popular tool for approximating the "radial basis function (RBF)" kernel. The variance of RFF is actually large. Interestingly, the variance can be substantially reduced by a simple normalization step, as we demonstrate theoretically. We call the improved scheme the "normalized RFF (NRFF)".
We also propose the "generalized min-max (GMM)" kernel as a measure of data similarity. GMM is positive definite as there is an associated hashing method named "generalized consistent weighted sampling (GCWS)" which linearizes this nonlinear kernel. We provide an extensive empirical evaluation of the RBF kernel and the GMM kernel on more than 50 publicly available datasets. For a majority of the datasets, the (tuning-free) GMM kernel outperforms the best-tuned RBF kernel.
We conduct extensive experiments to compare the linearized RBF kernel using NRFF with the linearized GMM kernel using GCWS. We observe that, to reach a comparable classification accuracy, GCWS typically requires substantially fewer samples than NRFF, even on datasets where the original RBF kernel outperforms the original GMM kernel. The empirical success of GCWS (compared to NRFF) can also be explained from a theoretical perspective. First, the relative variance (normalized by the squared expectation) of GCWS is substantially smaller than that of NRFF, except in the very high similarity region (where the variances of both methods are close to zero). Second, if we make a model assumption on the data, we can show analytically that GCWS exhibits much smaller variance than NRFF for estimating the same object (e.g., the RBF kernel), except in the very high similarity region.
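
The abstract names three quantities without giving formulas: the RFF estimate of the RBF kernel, its normalized variant (NRFF), and the GMM kernel. The Python sketch below is one way to make them concrete. It rests on assumptions that the abstract does not state: the RBF kernel is taken as exp(-gamma * ||u - v||^2 / 2), the random features as sqrt(2) * cos(w.x + tau) with Gaussian w and uniform tau, the normalization step as scaling each feature vector to unit length, and the GMM kernel as a sum of coordinate-wise minima over a sum of maxima after splitting every coordinate into its positive and negative parts. All function names are hypothetical, and GCWS hashing is not implemented here.

```python
import math
import random


def rbf_kernel(u, v, gamma):
    """Assumed RBF form: exp(-gamma * ||u - v||^2 / 2)."""
    return math.exp(-gamma * sum((p - q) ** 2 for p, q in zip(u, v)) / 2.0)


def rff_features(x, weights, offsets):
    """One feature sqrt(2) * cos(w.x + tau) per (w, tau) pair."""
    return [
        math.sqrt(2.0) * math.cos(sum(wi * xi for wi, xi in zip(w, x)) + t)
        for w, t in zip(weights, offsets)
    ]


def rff_estimate(a, b):
    """Plain RFF estimate: average of the feature products."""
    return sum(p * q for p, q in zip(a, b)) / len(a)


def nrff_estimate(a, b):
    """Normalized estimate (assumed form): cosine of the feature vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    return dot / math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))


def gmm_kernel(u, v):
    """Assumed GMM form: sum of mins over sum of maxes on split coordinates."""
    def split(x):
        # Each coordinate becomes (positive part, negative part), both >= 0.
        out = []
        for xi in x:
            out.extend((xi, 0.0) if xi > 0 else (0.0, -xi))
        return out

    us, vs = split(u), split(v)
    num = sum(min(p, q) for p, q in zip(us, vs))
    den = sum(max(p, q) for p, q in zip(us, vs))
    return num / den if den > 0 else 0.0


# Illustrative data only; these vectors are not taken from the paper.
rng = random.Random(0)
u, v = [0.6, 0.8], [0.8, 0.6]
gamma, k = 1.0, 2000
weights = [[rng.gauss(0.0, math.sqrt(gamma)) for _ in u] for _ in range(k)]
offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(k)]
a = rff_features(u, weights, offsets)
b = rff_features(v, weights, offsets)

print("exact RBF:", rbf_kernel(u, v, gamma))
print("RFF      :", rff_estimate(a, b))
print("NRFF     :", nrff_estimate(a, b))
print("GMM      :", gmm_kernel([1.0, -2.0], [2.0, -1.0]))  # 2 / 4 = 0.5
```

Both estimates share the same random weights and offsets, so any gap between them in a given run comes from the normalization alone. A single run says nothing about variance; repeating the draw with different seeds would be needed to observe the reduction the abstract describes.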

Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as:	arXiv:1605.05721 [cs.LG]
	(or arXiv:1605.05721v4 [cs.LG] for this version)

Submission history

From: Ping Li
[v1] Wed, 18 May 2016 19:54:22 GMT (109kb)
[v2] Mon, 23 May 2016 19:51:39 GMT (148kb)
[v3] Thu, 3 Nov 2016 18:42:09 GMT (172kb)
[v4] Tue, 21 Feb 2017 17:11:48 GMT (303kb)
