References & Citations
Statistics > Methodology
Title: Exact mean integrated squared error and bandwidth selection for kernel distribution function estimators
(Submitted on 22 Jun 2016 (v1), last revised 24 Jul 2018 (this version, v3))
Abstract: An exact, closed form, and easy to compute expression for the mean integrated squared error (MISE) of a kernel estimator of a normal mixture cumulative distribution function is derived for the class of arbitrary order Gaussian-based kernels. Comparisons are made with MISE of the empirical distribution function, the infeasible minimum MISE of kernel estimators, and the asymptotically optimal second order uniform kernel. The results afford straightforward extensions to other classes of kernel functions and distributions. The analysis also offers a guide on when to use higher order kernels in distribution function estimation.
A simple plug-in method of simultaneously selecting the optimal bandwidth and kernel order is proposed based on a non-asymptotic approximation of the unknown distribution by a normal mixture. A simulation study shows that the method works well in finite samples, thus providing a viable alternative to existing bandwidth selection procedures.
Submission history
From: Vitaliy Oryshchenko [view email][v1] Wed, 22 Jun 2016 15:59:36 GMT (633kb,D)
[v2] Mon, 8 May 2017 15:54:45 GMT (687kb,D)
[v3] Tue, 24 Jul 2018 20:55:45 GMT (1046kb,D)
Link back to: arXiv, form interface, contact.