We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Bandwidth choice for nonparametric classification

Abstract: It is shown that, for kernel-based classification with univariate distributions and two populations, optimal bandwidth choice has a dichotomous character. If the two densities cross at just one point, where their curvatures have the same signs, then minimum Bayes risk is achieved using bandwidths which are an order of magnitude larger than those which minimize pointwise estimation error. On the other hand, if the curvature signs are different, or if there are multiple crossing points, then bandwidths of conventional size are generally appropriate. The range of different modes of behavior is narrower in multivariate settings. There, the optimal size of bandwidth is generally the same as that which is appropriate for pointwise density estimation. These properties motivate empirical rules for bandwidth choice.
Comments: Published at this http URL in the Annals of Statistics (this http URL) by the Institute of Mathematical Statistics (this http URL)
Subjects: Statistics Theory (math.ST)
MSC classes: 62H30, 62C12 (Primary) 62G07. (Secondary)
Journal reference: Annals of Statistics 2005, Vol. 33, No. 1, 284-306
DOI: 10.1214/009053604000000959
Report number: IMS-AOS-AOS303
Cite as: arXiv:math/0504511 [math.ST]
  (or arXiv:math/0504511v1 [math.ST] for this version)

Submission history

From: Peter Hall [view email]
[v1] Mon, 25 Apr 2005 12:10:55 GMT (224kb,S)

Link back to: arXiv, form interface, contact.