Title: Convergence of Chao Unseen Species Estimator

Abstract: Support size estimation and the related problem of unseen species estimation have wide applications in ecology and database analysis. Perhaps the most widely used support size estimator is the Chao estimator. Despite its widespread use, little is known about its theoretical properties. We analyze the Chao estimator and show that its worst-case mean squared error (MSE) is smaller than the MSE of the plug-in estimator by a factor of $\mathcal{O} ((k/n)^4)$, where $k$ is the maximum support size and $n$ is the number of samples. Our main technical contribution is a new method to analyze rational estimators for discrete distribution properties, which may be of independent interest.
Comments: 20 pages, 1 figure, short version presented at International Symposium on Information Theory (ISIT) 2019
Subjects: Statistics Theory (math.ST)
Cite as: arXiv:2001.04130 [math.ST]
  (or arXiv:2001.04130v1 [math.ST] for this version)

Submission history

From: Nived Rajaraman
[v1] Mon, 13 Jan 2020 10:07:13 GMT (353kb)
