We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Statistical computation of Boltzmann entropy and estimation of the optimal probability density function from statistical sample

Abstract: In this work, we investigate the statistical computation of the Boltzmann entropy of statistical samples. For this purpose, we use both histogram and kernel function to estimate the probability density function of statistical samples. We find that, due to coarse-graining, the entropy is a monotonic increasing function of the bin width for histogram or bandwidth for kernel estimation, which seems to be difficult to select an optimal bin width/bandwidth for computing the entropy. Fortunately, we notice that there exists a minimum of the first derivative of entropy for both histogram and kernel estimation, and this minimum point of the first derivative asymptotically points to the optimal bin width or bandwidth. We have verified these findings by large amounts of numerical experiments. Hence, we suggest that the minimum of the first derivative of entropy be used as a selector for the optimal bin width or bandwidth of density estimation. Moreover, the optimal bandwidth selected by the minimum of the first derivative of entropy is purely data-based, independent of the unknown underlying probability density distribution, which is obviously superior to the existing estimators. Our results are not restricted to one-dimensional, but can also be extended to multivariate cases. It should be emphasized, however, that we do not provide a robust mathematical proof of these findings, and we leave these issues with those who are interested in them.
Comments: 8 pages, 6 figures, MNRAS, in the press
Subjects: Methodology (stat.ME); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
Journal reference: MNRAS (2014), 445, 4211 - 4217
DOI: 10.1093/mnras/stu2040
Cite as: arXiv:1410.5356 [stat.ME]
  (or arXiv:1410.5356v1 [stat.ME] for this version)

Submission history

From: Ping He [view email]
[v1] Fri, 17 Oct 2014 16:08:02 GMT (142kb)

Link back to: arXiv, form interface, contact.