Current browse context:
stat.ME
Change to browse by:
References & Citations
Statistics > Methodology
Title: Gini estimation under infinite variance
(Submitted on 5 Jul 2017 (this version), latest version 17 Dec 2017 (v4))
Abstract: Under infinite variance, the Gini coefficient cannot be reliably estimated using conventional nonparametric methods.
We study different approaches to the estimation of the Gini index in presence of a heavy tailed data generating process, that is, one with Paretan tails and/or in the stable distribution class with finite mean but non-finite variance (with tail index $\alpha\in(1,2)$).
While the Gini index is a measurement of fat tailedness, little attention has been brought to a significant downward bias in conventional applications, one that increases with lower values of $\alpha$.
First, we show how the "non-parametric" estimator of the Gini index undergoes a phase transition in the symmetry structure of its asymptotic distribution as the data distribution shifts from the domain of attraction of a light tail distribution to the domain of attraction of a fat tailed, infinite variance one.
Second, we show how the maximum likelihood estimator outperforms the "non-parametric" requiring a much smaller sample size to reach efficiency.
Finally we provide a simple correction mechanism to the small sample bias of the "non-paramteric" estimator based on the distance between the mode and the mean of its asymptotic distribution for the case of heavy tailed data generating process.
Submission history
From: Nassim Nicholas Taleb [view email][v1] Wed, 5 Jul 2017 12:46:35 GMT (4289kb,D)
[v2] Thu, 6 Jul 2017 11:40:38 GMT (4021kb,D)
[v3] Mon, 17 Jul 2017 21:38:37 GMT (3873kb,D)
[v4] Sun, 17 Dec 2017 23:17:47 GMT (153kb,D)
Link back to: arXiv, form interface, contact.