Title: VAT tax gap prediction: a 2-steps Gradient Boosting approach

Abstract: Tax evasion is the illegal non-payment of taxes by individuals, corporations, and trusts. It results in a loss of state revenue that can undermine the effectiveness of government policies. One measure of tax evasion is the so-called tax gap: the difference between the income that should be reported to the tax authorities and the amount actually reported. However, economists lack a robust method for estimating the tax gap through a bottom-up approach based on fiscal audits. The difficulty is that the declared tax base is available for the whole population, whereas the income that should have been reported is generally known only for a small, non-random sample of audited units. This induces a selection bias that invalidates standard statistical methods. Here, we use machine learning, based on a 2-step Gradient Boosting model, to correct for the selection bias without requiring any strong distributional assumption. We apply our method to estimate the Italian VAT gap for individual firms using information gathered from administrative sources. Our algorithm estimates the potential VAT turnover of Italian individual firms for the fiscal year 2011 and suggests that the tax gap is about 30% of the total potential tax base. Comparisons with other methods show that our technique offers a significant improvement in predictive performance.
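
The abstract expresses the tax gap as a share of the total potential tax base. As a minimal sketch of that arithmetic only, the Python snippet below computes the share from unit-level potential and declared amounts. The function name `tax_gap_share` and all figures are hypothetical and chosen for illustration; they are not taken from the paper, and the snippet does not reproduce the authors' Gradient Boosting procedure.

```python
def tax_gap_share(potential_base, declared_base):
    """Return the aggregate tax gap as a fraction of the total potential base.

    potential_base: amounts that should have been reported, one per unit
    declared_base:  amounts actually reported, one per unit
    """
    total_potential = sum(potential_base)
    total_declared = sum(declared_base)
    if total_potential <= 0:
        raise ValueError("total potential base must be positive")
    return (total_potential - total_declared) / total_potential


# Hypothetical figures for three firms (assumption, not data from the paper).
potential = [120.0, 80.0, 200.0]
declared = [90.0, 70.0, 120.0]

share = tax_gap_share(potential, declared)
print(f"Tax gap share: {share:.1%}")  # prints "Tax gap share: 30.0%"
```

In a bottom-up setting the potential amounts would be model predictions for the unaudited units, so the reliability of the share depends entirely on how well those predictions correct for the selection bias described above.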
Comments: 13 pages, 2 figures, 6 tables. Presented at the NTTS 2019 conference. Under review at another peer-reviewed journal.
Subjects: Applications (stat.AP); General Economics (econ.GN); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as: arXiv:1912.03781 [stat.AP]
  (or arXiv:1912.03781v1 [stat.AP] for this version)

Submission history

From: Pierfrancesco Alaimo Di Loro
[v1] Sun, 8 Dec 2019 23:16:29 GMT (48kb,D)