
Title: Visualizing Count Data Regressions Using Rootograms

Abstract: The rootogram is a graphical tool associated with the work of J. W. Tukey that was originally used for assessing goodness of fit of univariate distributions. Here we extend the rootogram to regression models and show that this is particularly useful for diagnosing and treating issues such as overdispersion and/or excess zeros in count data models. We also introduce a weighted version of the rootogram that can be applied out of sample or to (weighted) subsets of the data, e.g., in finite mixture models. An empirical illustration revisiting a well-known data set from ethology is included, for which a negative binomial hurdle model is employed. Supplementary materials providing two further illustrations are available online: the first, using data from public health, employs a two-component finite mixture of negative binomial models; the second, using data from finance, involves underdispersion. An R implementation of our tools is available in the R package countreg, which also contains the data and replication code.
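
The idea behind a rootogram for a regression model can be sketched in a few lines. The sketch below is not the countreg implementation (which is in R); it is a minimal Python illustration under stated assumptions: the fitted model is taken to be Poisson, each observation has its own fitted mean, the expected frequency of a count is the (optionally weighted) sum of the fitted probabilities of that count over all observations, and the "hanging" layout places each bar between sqrt(expected) and sqrt(expected) - sqrt(observed). The function names `poisson_pmf` and `rootogram_table`, and the way weights enter, are hypothetical choices made for this example.

```python
import math


def poisson_pmf(k, mu):
    """Poisson probability P(Y = k) for mean mu > 0, computed on the log scale."""
    return math.exp(-mu + k * math.log(mu) - math.lgamma(k + 1))


def rootogram_table(y, mu, weights=None, max_count=None):
    """Return one row per count k = 0..max_count:
    (k, sqrt(observed), sqrt(expected), bottom of the hanging bar).

    y       : observed counts (non-negative integers)
    mu      : fitted Poisson means, one per observation (assumed given)
    weights : optional non-negative observation weights (assumption:
              they scale both observed and expected frequencies)
    """
    if weights is None:
        weights = [1.0] * len(y)
    if max_count is None:
        max_count = max(y)
    rows = []
    for k in range(max_count + 1):
        # Observed frequency: total weight of observations equal to k.
        observed = sum(w for yi, w in zip(y, weights) if yi == k)
        # Expected frequency: weighted sum of fitted probabilities of k.
        expected = sum(w * poisson_pmf(k, m) for m, w in zip(mu, weights))
        root_obs = math.sqrt(observed)
        root_exp = math.sqrt(expected)
        # In a hanging rootogram the bar hangs from sqrt(expected);
        # a bottom below zero means the model underpredicts count k.
        rows.append((k, root_obs, root_exp, root_exp - root_obs))
    return rows


if __name__ == "__main__":
    # Made-up illustrative data, not taken from the paper.
    counts = [0, 0, 0, 0, 1, 2, 5, 0, 1, 3]
    fitted_means = [1.2] * len(counts)
    for k, root_obs, root_exp, bottom in rootogram_table(counts, fitted_means):
        print(f"{k}: sqrt(obs)={root_obs:.3f} sqrt(exp)={root_exp:.3f} bottom={bottom:.3f}")
```

Reading the bar bottoms against the zero line is what makes the display diagnostic: a bar that drops below zero at count 0 would point to excess zeros relative to the fitted model, and a wave-like pattern across neighboring counts would be consistent with overdispersion.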
Comments: 19 pages, 7 figures
Subjects: Applications (stat.AP)
Journal reference: The American Statistician, 2016, Vol. 70, No. 3, 296-303
DOI: 10.1080/00031305.2016.1173590
Cite as: arXiv:1605.01311 [stat.AP]
  (or arXiv:1605.01311v1 [stat.AP] for this version)

Submission history

From: Christian Kleiber
[v1] Wed, 4 May 2016 15:16:38 GMT
