We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: Revisiting Simpson's Paradox: a statistical misspecification perspective

Authors: Aris Spanos
Abstract: The primary objective of this paper is to revisit Simpson's paradox using a statistical misspecification perspective. It is argued that the reversal of statistical associations is sometimes spurious, stemming from invalid probabilistic assumptions imposed on the data. The concept of statistical misspecification is used to formalize the vague term `spurious results' as `statistically untrustworthy' inference results. This perspective sheds new light on the paradox by distingusing between statistically trustworthy vs. untrustworthy association reversals. It turns out that in both cases there is nothing counterintuitive to explain or account for. This perspective is also used to revisit the causal `resolution' of the paradox in an attempt to delineate the modeling and inference issues raised by the statistical misspecification perspective. The main arguments are illustrated using both actual and hypothetical data from the literature, including Yule's "nonsense-correlations" and the Berkeley admissions study.
Comments: 24 pages, 12 figures
Subjects: Methodology (stat.ME)
Cite as: arXiv:1605.02209 [stat.ME]
  (or arXiv:1605.02209v2 [stat.ME] for this version)

Submission history

From: Aris Spanos [view email]
[v1] Sat, 7 May 2016 16:26:59 GMT (26kb)
[v2] Fri, 13 May 2016 14:53:54 GMT (80kb)

Link back to: arXiv, form interface, contact.