Statistics > Methodology
Title: Statistical Methods for Replicability Assessment
(Submitted on 20 Mar 2019 (v1), last revised 27 Feb 2020 (this version, v2))
Abstract: Large-scale replication studies like the Reproducibility Project: Psychology (RP:P) provide invaluable systematic data on scientific replicability, but most analyses and interpretations of the data fail both to agree on a definition of "replicability" and to disentangle the inexorable consequences of known selection bias from competing explanations. We discuss three concrete definitions of replicability based on (1) whether published findings about the signs of effects are mostly correct, (2) how effective replication studies are in reproducing whatever true effect size was present in the original experiment, and (3) whether true effect sizes tend to diminish in replication. We apply techniques from multiple testing and post-selection inference to develop new methods that answer these questions while explicitly accounting for selection bias. Our analyses suggest that the RP:P dataset is largely consistent with publication bias due to selection of significant effects. The methods in this paper make no distributional assumptions about the true effect sizes.
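The selection-bias point in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method and does not use the RP:P data. It is a toy model under assumed values, in which every study has the same true effect and only statistically significant original estimates are "published". The function name `simulate_selection` and all parameter values (`true_effect`, `se`, `z_crit`) are hypothetical choices made for illustration.

```python
import random
import statistics


def simulate_selection(n_studies=20000, true_effect=0.2, se=0.1,
                       z_crit=1.96, seed=0):
    """Simulate original/replication pairs that share one true effect.

    Only originals with |estimate / se| > z_crit count as "published".
    All parameter values are illustrative assumptions, not RP:P values.
    """
    rng = random.Random(seed)
    originals, replications = [], []
    for _ in range(n_studies):
        # Original estimate: true effect plus Gaussian sampling noise.
        original = rng.gauss(true_effect, se)
        # Selection step: keep the study only if it is significant.
        if abs(original / se) > z_crit:
            originals.append(original)
            # The replication has the same true effect and is not selected.
            replications.append(rng.gauss(true_effect, se))
    return {
        "n_published": len(originals),
        "mean_original": statistics.fmean(originals),
        "mean_replication": statistics.fmean(replications),
    }


if __name__ == "__main__":
    print(simulate_selection())
```

In this toy setting the published original estimates average noticeably above the true effect, while the replication estimates average close to it. The estimates therefore appear to shrink on replication even though the true effect never changes, which is why an analysis of declining effects needs to account for selection explicitly.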
Submission history
From: Kenneth Hung
[v1] Wed, 20 Mar 2019 21:15:12 GMT (223kb,D)
[v2] Thu, 27 Feb 2020 00:04:45 GMT (221kb,D)