Statistics > Methodology
Title: Experimental Design Issues in Big Data. The Question of Bias
(Submitted on 19 Dec 2017 (v1), last revised 20 Nov 2018 (this version, v4))
Abstract: Data in scientific studies can be collected via a controlled experiment or by passive observation. Big data is often collected passively, e.g. from social media. In studies of causation, great efforts are made to guard against bias, hidden confounders, and feedback, any of which can destroy the identification of causation by corrupting or omitting counterfactuals (controls). Various solutions to these problems are discussed, including randomization.
Submission history
From: Elena Pesce
[v1] Tue, 19 Dec 2017 13:31:59 GMT (29kb)
[v2] Mon, 8 Jan 2018 11:56:37 GMT (29kb)
[v3] Thu, 11 Jan 2018 14:59:42 GMT (29kb)
[v4] Tue, 20 Nov 2018 10:05:54 GMT (29kb)