We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: The Limits to Learning a Diffusion Model

Abstract: This paper provides the first sample complexity lower bounds for the estimation of simple diffusion models, including the Bass model (used in modeling consumer adoption) and the SIR model (used in modeling epidemics). We show that one cannot hope to learn such models until quite late in the diffusion. Specifically, we show that the time required to collect a number of observations that exceeds our sample complexity lower bounds is large. For Bass models with low innovation rates, our results imply that one cannot hope to predict the eventual number of adopting customers until one is at least two-thirds of the way to the time at which the rate of new adopters is at its peak. In a similar vein, our results imply that in the case of an SIR model, one cannot hope to predict the eventual number of infections until one is approximately two-thirds of the way to the time at which the infection rate has peaked. These limits are borne out in both product adoption data (Amazon), as well as epidemic data (COVID-19).
Subjects: Methodology (stat.ME); Machine Learning (cs.LG)
Cite as: arXiv:2006.06373 [stat.ME]
  (or arXiv:2006.06373v2 [stat.ME] for this version)

Submission history

From: Jackie Baek [view email]
[v1] Thu, 11 Jun 2020 12:47:16 GMT (658kb,D)
[v2] Tue, 16 Mar 2021 03:26:28 GMT (1108kb,D)

Link back to: arXiv, form interface, contact.