We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Quantitative Biology > Populations and Evolution

Title: Stochastic Modeling of an Infectious Disease, Part I: Understand the Negative Binomial Distribution and Predict an Epidemic More Reliably

Abstract: Why are the epidemic patterns of COVID-19 so different among different cities or countries which are similar in their populations, medical infrastructures, and people's behavior? Why are forecasts or predictions made by so-called experts often grossly wrong, concerning the numbers of people who get infected or die? The purpose of this study is to better understand the stochastic nature of an epidemic disease, and answer the above questions. Much of the work on infectious diseases has been based on "SIR deterministic models," (Kermack and McKendrick:1927.) We will explore stochastic models that can capture the essence of the seemingly erratic behavior of an infectious disease. A stochastic model, in its formulation, takes into account the random nature of an infectious disease.
The stochastic model we study here is based on the "birth-and-death process with immigration" (BDI for short), which was proposed in the study of population growth or extinction of some biological species. The BDI process model ,however, has not been investigated by the epidemiology community. The BDI process is one of a few birth-and-death processes, which we can solve analytically. Its time-dependent probability distribution function is a "negative binomial distribution" with its parameter $r$ less than $1$. The "coefficient of variation" of the process is larger than $\sqrt{1/r} > 1$. Furthermore, it has a long tail like the zeta distribution. These properties explain why infection patterns exhibit enormously large variations. The number of infected predicted by a deterministic model is much greater than the median of the distribution. This explains why any forecast based on a deterministic model will fail more often than not.
Comments: 28 pages, 14 figures
Subjects: Populations and Evolution (q-bio.PE); Methodology (stat.ME)
MSC classes: 00
ACM classes: G.3; I.6; J.3
Cite as: arXiv:2006.01586 [q-bio.PE]
  (or arXiv:2006.01586v1 [q-bio.PE] for this version)

Submission history

From: Hisashi Kobayashi [view email]
[v1] Tue, 2 Jun 2020 13:25:36 GMT (349kb,D)

Link back to: arXiv, form interface, contact.