We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Group testing with nested pools

Abstract: In order to identify the infected individuals of a population, their samples are divided in equally sized groups called pools and a single laboratory test is applied to each pool. Individuals whose samples belong to pools that test negative are declared healthy, while each pool that tests positive is divided into smaller, equally sized pools which are tested in the next stage. In the $(k+1)$-th stage all remaining samples are tested. If $p<1-3^{-1/3}$, we minimize the expected number of tests per individual as a function of the number $k+1$ of stages, and of the pool sizes in the first $k$ stages. We show that for each $p\in (0, 1-3^{-1/3})$ the optimal choice is one of four possible schemes, which are explicitly described. We conjecture that for each $p$, the optimal choice is one of the two sequences of pool sizes $(3^k\text{ or }3^{k-1}4,3^{k-1},\dots,3^2,3 )$, with a precise description of the range of $p$'s where each is optimal. The conjecture is supported by overwhelming numerical evidence for $p>2^{-51}$. We also show that the cost of the best among the schemes $(3^k,\dots,3)$ is of order $O\big(p\log(1/p)\big)$, comparable to the information theoretical lower bound $p\log_2(1/p)+(1-p)\log_2(1/(1-p))$, the entropy of a Bernoulli$(p)$ random variable.
Comments: 31 pages, 2 figures
Subjects: Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an); Quantitative Methods (q-bio.QM); Applications (stat.AP)
MSC classes: 62.P.10
Cite as: arXiv:2005.13650 [math.ST]
  (or arXiv:2005.13650v4 [math.ST] for this version)

Submission history

From: Pablo A. Ferrari [view email]
[v1] Wed, 27 May 2020 21:00:31 GMT (574kb,D)
[v2] Sat, 6 Jun 2020 23:43:00 GMT (578kb,D)
[v3] Thu, 19 Nov 2020 22:17:46 GMT (40kb,D)
[v4] Tue, 5 Oct 2021 00:01:04 GMT (44kb,D)

Link back to: arXiv, form interface, contact.