Computer Science > Information Theory

Title: Second-Order Asymptotically Optimal Outlier Hypothesis Testing

Abstract: We revisit the outlier hypothesis testing framework of Li et al. (TIT 2014) and derive fundamental limits for the optimal test under the generalized Neyman-Pearson criterion. In outlier hypothesis testing, one is given multiple observed sequences, where most sequences are generated i.i.d. from a nominal distribution. The task is to discern the set of outlying sequences that are generated from anomalous distributions. The nominal and anomalous distributions are unknown. We study the tradeoff among the probabilities of misclassification error, false alarm and false reject for tests that satisfy weak conditions on the rate of decrease of these error probabilities as a function of sequence length. Specifically, we propose a threshold-based test that ensures exponential decay of misclassification error and false alarm probabilities. We study two constraints on the false reject probability, with one constraint being that it is a non-vanishing constant and the other being that it has an exponential decay rate. For both cases, we characterize bounds on the false reject probability, as a function of the threshold, for each pair of nominal and anomalous distributions and demonstrate the optimality of our test under the generalized Neyman-Pearson criterion. We first consider the case of at most one outlying sequence and then generalize our results to the case of multiple outlying sequences where the number of outlying sequences is unknown and each outlying sequence can follow a different anomalous distribution.
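To make the setting concrete, here is a minimal Python sketch of a threshold-based test for the case of at most one outlying sequence. The abstract does not specify the test statistic, so everything in this sketch is an assumption: the leave-one-out KL-divergence score, the function names (`empirical_dist`, `kl`, `score`, `detect_outlier`), and the decision rule are illustrative choices, not necessarily the test analyzed in the paper. The idea is that removing the true outlier should leave the remaining sequences in close agreement with one another, while removing any nominal sequence should not.

```python
import math
from collections import Counter

def empirical_dist(seq, alphabet):
    # Empirical distribution (type) of a sequence over a finite alphabet.
    counts = Counter(seq)
    n = len(seq)
    return [counts[a] / n for a in alphabet]

def kl(p, q):
    # KL divergence D(p || q) in nats; terms with p_i = 0 contribute 0.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def score(types, i):
    # Hypothetical leave-one-out score: disagreement among all sequences
    # except the i-th, measured against their average type.
    others = [t for j, t in enumerate(types) if j != i]
    avg = [sum(col) / len(others) for col in zip(*others)]
    return sum(kl(t, avg) for t in others)

def detect_outlier(sequences, alphabet, threshold):
    # Assumed decision rule: the candidate is the index with the smallest
    # score. It is declared an outlier only if the second-smallest score
    # exceeds the threshold; otherwise the test reports no outlier (None).
    # Requires at least three sequences.
    types = [empirical_dist(s, alphabet) for s in sequences]
    scores = [score(types, i) for i in range(len(types))]
    order = sorted(range(len(scores)), key=scores.__getitem__)
    best, second = order[0], order[1]
    return best if scores[second] > threshold else None

# Toy data: three nominal sequences and one anomalous one (index 2).
nominal = [0] * 8 + [1] * 2
anomalous = [0] * 2 + [1] * 8
result = detect_outlier([nominal, nominal, anomalous, nominal], [0, 1], 0.1)
```

In this sketch, raising `threshold` makes the test more reluctant to declare an outlier, which is one way to picture the tradeoff between false alarm and false reject that the abstract describes.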
Comments: To appear in IEEE Transactions on Information Theory. Copyright (c) 2017 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending a request to pubs-permissions@ieee.org
Subjects: Information Theory (cs.IT); Statistics Theory (math.ST)
Cite as: arXiv:2009.03505 [cs.IT]
  (or arXiv:2009.03505v4 [cs.IT] for this version)

Submission history

From: Lin Zhou
[v1] Tue, 8 Sep 2020 03:47:18 GMT (1378kb)
[v2] Mon, 6 Sep 2021 02:28:46 GMT (1410kb)
[v3] Sun, 23 Jan 2022 14:59:34 GMT (873kb,D)
[v4] Mon, 14 Feb 2022 06:25:16 GMT (408kb,D)
