
Title: Concentration Inequalities for Two-Sample Rank Processes with Application to Bipartite Ranking

Abstract: The ROC curve is the gold standard for measuring the performance of a test/scoring statistic regarding its capacity to discriminate between two statistical populations in a wide variety of applications, ranging from anomaly detection in signal processing to information retrieval, through medical diagnosis. Most practical performance measures used in scoring/ranking applications, such as the AUC, the local AUC, the p-norm push, the DCG and others, can be viewed as summaries of the ROC curve. In this paper, the fact that most of these empirical criteria can be expressed as two-sample linear rank statistics is highlighted, and concentration inequalities for collections of such random variables, referred to here as two-sample rank processes, are proved when they are indexed by VC classes of scoring functions. Based on these nonasymptotic bounds, the generalization capacity of empirical maximizers of a wide class of ranking performance criteria is then investigated from a theoretical perspective. The theoretical analysis is also supported by empirical evidence from numerical experiments.
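
As an illustration of the abstract's claim that the empirical AUC can be expressed as a two-sample linear rank statistic, here is a minimal sketch. It is not taken from the paper: the function names (`midranks`, `linear_rank_statistic`, `auc_from_rank_statistic`, `auc_pairwise`) are hypothetical, and the normalization rank/(N+1) with score-generating function phi(u) = u is an assumption chosen for the example. Ties are handled with midranks, which corresponds to counting tied pairs as 1/2.

```python
def midranks(values):
    """Return 1-based ranks of `values`, averaging ranks over ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def linear_rank_statistic(pos_scores, neg_scores, phi):
    """Sum of phi(rank / (N + 1)) over the positive sample (assumed form)."""
    pooled = list(pos_scores) + list(neg_scores)
    n_total = len(pooled)
    ranks = midranks(pooled)
    # Positive scores occupy the first len(pos_scores) pooled positions.
    return sum(phi(ranks[i] / (n_total + 1)) for i in range(len(pos_scores)))


def auc_from_rank_statistic(pos_scores, neg_scores):
    """Empirical AUC recovered from the rank statistic with phi(u) = u."""
    n, m = len(pos_scores), len(neg_scores)
    w = linear_rank_statistic(pos_scores, neg_scores, lambda u: u)
    rank_sum = w * (n + m + 1)          # undo the 1/(N+1) normalization
    u_stat = rank_sum - n * (n + 1) / 2  # Mann-Whitney U statistic
    return u_stat / (n * m)


def auc_pairwise(pos_scores, neg_scores):
    """Reference AUC: fraction of correctly ordered pairs, ties count 1/2."""
    total = 0.0
    for p in pos_scores:
        for q in neg_scores:
            total += 1.0 if p > q else 0.5 if p == q else 0.0
    return total / (len(pos_scores) * len(neg_scores))


if __name__ == "__main__":
    pos = [0.9, 0.8, 0.4]
    neg = [0.7, 0.3, 0.2]
    print(auc_from_rank_statistic(pos, neg), auc_pairwise(pos, neg))
```

Passing a different `phi` to `linear_rank_statistic` changes how ranks are weighted; the other criteria named in the abstract are not implemented in this sketch.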
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
Journal reference: Electronic Journal of Statistics, Shaker Heights, OH: Institute of Mathematical Statistics, 2021, 15 (2), pp. 4659-4717
DOI: 10.1214/21-EJS1907
Cite as: arXiv:2104.02943 [math.ST]
  (or arXiv:2104.02943v3 [math.ST] for this version)

Submission history

From: Myrto Limnios
[v1] Wed, 7 Apr 2021 06:31:06 GMT (4847kb,D)
[v2] Wed, 1 Jun 2022 12:13:17 GMT (4869kb,D)
[v3] Tue, 24 Jan 2023 08:34:11 GMT (4870kb,D)
