We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective

Abstract: The focal loss has demonstrated its effectiveness in many real-world applications such as object detection and image classification, but its theoretical understanding has been limited so far. In this paper, we first prove that the focal loss is classification-calibrated, i.e., its minimizer surely yields the Bayes-optimal classifier and thus the use of the focal loss in classification can be theoretically justified. However, we also prove a negative fact that the focal loss is not strictly proper, i.e., the confidence score of the classifier obtained by focal loss minimization does not match the true class-posterior probability and thus it is not reliable as a class-posterior probability estimator. To mitigate this problem, we next prove that a particular closed-form transformation of the confidence score allows us to recover the true class-posterior probability. Through experiments on benchmark datasets, we demonstrate that our proposed transformation significantly improves the accuracy of class-posterior probability estimation.
Comments: 57 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2011.09172 [stat.ML]
  (or arXiv:2011.09172v2 [stat.ML] for this version)

Submission history

From: Nontawat Charoenphakdee [view email]
[v1] Wed, 18 Nov 2020 09:36:52 GMT (8954kb,D)
[v2] Mon, 14 Dec 2020 04:15:40 GMT (8945kb,D)

Link back to: arXiv, form interface, contact.