Leveraging Labeled and Unlabeled Data for Consistent Fair Binary Classification

Chzhen, Evgenii; Denis, Christophe; Hebiri, Mohamed; Oneto, Luca; Pontil, Massimiliano

Mathematics > Statistics Theory

Title: Leveraging Labeled and Unlabeled Data for Consistent Fair Binary Classification

Authors: Evgenii Chzhen (LAMA, LMO, CELESTE), Christophe Denis (LAMA), Mohamed Hebiri (LAMA), Luca Oneto, Massimiliano Pontil (IIT, UCL)

(Submitted on 12 Jun 2019 (v1), last revised 4 Feb 2020 (this version, v2))

Abstract: We study the problem of fair binary classification using the notion of Equal Opportunity, which requires the true positive rate to be equal across the sensitive groups. Within this setting we show that the fair optimal classifier is obtained by recalibrating the Bayes classifier with a group-dependent threshold, and we provide a constructive expression for this threshold. This result motivates us to devise a plug-in classification procedure based on both unlabeled and labeled datasets. While the latter is used to learn the output conditional probability, the former is used for calibration. The overall procedure can be computed in polynomial time and is shown to be statistically consistent in terms of both the classification error and the fairness measure. Finally, we present numerical experiments which indicate that our method is often superior to, or competitive with, state-of-the-art methods on benchmark datasets.
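The calibration idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's estimator: the abstract does not state the constructive threshold expression, so the one-parameter family used here (cutoffs 0.5 + shift and 0.5 - shift) and the grid search are assumptions made purely for illustration, as are the function names. The sketch assumes a conditional probability estimate eta(x, s) has already been fitted on labeled data, and uses only unlabeled points (their eta values and group labels) to pick group-dependent thresholds that bring the estimated true positive rates together. It relies on the identity P(g = 1 | Y = 1, S = s) = E[eta 1{g = 1} | S = s] / E[eta | S = s], which needs no labels.

```python
import numpy as np


def estimated_tpr(eta, threshold):
    """Label-free plug-in estimate of the true positive rate of the rule
    1{eta >= threshold}: sum(eta * 1{eta >= threshold}) / sum(eta)."""
    eta = np.asarray(eta, dtype=float)
    return float(np.sum(eta * (eta >= threshold)) / np.sum(eta))


def calibrate_thresholds(eta_unlabeled, group_unlabeled, grid_size=1001):
    """Pick group-dependent cutoffs on unlabeled data.

    ASSUMPTION (illustrative only, not the paper's formula): the cutoffs are
    0.5 + shift for group 0 and 0.5 - shift for group 1, and shift is chosen
    by grid search to minimize the gap between the estimated group TPRs.
    Both groups must be non-empty with a positive sum of eta.
    """
    eta = np.asarray(eta_unlabeled, dtype=float)
    group = np.asarray(group_unlabeled)
    eta0, eta1 = eta[group == 0], eta[group == 1]

    best = None
    for shift in np.linspace(-0.5, 0.5, grid_size):
        t0, t1 = 0.5 + shift, 0.5 - shift
        gap = abs(estimated_tpr(eta0, t0) - estimated_tpr(eta1, t1))
        # Ties are broken toward the smallest shift, i.e. the cutoffs
        # closest to the unconstrained Bayes threshold 0.5.
        key = (gap, abs(shift))
        if best is None or key < best[0]:
            best = (key, t0, t1)
    return {0: best[1], 1: best[2]}


def predict(eta, group, thresholds):
    """Apply the group-dependent cutoffs to estimated probabilities."""
    eta = np.asarray(eta, dtype=float)
    group = np.asarray(group)
    cutoffs = np.where(group == 1, thresholds[1], thresholds[0])
    return (eta >= cutoffs).astype(int)
```

Because the search grid contains the zero shift, the estimated true positive rate gap after calibration is never larger than the gap obtained by thresholding both groups at 0.5. Whether this heuristic family also minimizes classification error under the fairness constraint is not claimed here; the paper's constructive threshold should be consulted for that.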

Subjects:	Statistics Theory (math.ST); Machine Learning (stat.ML)
Journal reference:	NeurIPS 2019 - 33rd Annual Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada
Cite as:	arXiv:1906.05082 [math.ST]
	(or arXiv:1906.05082v2 [math.ST] for this version)

Submission history

From: Mohamed Hebiri [view email]
[v1] Wed, 12 Jun 2019 12:25:25 GMT (626kb,D)
[v2] Tue, 4 Feb 2020 07:30:31 GMT (632kb,D)
