Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment
(Submitted on 26 Oct 2016 (v1), last revised 8 Mar 2017 (this version, v2))
Abstract: Automated data-driven decision making systems are increasingly being used to assist, or even replace humans in many settings. These systems function by learning from historical decisions, often taken by humans. In order to maximize the utility of these systems (or, classifiers), their training involves minimizing the errors (or, misclassifications) over the given historical data. However, it is quite possible that the optimally trained classifier makes decisions for people belonging to different social groups with different misclassification rates (e.g., misclassification rates for females are higher than for males), thereby placing these groups at an unfair disadvantage. To account for and avoid such unfairness, in this paper, we introduce a new notion of unfairness, disparate mistreatment, which is defined in terms of misclassification rates. We then propose intuitive measures of disparate mistreatment for decision boundary-based classifiers, which can be easily incorporated into their formulation as convex-concave constraints. Experiments on synthetic as well as real world datasets show that our methodology is effective at avoiding disparate mistreatment, often at a small cost in terms of accuracy.
Submission history
From: Muhammad Bilal Zafar [view email][v1] Wed, 26 Oct 2016 18:34:48 GMT (122kb,D)
[v2] Wed, 8 Mar 2017 19:04:28 GMT (155kb,D)
Link back to: arXiv, form interface, contact.