Counterfactual fairness: removing direct effects through regularization

Di Stefano, Pietro G.; Hickey, James M.; Vasileiou, Vlasios

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2002

Computer Science > Artificial Intelligence

Title: Counterfactual fairness: removing direct effects through regularization

Authors: Pietro G. Di Stefano, James M. Hickey, Vlasios Vasileiou

(Submitted on 25 Feb 2020 (v1), last revised 26 Feb 2020 (this version, v2))

Abstract: Building machine learning models that are fair with respect to an unprivileged group is a topical problem. Modern fairness-aware algorithms often ignore causal effects and enforce fairness through modifications applicable to only a subset of machine learning models. In this work, we propose a new definition of fairness that incorporates causality through the Controlled Direct Effect (CDE). We develop regularizations to tackle classical fairness measures and present a causal regularization that satisfies our new fairness definition by removing the impact of unprivileged group variables on the model outcomes as measured by the CDE. These regularizations are applicable to any model trained using by iteratively minimizing a loss through differentiation. We demonstrate our approaches using both gradient boosting and logistic regression on: a synthetic dataset, the UCI Adult (Census) Dataset, and a real-world credit-risk dataset. Our results were found to mitigate unfairness from the predictions with small reductions in model performance.

Comments:	10 pages, 4 figures
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2002.10774 [cs.AI]
	(or arXiv:2002.10774v2 [cs.AI] for this version)

Submission history

From: Pietro Di Stefano [view email]
[v1] Tue, 25 Feb 2020 10:13:55 GMT (116kb,D)
[v2] Wed, 26 Feb 2020 11:28:34 GMT (116kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.10774

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: Counterfactual fairness: removing direct effects through regularization

Submission history