We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.AI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Artificial Intelligence

Title: Counterfactual fairness: removing direct effects through regularization

Abstract: Building machine learning models that are fair with respect to an unprivileged group is a topical problem. Modern fairness-aware algorithms often ignore causal effects and enforce fairness through modifications applicable to only a subset of machine learning models. In this work, we propose a new definition of fairness that incorporates causality through the Controlled Direct Effect (CDE). We develop regularizations to tackle classical fairness measures and present a causal regularization that satisfies our new fairness definition by removing the impact of unprivileged group variables on the model outcomes as measured by the CDE. These regularizations are applicable to any model trained using by iteratively minimizing a loss through differentiation. We demonstrate our approaches using both gradient boosting and logistic regression on: a synthetic dataset, the UCI Adult (Census) Dataset, and a real-world credit-risk dataset. Our results were found to mitigate unfairness from the predictions with small reductions in model performance.
Comments: 10 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as: arXiv:2002.10774 [cs.AI]
  (or arXiv:2002.10774v2 [cs.AI] for this version)

Submission history

From: Pietro Di Stefano [view email]
[v1] Tue, 25 Feb 2020 10:13:55 GMT (116kb,D)
[v2] Wed, 26 Feb 2020 11:28:34 GMT (116kb,D)

Link back to: arXiv, form interface, contact.