
Title: Multiaccurate Proxies for Downstream Fairness

Abstract: We study the problem of training a model that must obey demographic fairness conditions when the sensitive features are not available at training time -- in other words, how can we train a model to be fair by race when we don't have data about race? We adopt a fairness pipeline perspective, in which an "upstream" learner that does have access to the sensitive features will learn a proxy model for these features from the other attributes. The goal of the proxy is to allow a general "downstream" learner -- with minimal assumptions on their prediction task -- to be able to use the proxy to train a model that is fair with respect to the true sensitive features. We show that obeying multiaccuracy constraints with respect to the downstream model class suffices for this purpose, provide sample- and oracle-efficient algorithms and generalization bounds for learning such proxies, and conduct an experimental evaluation. In general, multiaccuracy is much easier to satisfy than classification accuracy, and can be satisfied even when the sensitive features are hard to predict.
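
The last claim of the abstract can be illustrated with a small synthetic sketch. It assumes the common form of multiaccuracy, in which a proxy is alpha-multiaccurate with respect to a class H if |E[h(x) (proxy(x) - z)]| <= alpha for every h in H; the paper's exact definition may differ in its details. The function name `multiaccuracy_violation`, the synthetic data, and the three threshold classifiers standing in for the downstream class are all hypothetical choices made for illustration.

```python
import numpy as np

def multiaccuracy_violation(z_true, z_proxy, h_outputs):
    """Largest |E[h(x) * (z_proxy(x) - z)]| over the given test functions.

    z_true:    shape (n,), true sensitive attribute in {0, 1}
    z_proxy:   shape (n,), proxy predictions in [0, 1]
    h_outputs: shape (k, n), row j holds h_j(x_i) for every sample i
    """
    residual = z_proxy - z_true
    # Empirical mean of h(x) * residual, computed for all k functions at once.
    correlations = h_outputs @ residual / residual.shape[0]
    return float(np.max(np.abs(correlations)))

rng = np.random.default_rng(0)
n = 10_000
x = rng.normal(size=(n, 3))

# Assumption: the sensitive attribute is a fair coin independent of x,
# so no proxy built from x can classify it better than chance.
z = rng.integers(0, 2, size=n).astype(float)

# Hypothetical downstream class: three threshold classifiers on x.
h_outputs = np.stack([(x[:, j] > 0).astype(float) for j in range(3)])

# A constant proxy that outputs the base rate of z.
proxy = np.full(n, z.mean())

violation = multiaccuracy_violation(z, proxy, h_outputs)
accuracy = float(np.mean((proxy > 0.5) == (z == 1.0)))

print(f"multiaccuracy violation: {violation:.4f}")
print(f"classification accuracy: {accuracy:.4f}")
```

In this setup the constant proxy classifies the attribute at roughly chance level, yet its residual is nearly uncorrelated with every test function, so the measured violation is close to zero. This is only a toy check on a fixed finite set of functions, not the learning algorithm described in the paper.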
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
Cite as: arXiv:2107.04423 [cs.LG]
  (or arXiv:2107.04423v2 [cs.LG] for this version)

Submission history

From: Emily Diana
[v1] Fri, 9 Jul 2021 13:16:44 GMT (57kb)
[v2] Tue, 25 Jan 2022 20:11:10 GMT (2597kb,D)
