We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Efficient Algorithms For Fair Clustering with a New Fairness Notion

Abstract: We revisit the problem of fair clustering, first introduced by Chierichetti et al., that requires each protected attribute to have approximately equal representation in every cluster; i.e., a balance property. Existing solutions to fair clustering are either not scalable or do not achieve an optimal trade-off between clustering objective and fairness. In this paper, we propose a new notion of fairness, which we call $tau$-fair fairness, that strictly generalizes the balance property and enables a fine-grained efficiency vs. fairness trade-off. Furthermore, we show that simple greedy round-robin based algorithms achieve this trade-off efficiently. Under a more general setting of multi-valued protected attributes, we rigorously analyze the theoretical properties of the our algorithms. Our experimental results suggest that the proposed solution outperforms all the state-of-the-art algorithms and works exceptionally well even for a large number of clusters.
Comments: 41 Pages, 12 Figures, 2 Tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
Journal reference: Data Mining and Knowledge Discovery (S.I: Bias and Fairness) 2023
DOI: 10.1007/s10618-023-00928-6
Cite as: arXiv:2109.00708 [cs.LG]
  (or arXiv:2109.00708v3 [cs.LG] for this version)

Submission history

From: Shivam Gupta Mr [view email]
[v1] Thu, 2 Sep 2021 04:52:49 GMT (12321kb,D)
[v2] Fri, 3 Sep 2021 08:44:39 GMT (7213kb,D)
[v3] Tue, 28 Jun 2022 06:37:17 GMT (15030kb,D)

Link back to: arXiv, form interface, contact.