Identifying Biased Subgroups in Ranking and Classification

Pastor, Eliana; de Alfaro, Luca; Baralis, Elena

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2108

Computer Science > Machine Learning

Title: Identifying Biased Subgroups in Ranking and Classification

Authors: Eliana Pastor, Luca de Alfaro, Elena Baralis

(Submitted on 17 Aug 2021)

Abstract: When analyzing the behavior of machine learning algorithms, it is important to identify specific data subgroups for which the considered algorithm shows different performance with respect to the entire dataset. The intervention of domain experts is normally required to identify relevant attributes that define these subgroups.
We introduce the notion of divergence to measure this performance difference and we exploit it in the context of (i) classification models and (ii) ranking applications to automatically detect data subgroups showing a significant deviation in their behavior. Furthermore, we quantify the contribution of all attributes in the data subgroup to the divergent behavior by means of Shapley values, thus allowing the identification of the most impacting attributes.

Comments:	5 pages
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY); Information Retrieval (cs.IR)
Journal reference:	In Responsible AI @ KDD 2021 Workshop, 2021
Cite as:	arXiv:2108.07450 [cs.LG]
	(or arXiv:2108.07450v1 [cs.LG] for this version)

Submission history

From: Luca De Alfaro [view email]
[v1] Tue, 17 Aug 2021 05:26:11 GMT (1769kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2108.07450

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Identifying Biased Subgroups in Ranking and Classification

Submission history