Title: Debiased-CAM to mitigate systematic error with faithful visual explanations of machine learning

Abstract: Model explanations such as saliency maps can improve user trust in AI by highlighting important features for a prediction. However, these become distorted and misleading when explaining predictions of images that are subject to systematic error (bias). Furthermore, the distortions persist despite model fine-tuning on images biased by different factors (blur, color temperature, day/night). We present Debiased-CAM to recover explanation faithfulness across various bias types and levels by training a multi-input, multi-task model with auxiliary tasks for explanation and bias level predictions. In simulation studies, the approach not only enhanced prediction accuracy, but also generated highly faithful explanations about these predictions as if the images were unbiased. In user studies, debiased explanations improved user task performance, perceived truthfulness and perceived helpfulness. Debiased training can provide a versatile platform for robust performance and explanation faithfulness for a wide range of applications with data biases.
Comments: This work was intended as a replacement of arXiv:2012.05567 and any subsequent updates will appear there
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI)
ACM classes: I.2.0
Cite as: arXiv:2201.12835 [cs.HC]
  (or arXiv:2201.12835v2 [cs.HC] for this version)

Submission history

From: Wencan Zhang
[v1] Sun, 30 Jan 2022 14:42:21 GMT (28020kb,D)
[v2] Tue, 1 Mar 2022 01:27:56 GMT (0kb,I)
