Use-Case-Grounded Simulations for Explanation Evaluation

Chen, Valerie; Johnson, Nari; Topin, Nicholay; Plumb, Gregory; Talwalkar, Ameet

Full-text links:

Download:

Current browse context:

cs.HC

< prev | next >

new | recent | 2206

Computer Science > Human-Computer Interaction

Title: Use-Case-Grounded Simulations for Explanation Evaluation

Authors: Valerie Chen, Nari Johnson, Nicholay Topin, Gregory Plumb, Ameet Talwalkar

(Submitted on 5 Jun 2022 (v1), last revised 20 Aug 2022 (this version, v2))

Abstract: A growing body of research runs human subject evaluations to study whether providing users with explanations of machine learning models can help them with practical real-world use cases. However, running user studies is challenging and costly, and consequently each study typically only evaluates a limited number of different settings, e.g., studies often only evaluate a few arbitrarily selected explanation methods. To address these challenges and aid user study design, we introduce Use-Case-Grounded Simulated Evaluations (SimEvals). SimEvals involve training algorithmic agents that take as input the information content (such as model explanations) that would be presented to each participant in a human subject study, to predict answers to the use case of interest. The algorithmic agent's test set accuracy provides a measure of the predictiveness of the information content for the downstream use case. We run a comprehensive evaluation on three real-world use cases (forward simulation, model debugging, and counterfactual reasoning) to demonstrate that Simevals can effectively identify which explanation methods will help humans for each use case. These results provide evidence that SimEvals can be used to efficiently screen an important set of user study design decisions, e.g. selecting which explanations should be presented to the user, before running a potentially costly user study.

Subjects:	Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2206.02256 [cs.HC]
	(or arXiv:2206.02256v2 [cs.HC] for this version)

Submission history

From: Valerie Chen [view email]
[v1] Sun, 5 Jun 2022 20:12:19 GMT (2198kb,D)
[v2] Sat, 20 Aug 2022 15:42:29 GMT (3005kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.02256

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Human-Computer Interaction

Title: Use-Case-Grounded Simulations for Explanation Evaluation

Submission history