On the Utility of Prediction Sets in Human-AI Teams

Babbar, Varun; Bhatt, Umang; Weller, Adrian

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2205

Computer Science > Artificial Intelligence

Title: On the Utility of Prediction Sets in Human-AI Teams

Authors: Varun Babbar, Umang Bhatt, Adrian Weller

(Submitted on 3 May 2022 (v1), last revised 26 May 2022 (this version, v2))

Abstract: Research on human-AI teams usually provides experts with a single label, which ignores the uncertainty in a model's recommendation. Conformal prediction (CP) is a well established line of research that focuses on building a theoretically grounded, calibrated prediction set, which may contain multiple labels. We explore how such prediction sets impact expert decision-making in human-AI teams. Our evaluation on human subjects finds that set valued predictions positively impact experts. However, we notice that the predictive sets provided by CP can be very large, which leads to unhelpful AI assistants. To mitigate this, we introduce D-CP, a method to perform CP on some examples and defer to experts. We prove that D-CP can reduce the prediction set size of non-deferred examples. We show how D-CP performs in quantitative and in human subject experiments ($n=120$). Our results suggest that CP prediction sets improve human-AI team performance over showing the top-1 prediction alone, and that experts find D-CP prediction sets are more useful than CP prediction sets.

Comments:	Accepted at IJCAI 2022
Subjects:	Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2205.01411 [cs.AI]
	(or arXiv:2205.01411v2 [cs.AI] for this version)

Submission history

From: Varun Babbar [view email]
[v1] Tue, 3 May 2022 10:53:40 GMT (6712kb,D)
[v2] Thu, 26 May 2022 12:43:37 GMT (6712kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.01411

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: On the Utility of Prediction Sets in Human-AI Teams

Submission history