We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: LIMEADE: From AI Explanations to Advice Taking

Abstract: Research in human-centered AI has shown the benefits of systems that can explain their predictions. Methods that allow an AI to take advice from humans in response to explanations are similarly useful. While both capabilities are well-developed for transparent learning models (e.g., linear models and GA$^2$Ms), and recent techniques (e.g., LIME and SHAP) can generate explanations for opaque models, little attention has been given to advice methods for opaque models. This paper introduces LIMEADE, the first general framework that translates both positive and negative advice (expressed using high-level vocabulary such as that employed by post-hoc explanations) into an update to an arbitrary, underlying opaque model. We demonstrate the generality of our approach with case studies on seventy real-world models across two broad domains: image classification and text recommendation. We show our method improves accuracy compared to a rigorous baseline on the image classification domains. For the text modality, we apply our framework to a neural recommender system for scientific papers on a public website; our user study shows that our framework leads to significantly higher perceived user control, trust, and satisfaction.
Comments: 18 pages, 7 figures
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2003.04315 [cs.IR]
  (or arXiv:2003.04315v5 [cs.IR] for this version)

Submission history

From: Benjamin Lee [view email]
[v1] Mon, 9 Mar 2020 18:00:00 GMT (581kb)
[v2] Fri, 22 Oct 2021 03:13:39 GMT (2454kb,D)
[v3] Tue, 1 Mar 2022 23:42:10 GMT (1127kb)
[v4] Wed, 12 Oct 2022 22:45:19 GMT (1138kb)
[v5] Tue, 17 Jan 2023 23:29:15 GMT (1261kb)

Link back to: arXiv, form interface, contact.