References & Citations
Computer Science > Artificial Intelligence
Title: Logic-Based Explainability in Machine Learning
(Submitted on 24 Oct 2022 (v1), last revised 29 Jan 2023 (this version, v3))
Abstract: The last decade witnessed an ever-increasing stream of successes in Machine Learning (ML). These successes offer clear evidence that ML is bound to become pervasive in a wide range of practical uses, including many that directly affect humans. Unfortunately, the operation of the most successful ML models is incomprehensible for human decision makers. As a result, the use of ML models, especially in high-risk and safety-critical settings is not without concern. In recent years, there have been efforts on devising approaches for explaining ML models. Most of these efforts have focused on so-called model-agnostic approaches. However, all model-agnostic and related approaches offer no guarantees of rigor, hence being referred to as non-formal. For example, such non-formal explanations can be consistent with different predictions, which renders them useless in practice. This paper overviews the ongoing research efforts on computing rigorous model-based explanations of ML models; these being referred to as formal explanations. These efforts encompass a variety of topics, that include the actual definitions of explanations, the characterization of the complexity of computing explanations, the currently best logical encodings for reasoning about different ML models, and also how to make explanations interpretable for human decision makers, among others.
Submission history
From: Joao Marques-Silva [view email][v1] Mon, 24 Oct 2022 13:43:07 GMT (108kb)
[v2] Wed, 25 Jan 2023 08:47:49 GMT (124kb)
[v3] Sun, 29 Jan 2023 23:57:14 GMT (124kb)
Link back to: arXiv, form interface, contact.