Explaining Neural Matrix Factorization with Gradient Rollback

Lawrence, Carolin; Sztyler, Timo; Niepert, Mathias

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2010

Computer Science > Machine Learning

Title: Explaining Neural Matrix Factorization with Gradient Rollback

Authors: Carolin Lawrence, Timo Sztyler, Mathias Niepert

(Submitted on 12 Oct 2020 (v1), last revised 15 Dec 2020 (this version, v4))

Abstract: Explaining the predictions of neural black-box models is an important problem, especially when such models are used in applications where user trust is crucial. Estimating the influence of training examples on a learned neural model's behavior allows us to identify training examples most responsible for a given prediction and, therefore, to faithfully explain the output of a black-box model. The most generally applicable existing method is based on influence functions, which scale poorly for larger sample sizes and models.
We propose gradient rollback, a general approach for influence estimation, applicable to neural models where each parameter update step during gradient descent touches a smaller number of parameters, even if the overall number of parameters is large. Neural matrix factorization models trained with gradient descent are part of this model class. These models are popular and have found a wide range of applications in industry. Especially knowledge graph embedding methods, which belong to this class, are used extensively. We show that gradient rollback is highly efficient at both training and test time. Moreover, we show theoretically that the difference between gradient rollback's influence approximation and the true influence on a model's behavior is smaller than known bounds on the stability of stochastic gradient descent. This establishes that gradient rollback is robustly estimating example influence. We also conduct experiments which show that gradient rollback provides faithful explanations for knowledge base completion and recommender datasets.

Comments:	35th AAAI Conference on Artificial Intelligence, 2021. Includes Appendix
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2010.05516 [cs.LG]
	(or arXiv:2010.05516v4 [cs.LG] for this version)

Submission history

From: Carolin Lawrence [view email]
[v1] Mon, 12 Oct 2020 08:15:54 GMT (169kb,D)
[v2] Wed, 14 Oct 2020 07:27:32 GMT (169kb,D)
[v3] Mon, 9 Nov 2020 08:50:56 GMT (169kb,D)
[v4] Tue, 15 Dec 2020 07:01:30 GMT (164kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.05516

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Explaining Neural Matrix Factorization with Gradient Rollback

Submission history