RELAX: Representation Learning Explainability

Wickstrøm, Kristoffer K.; Trosten, Daniel J.; Løkse, Sigurd; Boubekki, Ahcène; Mikalsen, Karl Øyvind; Kampffmeyer, Michael C.; Jenssen, Robert

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 2112

Statistics > Machine Learning

Title: RELAX: Representation Learning Explainability

Authors: Kristoffer K. Wickstrøm, Daniel J. Trosten, Sigurd Løkse, Ahcène Boubekki, Karl Øyvind Mikalsen, Michael C. Kampffmeyer, Robert Jenssen

(Submitted on 19 Dec 2021 (v1), last revised 21 Feb 2022 (this version, v2))

Abstract: Despite the significant improvements that representation learning via self-supervision has led to when learning from unlabeled data, no methods exist that explain what influences the learned representation. We address this need through our proposed approach, RELAX, which is the first approach for attribution-based explanations of representations. Our approach can also model the uncertainty in its explanations, which is essential to produce trustworthy explanations. RELAX explains representations by measuring similarities in the representation space between an input and masked out versions of itself, providing intuitive explanations and significantly outperforming the gradient-based baseline. We provide theoretical interpretations of RELAX and conduct a novel analysis of feature extractors trained using supervised and unsupervised learning, providing insights into different learning strategies. Finally, we illustrate the usability of RELAX in multi-view clustering and highlight that incorporating uncertainty can be essential for providing low-complexity explanations, taking a crucial step towards explaining representations.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2112.10161 [stat.ML]
	(or arXiv:2112.10161v2 [stat.ML] for this version)

Submission history

From: Kristoffer Wickstrøm [view email]
[v1] Sun, 19 Dec 2021 14:51:31 GMT (6504kb,D)
[v2] Mon, 21 Feb 2022 15:36:07 GMT (4813kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2112.10161

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: RELAX: Representation Learning Explainability

Submission history