Not Just a Black Box: Learning Important Features Through Propagating Activation Differences

Shrikumar, Avanti; Greenside, Peyton; Shcherbina, Anna; Kundaje, Anshul

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1605

Computer Science > Machine Learning

Title: Not Just a Black Box: Learning Important Features Through Propagating Activation Differences

Authors: Avanti Shrikumar, Peyton Greenside, Anna Shcherbina, Anshul Kundaje

(Submitted on 5 May 2016 (v1), revised 8 May 2016 (this version, v2), latest version 11 Apr 2017 (v3))

Abstract: The purported "black box" nature of neural networks is a barrier to adoption in applications where interpretability is essential. Here we present DeepLIFT (Learning Important FeaTures), an efficient and effective method for computing importance scores in a neural network. DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. We apply DeepLIFT to models trained on natural images and genomic data, and show significant advantages over gradient-based methods.

Comments:	6 pages, 3 figures, a version of this is under review for the ICML Workshop on Human Interpretability in Machine Learning
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1605.01713 [cs.LG]
	(or arXiv:1605.01713v2 [cs.LG] for this version)

Submission history

From: Avanti Shrikumar [view email]
[v1] Thu, 5 May 2016 19:52:32 GMT (1441kb,D)
[v2] Sun, 8 May 2016 21:34:42 GMT (1441kb,D)
[v3] Tue, 11 Apr 2017 15:58:48 GMT (1624kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1605.01713v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Not Just a Black Box: Learning Important Features Through Propagating Activation Differences

Submission history