Visualizing and Understanding Neural Models in NLP

Li, Jiwei; Chen, Xinlei; Hovy, Eduard; Jurafsky, Dan

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1506

Change to browse by:

Computer Science > Computation and Language

Title: Visualizing and Understanding Neural Models in NLP

Authors: Jiwei Li, Xinlei Chen, Eduard Hovy, Dan Jurafsky

(Submitted on 2 Jun 2015 (v1), last revised 8 Jan 2016 (this version, v2))

Abstract: While neural networks have been successfully applied to many NLP tasks the resulting vector-based models are very difficult to interpret. For example it's not clear how they achieve {\em compositionality}, building sentence meaning from the meanings of words and phrases. In this paper we describe four strategies for visualizing compositionality in neural models for NLP, inspired by similar work in computer vision. We first plot unit values to visualize compositionality of negation, intensification, and concessive clauses, allow us to see well-known markedness asymmetries in negation. We then introduce three simple and straightforward methods for visualizing a unit's {\em salience}, the amount it contributes to the final composed meaning: (1) gradient back-propagation, (2) the variance of a token from the average word node, (3) LSTM-style gates that measure information flow. We test our methods on sentiment using simple recurrent nets and LSTMs. Our general-purpose methods may have wide applications for understanding compositionality and other semantic properties of deep networks , and also shed light on why LSTMs outperform simple recurrent nets,

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1506.01066 [cs.CL]
	(or arXiv:1506.01066v2 [cs.CL] for this version)

Submission history

From: Jiwei Li [view email]
[v1] Tue, 2 Jun 2015 21:17:31 GMT (1681kb,D)
[v2] Fri, 8 Jan 2016 18:10:22 GMT (2467kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1506.01066

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Visualizing and Understanding Neural Models in NLP

Submission history