TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency

Dieng, Adji B.; Wang, Chong; Gao, Jianfeng; Paisley, John

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1611

Computer Science > Computation and Language

Title: TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency

Authors: Adji B. Dieng, Chong Wang, Jianfeng Gao, John Paisley

(Submitted on 5 Nov 2016 (v1), last revised 27 Feb 2017 (this version, v2))

Abstract: In this paper, we propose TopicRNN, a recurrent neural network (RNN)-based language model designed to directly capture the global semantic meaning relating words in a document via latent topics. Because of their sequential nature, RNNs are good at capturing the local structure of a word sequence - both semantic and syntactic - but might face difficulty remembering long-range dependencies. Intuitively, these long-range dependencies are of semantic nature. In contrast, latent topic models are able to capture the global underlying semantic structure of a document but do not account for word ordering. The proposed TopicRNN model integrates the merits of RNNs and latent topic models: it captures local (syntactic) dependencies using an RNN and global (semantic) dependencies using latent topics. Unlike previous work on contextual RNN language modeling, our model is learned end-to-end. Empirical results on word prediction show that TopicRNN outperforms existing contextual RNN baselines. In addition, TopicRNN can be used as an unsupervised feature extractor for documents. We do this for sentiment analysis on the IMDB movie review dataset and report an error rate of $6.28\%$. This is comparable to the state-of-the-art $5.91\%$ resulting from a semi-supervised approach. Finally, TopicRNN also yields sensible topics, making it a useful alternative to document models such as latent Dirichlet allocation.

Comments:	International Conference on Learning Representations
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1611.01702 [cs.CL]
	(or arXiv:1611.01702v2 [cs.CL] for this version)

Submission history

From: Adji Bousso Dieng [view email]
[v1] Sat, 5 Nov 2016 21:25:07 GMT (4054kb,D)
[v2] Mon, 27 Feb 2017 03:03:38 GMT (4850kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1611.01702

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency

Submission history