Nonparametric Topic Modeling with Neural Inference

Ning, Xuefei; Zheng, Yin; Jiang, Zhuxi; Wang, Yu; Yang, Huazhong; Huang, Junzhou

(Submitted on 18 Jun 2018)

Abstract: This work combines nonparametric topic models with Auto-Encoding Variational Bayes (AEVB). We first propose iTM-VAE, in which the topics are treated as trainable parameters and the document-specific topic proportions are obtained by a stick-breaking construction. Inference in iTM-VAE is modeled by neural networks, so it can be computed in a simple feed-forward manner. We also describe how to introduce a hyper-prior into iTM-VAE to model the uncertainty of the prior parameter. The hyper-prior technique is general, and we show that it can be applied to other AEVB-based models to alleviate the collapse-to-prior problem. We further propose HiTM-VAE, in which the document-specific topic distributions are generated hierarchically. HiTM-VAE is more flexible and can generate topic distributions with better variability. Experimental results on the 20News and Reuters RCV1-V2 datasets show that the proposed models significantly outperform state-of-the-art baselines. Experiments also confirm the advantages of the hyper-prior technique and the hierarchical model construction.
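
The stick-breaking construction mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the truncation level, the concentration value `alpha`, and the choice to draw break fractions from a Beta(1, alpha) prior (the standard GEM construction) are all assumptions made for illustration. In the paper the fractions come from a neural inference network, which is not reproduced here.

```python
import random


def stick_breaking(fractions):
    """Map break fractions v_1..v_{K-1} in (0, 1) to K proportions summing to 1.

    pi_k = v_k * prod_{j<k} (1 - v_j); the final piece takes the remainder,
    which is the usual way to truncate the infinite construction.
    """
    proportions = []
    remaining = 1.0  # length of the stick not yet assigned to any topic
    for v in fractions:
        proportions.append(v * remaining)
        remaining *= 1.0 - v
    proportions.append(remaining)  # last topic absorbs whatever is left
    return proportions


def sample_truncated_gem(alpha, truncation, rng):
    """Draw topic proportions from a truncated GEM(alpha) prior (illustrative)."""
    fractions = [rng.betavariate(1.0, alpha) for _ in range(truncation - 1)]
    return stick_breaking(fractions)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
pi = sample_truncated_gem(alpha=5.0, truncation=20, rng=rng)
```

A larger `alpha` tends to produce smaller break fractions and therefore spreads mass over more topics. The value of this prior parameter is the quantity whose uncertainty the hyper-prior described in the abstract is meant to model.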

Comments:	11 pages, 2 figures
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:1806.06583 [cs.CL]
	(or arXiv:1806.06583v1 [cs.CL] for this version)

Submission history

From: Yin Zheng
[v1] Mon, 18 Jun 2018 10:22:18 GMT (1154kb,D)
