Inductive Document Network Embedding with Topic-Word Attention

Brochier, Robin; Guille, Adrien; Velcin, Julien

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2001

Computer Science > Machine Learning

Title: Inductive Document Network Embedding with Topic-Word Attention

Authors: Robin Brochier, Adrien Guille, Julien Velcin

(Submitted on 10 Jan 2020)

Abstract: Document network embedding aims at learning representations for a structured text corpus i.e. when documents are linked to each other. Recent algorithms extend network embedding approaches by incorporating the text content associated with the nodes in their formulations. In most cases, it is hard to interpret the learned representations. Moreover, little importance is given to the generalization to new documents that are not observed within the network. In this paper, we propose an interpretable and inductive document network embedding method. We introduce a novel mechanism, the Topic-Word Attention (TWA), that generates document representations based on the interplay between word and topic representations. We train these word and topic vectors through our general model, Inductive Document Network Embedding (IDNE), by leveraging the connections in the document network. Quantitative evaluations show that our approach achieves state-of-the-art performance on various networks and we qualitatively show that our model produces meaningful and interpretable representations of the words, topics and documents.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as:	arXiv:2001.03369 [cs.LG]
	(or arXiv:2001.03369v1 [cs.LG] for this version)

Submission history

From: Robin Brochier [view email]
[v1] Fri, 10 Jan 2020 10:14:07 GMT (161kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.03369v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Inductive Document Network Embedding with Topic-Word Attention

Submission history