Deconstructing and reconstructing word embedding algorithms

Newell, Edward; Kenyon-Dean, Kian; Cheung, Jackie Chi Kit

Authors: Edward Newell, Kian Kenyon-Dean, Jackie Chi Kit Cheung

(Submitted on 29 Nov 2019)

Abstract: Uncontextualized word embeddings are reliable feature representations of words used to obtain high quality results for various NLP applications. Given the historical success of word embeddings in NLP, we propose a retrospective on some of the most well-known word embedding algorithms. In this work, we deconstruct Word2vec, GloVe, and others into a common form, unveiling some of the necessary and sufficient conditions for making performant word embeddings. We find that each algorithm: (1) fits vector-covector dot products to approximate pointwise mutual information (PMI); and (2) modulates the loss gradient to balance weak and strong signals. We demonstrate that these two algorithmic features are sufficient conditions to construct a novel word embedding algorithm, Hilbert-MLE. We find that its embeddings perform as well as or better than those of other algorithms across 17 intrinsic and extrinsic datasets.
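To make the quantity in point (1) concrete, the following is a minimal sketch of the standard PMI definition, PMI(w, c) = log( p(w, c) / (p(w) p(c)) ), estimated from raw co-occurrence counts. It is not the paper's Hilbert-MLE algorithm and does not reproduce its loss or gradient modulation. The function name `pmi_table` and the toy `pairs` data are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def pmi_table(pairs):
    """Return {(word, context): PMI} for every observed (word, context) pair.

    Hypothetical helper for illustration; probabilities are plain
    maximum-likelihood estimates from the pair counts.
    """
    pair_counts = Counter(pairs)
    word_counts = Counter(w for w, _ in pairs)
    context_counts = Counter(c for _, c in pairs)
    total = len(pairs)

    table = {}
    for (w, c), n_wc in pair_counts.items():
        # PMI(w, c) = log( p(w, c) / (p(w) * p(c)) )
        #           = log( N_wc * N / (N_w * N_c) )
        table[(w, c)] = math.log(
            n_wc * total / (word_counts[w] * context_counts[c])
        )
    return table


# Toy co-occurrence data (assumed, not from the paper).
pairs = [
    ("cat", "purrs"),
    ("cat", "purrs"),
    ("dog", "barks"),
    ("dog", "purrs"),
]
table = pmi_table(pairs)

for pair, value in sorted(table.items()):
    print(pair, round(value, 3))
```

In the abstract's framing, an embedding algorithm would then learn a vector for each word and a covector for each context so that their dot product approximates these PMI values. Pairs that co-occur more often than chance get positive PMI, and pairs that co-occur less often than chance get negative PMI.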

Comments:	15 pages
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1911.13280 [cs.CL]
	(or arXiv:1911.13280v1 [cs.CL] for this version)

Submission history

From: Kian Kenyon-Dean [view email]
[v1] Fri, 29 Nov 2019 18:27:36 GMT (302kb,D)
