Sparse Overcomplete Word Vector Representations

Faruqui, Manaal; Tsvetkov, Yulia; Yogatama, Dani; Dyer, Chris; Smith, Noah

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1506

Change to browse by:

Computer Science > Computation and Language

Title: Sparse Overcomplete Word Vector Representations

Authors: Manaal Faruqui, Yulia Tsvetkov, Dani Yogatama, Chris Dyer, Noah Smith

(Submitted on 5 Jun 2015)

Abstract: Current distributed representations of words show little resemblance to theories of lexical semantics. The former are dense and uninterpretable, the latter largely based on familiar, discrete classes (e.g., supersenses) and relations (e.g., synonymy and hypernymy). We propose methods that transform word vectors into sparse (and optionally binary) vectors. The resulting representations are more similar to the interpretable features typically used in NLP, though they are discovered automatically from raw corpora. Because the vectors are highly sparse, they are computationally easy to work with. Most importantly, we find that they outperform the original vectors on benchmark tasks.

Comments:	Proceedings of ACL 2015
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1506.02004 [cs.CL]
	(or arXiv:1506.02004v1 [cs.CL] for this version)

Submission history

From: Manaal Faruqui [view email]
[v1] Fri, 5 Jun 2015 18:20:43 GMT (3064kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1506.02004

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Sparse Overcomplete Word Vector Representations

Submission history