Title: An Ensemble Method to Produce High-Quality Word Embeddings (2016)

Abstract: A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging their information into a common representation with a large, multilingual vocabulary. The embeddings it produces achieve state-of-the-art performance on many word-similarity evaluations. Its score of $\rho = .596$ on an evaluation of rare words (Luong et al., 2013) is 16% higher than that of the previous best known system.
Comments: Corrected author name, revised reproducibility instructions that didn't work anymore. 12 pages, 3 figures
Subjects: Computation and Language (cs.CL)
ACM classes: I.2.7
Cite as: arXiv:1604.01692 [cs.CL]
  (or arXiv:1604.01692v2 [cs.CL] for this version)

Submission history

From: Robyn Speer
[v1] Wed, 6 Apr 2016 16:58:35 GMT (96kb,D)
[v2] Thu, 19 Dec 2019 17:29:15 GMT (97kb,D)
