Computer Science > Computation and Language
Title: An Ensemble Method to Produce High-Quality Word Embeddings (2016)
(Submitted on 6 Apr 2016 (v1), last revised 19 Dec 2019 (this version, v2))
Abstract: A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging their information into a common representation with a large, multilingual vocabulary. The embeddings it produces achieve state-of-the-art performance on many word-similarity evaluations. Its score of $\rho = .596$ on an evaluation of rare words (Luong et al., 2013) is 16% higher than the previous best known system.
Submission history
From: Robyn Speer
[v1] Wed, 6 Apr 2016 16:58:35 GMT (96kb,D)
[v2] Thu, 19 Dec 2019 17:29:15 GMT (97kb,D)