Current browse context:
cs.CL
Change to browse by:
References & Citations
Computer Science > Computation and Language
Title: Corrected CBOW Performs as well as Skip-gram
(Submitted on 30 Dec 2020 (v1), last revised 9 Nov 2021 (this version, v2))
Abstract: Mikolov et al. (2013a) observed that continuous bag-of-words (CBOW) word embeddings tend to underperform Skip-gram (SG) embeddings, and this finding has been reported in subsequent works. We find that these observations are driven not by fundamental differences in their training objectives, but more likely on faulty negative sampling CBOW implementations in popular libraries such as the official implementation, word2vec.c, and Gensim. We show that after correcting a bug in the CBOW gradient update, one can learn CBOW word embeddings that are fully competitive with SG on various intrinsic and extrinsic tasks, while being many times faster to train.
Submission history
From: Ozan İrsoy [view email][v1] Wed, 30 Dec 2020 21:37:28 GMT (140kb,D)
[v2] Tue, 9 Nov 2021 16:28:00 GMT (162kb,D)
Link back to: arXiv, form interface, contact.