We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: A Comprehensive Empirical Evaluation of Existing Word Embedding Approaches

Abstract: Vector-based word representations help countless Natural Language Processing (NLP) tasks capture the language's semantic and syntactic regularities. In this paper, we present the characteristics of existing word embedding approaches and analyze them with regard to many classification tasks. We categorize the methods into two main groups - Traditional approaches mostly use matrix factorization to produce word representations, and they are not able to capture the semantic and syntactic regularities of the language very well. On the other hand, Neural-network-based approaches can capture sophisticated regularities of the language and preserve the word relationships in the generated word representations. We report experimental results on multiple classification tasks and highlight the scenarios where one approach performs better than the rest.
Comments: 28 pages, 3 figures and 10 tables
Subjects: Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
Cite as: arXiv:2303.07196 [cs.CL]
  (or arXiv:2303.07196v2 [cs.CL] for this version)

Submission history

From: Obaidullah Zaland [view email]
[v1] Mon, 13 Mar 2023 15:34:19 GMT (151kb,D)
[v2] Sat, 2 Mar 2024 19:19:44 GMT (151kb,D)

Link back to: arXiv, form interface, contact.