Title: A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models

Abstract: Word representation has always been an important research area in the history of natural language processing (NLP). Understanding such complex text data is imperative, given that it is rich in information and can be used widely across various applications. In this survey, we explore different word representation models and their power of expression, from the classical to modern-day state-of-the-art (SOTA) word representation language models (LMs). We describe a variety of text representation methods and model designs that have blossomed in the context of NLP, including SOTA LMs. These models can transform large volumes of text into effective vector representations that preserve the text's semantic information. Further, such representations can be utilized by various machine learning (ML) algorithms for a variety of NLP-related tasks. Finally, this survey briefly discusses the commonly used ML- and deep learning (DL)-based classifiers, evaluation metrics, and the applications of these word embeddings in different NLP tasks.
Subjects: Computation and Language (cs.CL)
Journal reference: ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 2020
Cite as: arXiv:2010.15036 [cs.CL]
  (or arXiv:2010.15036v1 [cs.CL] for this version)

Submission history

From: Usman Naseem
[v1] Wed, 28 Oct 2020 15:15:13 GMT (2272kb,D)