PolyLM: Learning about Polysemy through Language Modeling

Ansell, Alan; Bravo-Marquez, Felipe; Pfahringer, Bernhard

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2101

Change to browse by:

Computer Science > Computation and Language

Title: PolyLM: Learning about Polysemy through Language Modeling

Authors: Alan Ansell, Felipe Bravo-Marquez, Bernhard Pfahringer

(Submitted on 25 Jan 2021)

Abstract: To avoid the "meaning conflation deficiency" of word embeddings, a number of models have aimed to embed individual word senses. These methods at one time performed well on tasks such as word sense induction (WSI), but they have since been overtaken by task-specific techniques which exploit contextualized embeddings. However, sense embeddings and contextualization need not be mutually exclusive. We introduce PolyLM, a method which formulates the task of learning sense embeddings as a language modeling problem, allowing contextualization techniques to be applied. PolyLM is based on two underlying assumptions about word senses: firstly, that the probability of a word occurring in a given context is equal to the sum of the probabilities of its individual senses occurring; and secondly, that for a given occurrence of a word, one of its senses tends to be much more plausible in the context than the others. We evaluate PolyLM on WSI, showing that it performs considerably better than previous sense embedding techniques, and matches the current state-of-the-art specialized WSI method despite having six times fewer parameters. Code and pre-trained models are available at this https URL

Comments:	EACL 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2101.10448 [cs.CL]
	(or arXiv:2101.10448v1 [cs.CL] for this version)

Submission history

From: Alan Ansell [view email]
[v1] Mon, 25 Jan 2021 22:09:12 GMT (108kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2101.10448

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: PolyLM: Learning about Polysemy through Language Modeling

Submission history