Interactive Re-Fitting as a Technique for Improving Word Embeddings

Powell, James; Sentz, Kari

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2010

Computer Science > Computation and Language

Title: Interactive Re-Fitting as a Technique for Improving Word Embeddings

Authors: James Powell, Kari Sentz

(Submitted on 30 Sep 2020)

Abstract: Word embeddings are a fixed, distributional representation of the context of words in a corpus learned from word co-occurrences. While word embeddings have proven to have many practical uses in natural language processing tasks, they reflect the attributes of the corpus upon which they are trained. Recent work has demonstrated that post-processing of word embeddings to apply information found in lexical dictionaries can improve their quality. We build on this post-processing technique by making it interactive. Our approach makes it possible for humans to adjust portions of a word embedding space by moving sets of words closer to one another. One motivating use case for this capability is to enable users to identify and reduce the presence of bias in word embeddings. Our approach allows users to trigger selective post-processing as they interact with and assess potential bias in word embeddings.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2010.00121 [cs.CL]
	(or arXiv:2010.00121v1 [cs.CL] for this version)

Submission history

From: James Powell [view email]
[v1] Wed, 30 Sep 2020 21:54:22 GMT (83kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.00121

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Interactive Re-Fitting as a Technique for Improving Word Embeddings

Submission history