Title: Embedding Comparator: Visualizing Differences in Global Structure and Local Neighborhoods via Small Multiples

Abstract: Embeddings -- mappings from high-dimensional discrete input to lower-dimensional continuous vector spaces -- have been widely adopted in machine learning, linguistics, and computational biology as they often surface interesting and unexpected domain semantics. Through semi-structured interviews with embedding model researchers and practitioners, we find that current tools poorly support a central concern: comparing different embeddings when developing fairer, more robust models. In response, we present the Embedding Comparator, an interactive system that balances gaining an overview of the embedding spaces with making fine-grained comparisons of local neighborhoods. For a pair of models, we compute the similarity of the k-nearest neighbors of every embedded object, and visualize the results as Local Neighborhood Dominoes: small multiples that facilitate rapid comparisons. Using case studies, we illustrate the types of insights the Embedding Comparator reveals, including how fine-tuning embeddings changes semantics, how language changes over time, and how training data differences affect two seemingly similar models.
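
The per-object neighborhood comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not state which distance metric or set-similarity measure is used, so cosine similarity for the nearest-neighbor search and Jaccard overlap between neighbor sets are assumptions made here for illustration. The function names `knn_indices` and `neighborhood_similarity` are hypothetical and are not taken from the Embedding Comparator itself. The sketch also assumes both models embed the same objects in the same row order.

```python
import numpy as np


def knn_indices(emb, k):
    """Return the indices of the k nearest neighbors of each row.

    Assumption: neighbors are ranked by cosine similarity.
    """
    # Normalize rows so that dot products equal cosine similarities.
    norms = np.linalg.norm(emb, axis=1, keepdims=True)
    unit = emb / np.clip(norms, 1e-12, None)
    sims = unit @ unit.T
    # Exclude each object from its own neighbor list.
    np.fill_diagonal(sims, -np.inf)
    # Sort by descending similarity and keep the top k columns.
    return np.argsort(-sims, axis=1)[:, :k]


def neighborhood_similarity(emb_a, emb_b, k=5):
    """Per-object overlap between k-NN sets in two embedding spaces.

    Assumption: overlap is measured with the Jaccard index
    (intersection size divided by union size).
    """
    nn_a = knn_indices(emb_a, k)
    nn_b = knn_indices(emb_b, k)
    scores = np.empty(len(emb_a))
    for i, (a, b) in enumerate(zip(nn_a, nn_b)):
        set_a, set_b = set(a.tolist()), set(b.tolist())
        scores[i] = len(set_a & set_b) / len(set_a | set_b)
    return scores
```

Under these assumptions, a score of 1.0 means an object has the same k neighbors in both models, and a score of 0.0 means the two neighbor sets are disjoint. Sorting objects by ascending score would surface those whose local neighborhoods differ most between the two models.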
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:1912.04853 [cs.HC]
  (or arXiv:1912.04853v1 [cs.HC] for this version)

Submission history

From: Brandon Carter
[v1] Tue, 10 Dec 2019 17:46:43 GMT (2775kb,D)
[v2] Sat, 6 Mar 2021 21:28:57 GMT (4133kb,D)
[v3] Fri, 4 Mar 2022 14:51:40 GMT (9630kb,D)