Natural Language Descriptions of Deep Visual Features

Hernandez, Evan; Schwettmann, Sarah; Bau, David; Bagashvili, Teona; Torralba, Antonio; Andreas, Jacob

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2201

Computer Science > Computer Vision and Pattern Recognition

Title: Natural Language Descriptions of Deep Visual Features

Authors: Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas

(Submitted on 26 Jan 2022 (v1), last revised 18 Apr 2022 (this version, v2))

Abstract: Some neurons in deep networks specialize in recognizing highly specific perceptual, structural, or semantic features of inputs. In computer vision, techniques exist for identifying neurons that respond to individual concept categories like colors, textures, and object classes. But these techniques are limited in scope, labeling only a small subset of neurons and behaviors in any network. Is a richer characterization of neuron-level computation possible? We introduce a procedure (called MILAN, for mutual-information-guided linguistic annotation of neurons) that automatically labels neurons with open-ended, compositional, natural language descriptions. Given a neuron, MILAN generates a description by searching for a natural language string that maximizes pointwise mutual information with the image regions in which the neuron is active. MILAN produces fine-grained descriptions that capture categorical, relational, and logical structure in learned features. These descriptions obtain high agreement with human-generated feature descriptions across a diverse set of model architectures and tasks, and can aid in understanding and controlling learned models. We highlight three applications of natural language neuron descriptions. First, we use MILAN for analysis, characterizing the distribution and importance of neurons selective for attribute, category, and relational information in vision models. Second, we use MILAN for auditing, surfacing neurons sensitive to human faces in datasets designed to obscure them. Finally, we use MILAN for editing, improving robustness in an image classifier by deleting neurons sensitive to text features spuriously correlated with class labels.

Comments:	To be published as a conference paper at ICLR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2201.11114 [cs.CV]
	(or arXiv:2201.11114v2 [cs.CV] for this version)

Submission history

From: Evan Hernandez [view email]
[v1] Wed, 26 Jan 2022 18:48:02 GMT (32392kb,D)
[v2] Mon, 18 Apr 2022 17:31:20 GMT (32570kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.11114v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Natural Language Descriptions of Deep Visual Features

Submission history