Can Neural Image Captioning be Controlled via Forced Attention?

Sadler, Philipp; Scheffler, Tatjana; Schlangen, David

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1911

Computer Science > Computation and Language

Title: Can Neural Image Captioning be Controlled via Forced Attention?

Authors: Philipp Sadler, Tatjana Scheffler, David Schlangen

(Submitted on 10 Nov 2019)

Abstract: Learned dynamic weighting of the conditioning signal (attention) has been shown to improve neural language generation in a variety of settings. The weights applied when generating a particular output sequence have also been viewed as providing a potentially explanatory insight into the internal workings of the generator. In this paper, we reverse the direction of this connection and ask whether through the control of the attention of the model we can control its output. Specifically, we take a standard neural image captioning model that uses attention, and fix the attention to pre-determined areas in the image. We evaluate whether the resulting output is more likely to mention the class of the object in that area than the normally generated caption. We introduce three effective methods to control the attention and find that these are producing expected results in up to 28.56% of the cases.

Comments:	Accepted shortpaper for the 12th International Conference on Natural Language Generation
Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1911.03936 [cs.CL]
	(or arXiv:1911.03936v1 [cs.CL] for this version)

Submission history

From: Philipp Sadler [view email]
[v1] Sun, 10 Nov 2019 14:00:27 GMT (4381kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.03936

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Can Neural Image Captioning be Controlled via Forced Attention?

Submission history