Computer Science > Computation and Language
Title: A Self-Explainable Stylish Image Captioning Framework via Multi-References
(Submitted on 20 Oct 2021 (v1), last revised 18 Nov 2021 (this version, v2))
Abstract: In this paper, we propose to build a stylish image captioning model through a Multi-style Multi-modality mechanism (2M). We demonstrate that 2M yields an effective stylish captioner, and that the multi-references produced by the model can also help explain the model by identifying erroneous input features on faulty examples. We show how the 2M mechanism can be used to build stylish captioning models and how those models can in turn provide explanations of their likely errors.
Submission history
From: Brent Harrison
[v1] Wed, 20 Oct 2021 18:00:40 GMT (10328kb,D)
[v2] Thu, 18 Nov 2021 18:39:15 GMT (10328kb,D)