Does Multimodality Help Human and Machine for Translation and Image Captioning?

Caglayan, Ozan; Aransa, Walid; Wang, Yaxing; Masana, Marc; García-Martínez, Mercedes; Bougares, Fethi; Barrault, Loïc; van de Weijer, Joost

doi:10.18653/v1/W16-2358

(Submitted on 30 May 2016 (v1), last revised 16 Aug 2016 (this version, v4))

Abstract: This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge. We explored various comparative methods, namely phrase-based systems and attentional recurrent neural network models trained using monomodal or multimodal data. We also performed a human evaluation in order to estimate the usefulness of multimodal data for human machine translation and image description generation. Our systems obtained the best results for both tasks according to the automatic evaluation metrics BLEU and METEOR.

Comments: 7 pages, 2 figures, v4: Small clarification in section 4 title and content
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
DOI: 10.18653/v1/W16-2358
Cite as: arXiv:1605.09186 [cs.CL] (or arXiv:1605.09186v4 [cs.CL] for this version)

Submission history

From: Ozan Çağlayan
[v1] Mon, 30 May 2016 11:47:00 GMT (218kb,D)
[v2] Thu, 2 Jun 2016 13:52:45 GMT (218kb,D)
[v3] Mon, 13 Jun 2016 15:33:11 GMT (218kb,D)
[v4] Tue, 16 Aug 2016 12:11:29 GMT (209kb,D)
