Teaching Machines to Code: Neural Markup Generation with Visual Attention

Singh, Sumeet S.

Title: Teaching Machines to Code: Neural Markup Generation with Visual Attention

Authors: Sumeet S. Singh

(Submitted on 15 Feb 2018 (v1), last revised 15 Jun 2018 (this version, v2))

Abstract: We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio, we construct an image-to-markup model that learns to produce syntactically and semantically correct LaTeX markup code over 150 words long and achieves a BLEU score of 89%, improving upon the previous state of the art for the Im2Latex problem. We also demonstrate with heat-map visualization how attention helps in interpreting the model and can pinpoint (detect and localize) symbols on the image accurately despite having been trained without any bounding box data.

Comments:	For datasets, visualizations and ancillary material see: this https URL . For source code go to: this https URL
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1802.05415 [cs.LG]
	(or arXiv:1802.05415v2 [cs.LG] for this version)

Submission history

From: Sumeet Sohan Singh
[v1] Thu, 15 Feb 2018 06:17:51 GMT (614kb,D)
[v2] Fri, 15 Jun 2018 21:36:10 GMT (1046kb,D)
