Title: Teaching Machines to Code: Neural Markup Generation with Visual Attention

Abstract: We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio, we construct an image-to-markup model that learns to produce syntactically and semantically correct LaTeX markup code over 150 words long and achieves a BLEU score of 89%, improving upon the previous state of the art for the Im2Latex problem. We also demonstrate with heat-map visualizations how attention helps in interpreting the model and how it can pinpoint (detect and localize) symbols on the image accurately, even though the model was trained without any bounding-box data.
Comments: For datasets, visualizations and ancillary material see: this https URL . For source code go to: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as: arXiv:1802.05415 [cs.LG]
  (or arXiv:1802.05415v2 [cs.LG] for this version)

Submission history

From: Sumeet Sohan Singh
[v1] Thu, 15 Feb 2018 06:17:51 GMT (614kb,D)
[v2] Fri, 15 Jun 2018 21:36:10 GMT (1046kb,D)