We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Tactics to Directly Map CNN graphs on Embedded FPGAs

Abstract: Deep Convolutional Neural Networks (CNNs) are the state-of-the-art in image classification. Since CNN feed forward propagation involves highly regular parallel computation, it benefits from a significant speed-up when running on fine grain parallel programmable logic devices. As a consequence, several studies have proposed FPGA-based accelerators for CNNs. However, because of the large computationalpower required by CNNs, none of the previous studies has proposed a direct mapping of the CNN onto the physical resources of an FPGA, allocating each processing actor to its own hardware instance.In this paper, we demonstrate the feasibility of the so called direct hardware mapping (DHM) and discuss several tactics we explore to make DHM usable in practice. As a proof of concept, we introduce the HADDOC2 open source tool, that automatically transforms a CNN description into a synthesizable hardware description with platform-independent direct hardware mapping.
Comments: IEEE Embedded Systems Letters, Institute of Electrical and Electronics Engineers, A Para\^itre, pp.1 - 1. arXiv admin note: text overlap with arXiv:1705.04543
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1109/LES.2017.2743247
Cite as: arXiv:1712.04322 [cs.DC]
  (or arXiv:1712.04322v1 [cs.DC] for this version)

Submission history

From: Francois Berry [view email]
[v1] Mon, 20 Nov 2017 08:13:39 GMT (207kb)

Link back to: arXiv, form interface, contact.