Title: Linear Delay-cell Design for Low-energy Delay Multiplication and Accumulation

Authors: Aditya Shukla
Abstract: Evaluating a practical deep neural network (DNN) involves thousands of multiply-and-accumulate (MAC) operations. To extend the superior inference capabilities of DNNs to energy-constrained devices, architectures and circuits that minimize energy-per-MAC must be developed. In this respect, analog delay-based MAC is advantageous for reasons both extrinsic and intrinsic to the MAC implementation: (1) the lower fixed-point precision (1-8 bits) required for DNN evaluation, (2) better dynamic range than charge-based accumulation at smaller technology nodes, and (3) simpler analog-digital interfaces. Implementing DNNs using delay-based MAC requires mixed-signal delay multipliers that accept digitally stored weights and analog voltages as arguments. To this end, a novel, linearly tunable delay-cell is proposed, in which the delay is realized with an inverted MOS capacitor ($C^*$) steadily discharged from an initial charge that depends linearly on the input voltage. The cell is analytically modeled, constraints for its functional validity are determined, and jitter models are developed. Multiple cells with scaled delays, one for each bit of the digital argument, must be cascaded to form the multiplier. To realize this bit-wise delay scaling, a biasing circuit is proposed that generates sub-threshold gate voltages to scale the discharging rate of $C^*$, thereby avoiding area-expensive transistor width-scaling. On applying the constraints and jitter models to a 130 nm technology, the minimum optimal $C^*$ was found to be 2 fF and the maximum number of bits to be 5. Schematic-level simulations show a worst-case energy consumption close to the state of the art, and thus the feasibility of the cell.
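
The cascade described in the abstract can be illustrated with a purely behavioral sketch. It is not the paper's circuit model: it assumes an ideal constant-current discharge, so that a cell's delay is C* times the input voltage divided by the discharge current, and it assumes that the cell for bit i runs at the unit current divided by 2^i and is bypassed (zero delay) when its weight bit is 0. The function names, the 1 nA unit current, and the bypass behavior are hypothetical; only the 2 fF capacitance and the 5-bit limit come from the abstract.

```python
from typing import Sequence

C_STAR = 2e-15   # farads; minimum optimal C* quoted in the abstract
I_UNIT = 1e-9    # amperes; hypothetical discharge current of the bit-0 cell
MAX_BITS = 5     # maximum number of bits quoted in the abstract


def cell_delay(v_in: float, bit_index: int) -> float:
    """Idealized delay of one cell: time to discharge C* at constant current.

    Assumption: the initial charge is C* * v_in (linear in the input voltage)
    and the cell for bit i discharges at I_UNIT / 2**i, so its delay is
    2**i times the bit-0 delay.
    """
    current = I_UNIT / (2 ** bit_index)
    return C_STAR * v_in / current


def multiplier_delay(weight: int, v_in: float, n_bits: int = MAX_BITS) -> float:
    """Total delay of a cascade of n_bits cells encoding weight * v_in.

    Assumption: a cell whose weight bit is 0 is bypassed and adds no delay.
    """
    if not 0 <= weight < 2 ** n_bits:
        raise ValueError("weight does not fit in n_bits")
    total = 0.0
    for i in range(n_bits):
        if (weight >> i) & 1:
            total += cell_delay(v_in, i)
    return total


def mac_delay(weights: Sequence[int], voltages: Sequence[float]) -> float:
    """Accumulate by chaining multipliers: the delays simply add up."""
    if len(weights) != len(voltages):
        raise ValueError("weights and voltages must have the same length")
    return sum(multiplier_delay(w, v) for w, v in zip(weights, voltages))
```

Under these assumptions the accumulated delay is proportional to the dot product of the weights and the input voltages, with scale factor C_STAR / I_UNIT. Non-idealities that the paper addresses, such as jitter and the validity constraints on the cell, are deliberately left out.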
Comments: Keywords: Analog-computing, delay-cell, mixed-signal delay multiplier, multiply-and-accumulate
Subjects: Emerging Technologies (cs.ET); Signal Processing (eess.SP)
Cite as: arXiv:2007.13895 [cs.ET]
  (or arXiv:2007.13895v1 [cs.ET] for this version)

Submission history

From: Aditya Shukla
[v1] Mon, 27 Jul 2020 22:25:00 GMT (266kb)
[v2] Thu, 30 Jul 2020 00:23:21 GMT (266kb)
[v3] Mon, 3 Aug 2020 15:30:17 GMT (556kb)
