We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.AR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Hardware Architecture

Title: A matrix math facility for Power ISA(TM) processors

Abstract: Power ISA(TM) Version 3.1 has introduced a new family of matrix math instructions, collectively known as the Matrix-Multiply Assist (MMA) facility. The instructions in this facility implement numerical linear algebra operations on small matrices and are meant to accelerate computation-intensive kernels, such as matrix multiplication, convolution and discrete Fourier transform. These instructions have led to a power- and area-efficient implementation of a high throughput math engine in the future POWER10 processor. Performance per core is 4 times better, at constant frequency, than the previous generation POWER9 processor. We also advocate the use of compiler built-ins as the preferred way of leveraging these instructions, which we illustrate through case studies covering matrix multiplication and convolution.
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF); Programming Languages (cs.PL)
Cite as: arXiv:2104.03142 [cs.AR]
  (or arXiv:2104.03142v1 [cs.AR] for this version)

Submission history

From: José Moreira [view email]
[v1] Wed, 7 Apr 2021 14:17:32 GMT (1862kb,D)

Link back to: arXiv, form interface, contact.