We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.PF

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Numerical Analysis

Title: Acceleration of multiple precision matrix multiplication based on multi-component floating-point arithmetic using AVX2

Abstract: In this paper, we report the results obtained from the acceleration of multi-binary64-type multiple precision matrix multiplication with AVX2. We target double-double (DD), triple-double (TD), and quad-double (QD) precision arithmetic designed by certain types of error-free transformation (EFT) arithmetic. Furthermore, we implement SIMDized EFT functions, which simultaneously compute with four binary64 numbers on x86_64 computing environment, and by using help of them, we also develop SIMDized DD, TD, and QD additions and multiplications. In addition, AVX2 load/store functions were adopted to efficiently speed up reading and storing matrix elements from/to memory. Owing to these combined techniques, our implemented multiple precision matrix multiplications have been accelerated more than three times compared with non-accelerated ones. Our accelerated matrix multiplication modifies the performance of parallelization with OpenMP.
Subjects: Numerical Analysis (math.NA); Mathematical Software (cs.MS); Performance (cs.PF)
DOI: 10.1007/978-3-030-86976-2_14
Cite as: arXiv:2101.06584 [math.NA]
  (or arXiv:2101.06584v1 [math.NA] for this version)

Submission history

From: Tomonori Kouya [view email]
[v1] Sun, 17 Jan 2021 04:05:13 GMT (902kb,D)

Link back to: arXiv, form interface, contact.