Acceleration of multiple precision matrix multiplication based on multi-component floating-point arithmetic using AVX2

Kouya, Tomonori

doi:10.1007/978-3-030-86976-2_14

(Submitted on 17 Jan 2021)

Abstract: In this paper, we report the results of accelerating multi-binary64-type multiple precision matrix multiplication with AVX2. We target double-double (DD), triple-double (TD), and quad-double (QD) precision arithmetic, which are built on certain types of error-free transformation (EFT) arithmetic. We implement SIMDized EFT functions, which compute with four binary64 numbers simultaneously in an x86_64 computing environment, and use them to develop SIMDized DD, TD, and QD additions and multiplications. In addition, we adopt AVX2 load/store functions to speed up reading and storing matrix elements from/to memory. Owing to these combined techniques, our multiple precision matrix multiplications run more than three times faster than the non-accelerated versions. Our accelerated matrix multiplication also changes the performance of parallelization with OpenMP.
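The abstract names EFT functions and DD arithmetic without listing them. As a rough illustration only, the sketch below shows the standard textbook building blocks (Knuth's TwoSum, Dekker's product with Veltkamp splitting) and a simple DD addition and multiplication built on them. It is scalar Python, not the paper's AVX2 code, and every function name (`two_sum`, `quick_two_sum`, `split`, `two_prod`, `dd_add`, `dd_mul`) is a hypothetical choice for this example. A SIMDized version would apply the same operation sequence to four binary64 lanes at once.

```python
from fractions import Fraction  # used only to check exactness in the usage example


def two_sum(a, b):
    """Knuth's TwoSum: return (s, e) with s = fl(a + b) and a + b == s + e exactly."""
    s = a + b
    bb = s - a
    e = (a - (s - bb)) + (b - bb)
    return s, e


def quick_two_sum(a, b):
    """Dekker's FastTwoSum: same result as two_sum, but assumes |a| >= |b|."""
    s = a + b
    e = b - (s - a)
    return s, e


def split(a):
    """Veltkamp split of a binary64 number into high and low halves (a == hi + lo)."""
    t = 134217729.0 * a  # 2**27 + 1
    hi = t - (t - a)
    lo = a - hi
    return hi, lo


def two_prod(a, b):
    """Dekker's TwoProd: return (p, e) with p = fl(a * b) and a * b == p + e exactly
    (assuming no overflow or underflow)."""
    p = a * b
    ah, al = split(a)
    bh, bl = split(b)
    e = ((ah * bh - p) + ah * bl + al * bh) + al * bl
    return p, e


def dd_add(x, y):
    """Simplified ("sloppy") DD addition of x = (hi, lo) and y = (hi, lo)."""
    s, e = two_sum(x[0], y[0])
    e += x[1] + y[1]
    return quick_two_sum(s, e)


def dd_mul(x, y):
    """Simplified DD multiplication of x = (hi, lo) and y = (hi, lo)."""
    p, e = two_prod(x[0], y[0])
    e += x[0] * y[1] + x[1] * y[0]
    return quick_two_sum(p, e)


if __name__ == "__main__":
    s, e = two_sum(1.0, 2.0 ** -60)
    # The rounding error of the binary64 sum is recovered exactly.
    print(Fraction(s) + Fraction(e) == Fraction(1.0) + Fraction(2.0 ** -60))
```

The DD routines here are the simplified variants, which trade some accuracy in the low component for fewer operations; TD and QD arithmetic extend the same idea to three and four components.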

Subjects:	Numerical Analysis (math.NA); Mathematical Software (cs.MS); Performance (cs.PF)
DOI:	10.1007/978-3-030-86976-2_14
Cite as:	arXiv:2101.06584 [math.NA]
	(or arXiv:2101.06584v1 [math.NA] for this version)

Submission history

From: Tomonori Kouya
[v1] Sun, 17 Jan 2021 04:05:13 GMT (902kb,D)
