We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.MS

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Mathematical Software

Title: High level implementation of geometric multigrid solvers for finite element problems: applications in atmospheric modelling

Abstract: The implementation of efficient multigrid preconditioners for elliptic partial differential equations (PDEs) is a challenge due to the complexity of the resulting algorithms and corresponding computer code. For sophisticated finite element discretisations on unstructured grids an efficient implementation can be very time consuming and requires the programmer to have in-depth knowledge of the mathematical theory, parallel computing and optimisation techniques on manycore CPUs. In this paper we show how the development of bespoke multigrid preconditioners can be simplified significantly by using a framework which allows the expression of the each component of the algorithm at the correct abstraction level. Our approach (1) allows the expression of the finite element problem in a language which is close to the mathematical formulation of the problem, (2) guarantees the automatic generation and efficient execution of parallel optimised low-level computer code and (3) is flexible enough to support different abstraction levels and give the programmer control over details of the preconditioner. We use the composable abstractions of the Firedrake/PyOP2 package to demonstrate the efficiency of this approach for the solution of strongly anisotropic PDEs in atmospheric modelling. The weak formulation of the PDE is expressed in Unified Form Language (UFL) and the lower PyOP2 abstraction layer allows the manual design of computational kernels for a bespoke geometric multigrid preconditioner. We compare the performance of this preconditioner to a single-level method and hypre's BoomerAMG algorithm. The Firedrake/PyOP2 code is inherently parallel and we present a detailed performance analysis for a single node (24 cores) on the ARCHER supercomputer. Our implementation utilises a significant fraction of the available memory bandwidth and shows very good weak scaling on up to 6,144 compute cores.
Comments: 22 pages, 5 figures, 9 tables. Submitted to JCP
Subjects: Mathematical Software (cs.MS); Numerical Analysis (math.NA); Fluid Dynamics (physics.flu-dyn)
MSC classes: 65F08, 65N55, 76M10, 86A10
ACM classes: D.2.2; G.1.3; G.1.8; G.4; J.2
Journal reference: Journal of Computational Physics 327:1-18 (2016)
DOI: 10.1016/j.jcp.2016.09.037
Cite as: arXiv:1605.00492 [cs.MS]
  (or arXiv:1605.00492v2 [cs.MS] for this version)

Submission history

From: Eike Hermann Müller [view email]
[v1] Mon, 2 May 2016 14:06:24 GMT (506kb,D)
[v2] Wed, 14 Sep 2016 09:10:47 GMT (127kb,D)

Link back to: arXiv, form interface, contact.