This work presents a variant of the conjugate gradient (CG) method with minimal memory access for the
vector operations, targeting high-order finite-element schemes with fast matrix-free operator evaluation and cheap
preconditioners like the matrix diagonal. The algorithm relies on a data-dependency analysis and interleaves the vector
updates and inner products in a CG iteration with the matrix-vector product. As a result, around 90% of the vector
entries of the three active vectors of the CG method are transferred from slow main memory exactly once per iteration,
with all additional accesses hitting fast cache memory. Node-level performance analyses and scaling studies on up to
147k cores show that the new method is around two times faster than a standard CG solver as well as optimized
pipelined CG and s-step CG methods for large sizes that exceed processor caches, and provides similar performance
near the strong scaling limit.
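To make the interleaving idea concrete, the following is a minimal, hypothetical sketch (not the paper's implementation) of a diagonally preconditioned CG iteration in which each sweep over the vectors fuses the operations that can share a pass: the block-wise matrix-vector product with the (p, Ap) reduction, and the x/r updates with the preconditioner application and the (r, z) reduction. The block size and dense matvec are purely illustrative; the paper's scheme operates within a matrix-free finite-element loop instead.

```python
import numpy as np

def fused_cg(A, b, M_inv, tol=1e-10, max_iter=100):
    """Diagonally preconditioned CG with fused vector passes.

    Illustrative sketch: each pass streams the active vectors once
    and combines the matvec, AXPY updates, and reductions it needs,
    mimicking the data-locality idea of the interleaved CG variant.
    """
    n = len(b)
    x = np.zeros(n)
    r = b.copy()            # residual, assuming x0 = 0
    z = M_inv * r           # preconditioned residual
    p = z.copy()            # search direction
    rz = r @ z
    block = 4               # illustrative block size (cache-sized in practice)
    for _ in range(max_iter):
        # Pass 1: block-wise matvec fused with the (p, Ap) reduction,
        # so p and v = A p are each streamed once.
        v = np.empty(n)
        pv = 0.0
        for s in range(0, n, block):
            e = min(s + block, n)
            v[s:e] = A[s:e] @ p
            pv += p[s:e] @ v[s:e]
        alpha = rz / pv
        # Pass 2: fuse the x and r updates, the diagonal-preconditioner
        # application, and the (r, z) reduction into one sweep.
        rz_new = 0.0
        for s in range(0, n, block):
            e = min(s + block, n)
            x[s:e] += alpha * p[s:e]
            r[s:e] -= alpha * v[s:e]
            z[s:e] = M_inv[s:e] * r[s:e]
            rz_new += r[s:e] @ z[s:e]
        if np.sqrt(r @ r) < tol:
            break
        beta = rz_new / rz
        p = z + beta * p    # a final pass that could be fused with the next matvec
        rz = rz_new
    return x
```

In a cache-aware implementation, the update of p would additionally be interleaved with the first touch of the next iteration's matrix-vector product, which is where the reported near-single-transfer behavior of the three active vectors comes from.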