Accelerated Cyclic Reduction: A distributed-memory fast solver for structured linear systems

dc.contributor.authorChav́ez, Gustavo
dc.contributor.authorTurkiyyah, George M.
dc.contributor.authorZampini, Stefano
dc.contributor.authorLtaief, Hatem
dc.contributor.authorKeyes, David E.
dc.contributor.departmentDepartment of Computer Science
dc.contributor.facultyFaculty of Arts and Sciences (FAS)
dc.contributor.institutionAmerican University of Beirut
dc.date.accessioned2025-01-24T11:22:56Z
dc.date.available2025-01-24T11:22:56Z
dc.date.issued2018
dc.description.abstractWe present Accelerated Cyclic Reduction (ACR), a distributed-memory fast solver for rank-compressible block tridiagonal linear systems arising from the discretization of elliptic operators, developed here for three dimensions. Algorithmic synergies between Cyclic Reduction and hierarchical matrix arithmetic operations result in a solver that has O(kNlogN(logN+k2)) arithmetic complexity and O(k Nlog N) memory footprint, where N is the number of degrees of freedom and k is the rank of a block in the hierarchical approximation, and which exhibits substantial concurrency. We provide a baseline for performance and applicability by comparing with the multifrontal method with and without hierarchical semi-separable matrices, with algebraic multigrid and with the classic cyclic reduction method. Over a set of large-scale elliptic systems with features of nonsymmetry and indefiniteness, the robustness of the direct solvers extends beyond that of the multigrid solver, and relative to the multifrontal approach ACR has lower or comparable execution time and size of the factors, with substantially lower numerical ranks. ACR exhibits good strong and weak scaling in a distributed context and, as with any direct solver, is advantageous for problems that require the solution of multiple right-hand sides. Numerical experiments show that the rank k patterns are of O(1) for the Poisson equation and of O(n) for the indefinite Helmholtz equation. The solver is ideal in situations where low-accuracy solutions are sufficient, or otherwise as a preconditioner within an iterative method. © 2017
dc.identifier.doihttps://doi.org/10.1016/j.parco.2017.12.001
dc.identifier.eid2-s2.0-85042919840
dc.identifier.urihttp://hdl.handle.net/10938/25569
dc.language.isoen
dc.publisherElsevier B.V.
dc.relation.ispartofParallel Computing
dc.sourceScopus
dc.subjectCyclic reduction
dc.subjectElliptic equations
dc.subjectFast direct solvers
dc.subjectHierarchical matrices
dc.subjectAlgebra
dc.subjectDegrees of freedom (mechanics)
dc.subjectLinear systems
dc.subjectMatrix algebra
dc.subjectMemory architecture
dc.subjectParallel processing systems
dc.subjectPoisson equation
dc.subjectBlock-tridiagonal linear systems
dc.subjectDirect solvers
dc.subjectHierarchical approximations
dc.subjectMultiple right-hand sides
dc.subjectNumber of degrees of freedom
dc.subjectIterative methods
dc.titleAccelerated Cyclic Reduction: A distributed-memory fast solver for structured linear systems
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2018-4242.pdf
Size:
1.82 MB
Format:
Adobe Portable Document Format