4.3 Smoothers and coarsest-level solvers

The smoothers implemented in MLD2P4 include the Jacobi and block-Jacobi methods, a hybrid version of the forward and backward Gauss-Seidel methods, and the additive Schwarz (AS) ones (see, e.g., [23, 24]).

The hybrid Gauss-Seidel version is considered because the original Gauss-Seidel method is inherently sequential. At each iteration of the hybrid version, each parallel process uses the most recent values of its own local variables and the values of the non-local variables computed at the previous iteration, obtained by exchanging data with other processes before the beginning of the current iteration.

In the AS methods, the index space Ωk is divided into m k subsets Ωik of size nk,i, possibly overlapping. For each i we consider the restriction operator Rik nk,i×nk that maps a vector xk to the vector x ik made of the components of xk with indices in Ω ik, and the prolongation operator Pik = (R ik)T . These operators are then used to build A ik = R ikAkP ik, which is the restriction of Ak to the index space Ω ik. The classical AS preconditioner MASk is defined as

   k  -1   m∑k   k   k -1  k
(M  AS)   =     Pi (A i) R i,
           i=1

where Aik is supposed to be nonsingular. We observe that an approximate inverse of Aik is usually considered instead of (A ik)-1. The setup of M ASk during the multilevel build phase involves

The computation of zk = M ASkwk, with wk nk, during the multilevel application phase, requires

Variants of the classical AS method, which use modifications of the restriction and prolongation operators, are also implemented in MLD2P4. Among them, the Restricted AS (RAS) preconditioner usually outperforms the classical AS preconditioner in terms of convergence rate and of computation and communication time on parallel distributed-memory computers, and is therefore the most widely used among the AS preconditioners [6].

Direct solvers based on sparse LU factorizations, implemented in the third-party libraries reported in Section 3.2, can be applied as coarsest-level solvers by MLD2P4. Native inexact solvers based on incomplete LU factorizations, as well as Jacobi, hybrid (forward) Gauss-Seidel, and block Jacobi preconditioners are also available. Direct solvers usually lead to more effective preconditioners in terms of algorithmic scalability; however, this does not guarantee parallel efficiency.