Skip to content

sfilippone/amg4psblas

Repository files navigation

AMG4PSBLAS v1.2

Algebraic Multigrid Package based on PSBLAS (Parallel Sparse BLAS version 3.9)

AMG4PSBLAS is a package of parallel algebraic multilevel preconditioners included in the PSCToolkit (Parallel Sparse Computation Toolkit) software framework.

It is the prosecution of a software development project called MLD2P4 that started in 2007, which originally implemented a multilevel version of some domain decomposition preconditioners of additive-Schwarz type and was based on a parallel decoupled version of the well known smoothed aggregation method to generate the multilevel hierarchy of coarser matrices.

In the last few years the package was extended by including new algorithms and functionalities for the setup and application new AMG preconditioners with the final goal of improving efficiency and scalability when using tens of thousands cores and boosting reliability in dealing with general symmetric positive definite linear systems.

The original license is shown in the file LICENSE.MLD2P4; due to the significant number of changes and the vast enlargement in scope, we decided to rename the package as AMG4PSBLAS.

AMG4PSBLAS has been designed to provide scalable and easy-to-use preconditioners in the context of the PSBLAS (Parallel Sparse Basic Linear Algebra Subprograms) computational framework and can be used in conjuction with the Krylov solvers available in this framework. Our package is based on a completely algebraic approach; therefore users level interfaces assume that the system matrix and preconditioners are represented as PSBLAS distributed sparse matrices.

AMG4PSBLAS enables the user to easily specify different features of an algebraic multilevel preconditioner, thus allowing to experiment with different preconditioners for the problem and parallel computers at hand.

The package employs object-oriented design techniques in Fortran 2008, with interfaces to additional third party libraries such as MUMPS, UMFPACK, SuperLU, and SuperLU_Dist, which can be exploited in building multilevel preconditioners. The parallel implementation is based on a Single Program Multiple Data (SPMD) paradigm; the inter-process communication is based on MPI and is managed mainly through PSBLAS.

Main Refrerences:

The main reference for this project is

D'Ambra, P., Durastante, F., & Filippone, S. (2021). AMG preconditioners for linear solvers towards extreme scale. SIAM Journal on Scientific Computing, 43(5), S679-S703.

AMG4PSBLAS is the suite of preconditioners for the Parallel Sparse Computation Toolkit (PSCToolkit) suite of libraries. See the paper:

D’Ambra, P., Durastante, F., & Filippone, S. (2023). Parallel Sparse Computation Toolkit. Software Impacts, 15, 100463.

The main reference for features inherited from MLD2P4 is

P. D'Ambra, D. di Serafino, S. Filippone, MLD2P4: a Package of Parallel Algebraic Multilevel Domain Decomposition Preconditioners in Fortran 95, ACM Transactions on Mathematical Software, 37 (3), 2010, art. 30, doi: 10.1145/1824801.1824808.

Installing

Installation requires a working version of the PSBLAS library as a prerequisite. AMG4PSBLAS has several interfaces to third-party libraries that can be used in the construction and application phases of preconditioners;
in particular, it is possible to link AMG4PSBLAS with the libraries: MUMPS, SuperLU, SuperLU_Dist, UMFPACK. The usage of these third party libraries is not mandatory: the package can function
in isolation and without these features.

  1. Unpack the tar file in a directory of your choice (preferrably outside the main PSBLAS directory).
  2. run configure --with-psblas=<ABSOLUTE path of the PSBLAS install directory> --prefix=<install_path> adding the options for MUMPS, SuperLU, SuperLU_Dist, UMFPACK as desired. See AMG4PSBLAS User's and Reference Guide (Section 3) for details.
  3. Tweak Make.inc if you are not satisfied.
  4. run make;
  5. Go into the test subdirectory and build the examples of your choice.
  6. (if desired): make install or sudo make install if the install path requires privileged access.

Caution

The single precision version is supported only by MUMPS and SuperLU; thus, even if you specify at configure time to use UMFPACK or SuperLU_Dist, the corresponding preconditioner options will be available only from the double precision version.

CMAKE

AMG4PSBLAS supports building with CMake. To configure the project, you must explicitly provide the path where PSBLAS is installed. If this path is not specified, the configuration will fail with a fatal error.

From the root directory of the project, run:

1. Create and enter the build directory

mkdir build
cd build

2. Configure the project (MANDATORY: specify your PSBLAS path)

cmake -DPSBLAS_INSTALL_DIR=</path/to/psblas/installation> ..

During this step, CMake will:

  • Search for the PSBLAS package in the provided path.
  • Detect and configure MPI (required for C, C++, and Fortran).
  • Set up include and module directories based on the PSBLAS configuration.
  • Configure integer sizes (IPK and LPK) to match the PSBLAS installation.

2.1. Customizing the Installation Path

By default, the library will be installed in standard system locations. To install amg4psblas in a custom directory, use the CMAKE_INSTALL_PREFIX variable:

cmake -DPSBLAS_INSTALL_DIR=</path/to/psblas/installation> \
      -DCMAKE_INSTALL_PREFIX=</path/to/amg4psblas_install> ..

3. Compiling and Installing

Once configured, you can build the libraries (amg_prec and amg_cbind) and install them.

Build the library

make

Install the library, modules, and samples

make install

CUDA, OpeMP, OpenACC

CUDA, OpenMP and OpenACC features are transparently inherited by PSBLAS installation. If PSBLAS has been configured (and installed) with these supports then AMG4PSBLAS will transparently inherit them. It will then be possible to move the computation to GPU accelerator simply by selecting the appropriate variable types in the application. If the types have not been activated or installed for PSBLAS then they will not be available for AMG4PSBLAS either and the operation will be purely on CPU/MPI. See also the samples/cuda folder.

EoCoE - Software as service portal

In the European project “Energy oriented Center of Excellence: toward exascale for energy” we made available the software through a service portal: https://eocoe.psnc.pl/. This permits to test several cutting-edge computational methods for accelerating the transition to production, storage and management of clean, decarbonized energy. Among them you have the possibility of running PSBLAS+AMG4PSBLAS on some test problems to become familiar with using the software.

MPI and Compilers

The library has been successfully compiled and tested with the same compilers and MPI implementations as PSBLAS 3.9, which include:

  • MPICH 4.2.3, 4.3.0, 4.3.2
  • OpenMPI 4.1.8. 5.0.7, 5.0.8, 5.0.9

combined with

  • GNU compilers 10.5.0, 11.5.0, 12.5.0, 13.3.0, 14.2.0 14.3.0, 15.2.0
  • LLVM 20.1.0 and 21.1.0 (except OpenMPI 4.1.8 which does not build with LLVM)

Moreover, it has been tested with the Intel OneAPI toolchain versions 2025.2 and 2025.3

As of this release, the NVIDIA compiler 25.7 fails to handle our code. Cray, IBM and NAg compilers have been used for testing in the past, but not on this version.

TODO and bugs

  • Fix all reamining bugs. Bugs? We dont' have any ! 🤓

Note

To report bugs 🐛 or issues ❓ please use the GitHub issue system.

The AMG4PSBLAS team.

  • Pasqua D'Ambra (IAC-CNR, Naples, IT)
  • Fabio Durastante (University of Pisa and IAC-CNR, IT)
  • Salvatore Filippone (University of Rome Tor Vergata and IAC-CNR, IT)

Contributors (roughly reverse cronological order):

  • Luca Pepè Sciarria
  • Andrea Di Iorio
  • Ambra Abdullahi Hassan
  • Alfredo Buttari

About

A library of parallel sparse preconditionersfor PSBLAS

Topics

Resources

License

Unknown, Unknown licenses found

Licenses found

Unknown
LICENSE
Unknown
LICENSE.MLD2P4

Stars

Watchers

Forks

Packages

No packages published

Contributors 5