diff --git a/doc/source/performances.rst b/doc/source/performances.rst
index 2222c2e1..84042685 100644
--- a/doc/source/performances.rst
+++ b/doc/source/performances.rst
@@ -4,9 +4,9 @@ Performances
 
 We report below the performances obtained on various architectures using Idefix. The reference test
 is the 3D MHD Orszag-Tang test problem with 2nd order reconstruction and uct_contact EMFS bundled in
-Idefix test suite, computed with a 128\ :sup:`3` resolution per MPI sub-domain on GPUs or 32\ :sup:`3`
-per MPI sub-domain on CPUs. All of the performances measures have been obtained enabling MPI on
-*one full node*, but we report here the performance *per GPU*
+Idefix test suite, disabling passive tracers. The test is computed with a 128\ :sup:`3` resolution per
+MPI sub-domain on GPUs or 32\ :sup:`3` per MPI sub-domain on CPUs. All of the performances measures
+have been obtained enabling MPI on *one full node*, but we report here the performance *per GPU*
 (i.e. with 2 GCDs on AMD Mi250) or *per core* (on CPU), i.e. dividing the node performance by the number
 of GPU/core to simplify the comparison with other clusters.
 
diff --git a/doc/source/reference/makefile.rst b/doc/source/reference/makefile.rst
index b015b682..bf8ecdad 100644
--- a/doc/source/reference/makefile.rst
+++ b/doc/source/reference/makefile.rst
@@ -125,7 +125,7 @@ Finally, *Idefix* can be configured to run on Mi250 by enabling HIP and the desi
 
 .. code-block:: bash
 
-    -DKokkos_ENABLE_HIP=ON -DKokkos_ENABLE_HIP_MULTIPLE_KERNEL_INSTANTIATION=ON -DKokkos_ARCH_VEGA90A=ON
+    -DKokkos_ENABLE_HIP=ON -DKokkos_ENABLE_HIP_MULTIPLE_KERNEL_INSTANTIATIONS=ON -DKokkos_ARCH_VEGA90A=ON
 
 MPI (multi-GPU) can be enabled by adding ``-DIdefix_MPI=ON`` as usual.