EESSI CUDA hook prevents loading even local (non-EESSI) CUDA module #523

@casparvl

{EESSI 2023.06} (eessi_test_venv) [casparl@tcn1 PyTorch]$ module load torchvision/0.13.1-foss-2022a-CUDA-11.7.0
Lmod has detected the following error:
You requested to load CUDA  but while the module file exists, the actual software is not entirely shipped with EESSI due to licencing. You will need to install a full copy of the CUDA SDK where EESSI can find it.
For more information on how to do this, see https://www.eessi.io/docs/gpu/.

While processing the following module(s):
    Module fullname                            Module Filename
    ---------------                            ---------------
    CUDA/11.7.0                                /sw/arch/RHEL8/EB_production/2022/modulefiles/system/CUDA/11.7.0.lua
    torchvision/0.13.1-foss-2022a-CUDA-11.7.0  /sw/arch/RHEL8/EB_production/2022/modulefiles/vis/torchvision/0.13.1-foss-2022a-CUDA-11.7.0.lua

_mlstatus = False

There is no reason to block this load: the CUDA module being loaded is a local module (it lives under /sw/arch/RHEL8/EB_production/2022/modulefiles/, not in EESSI). Especially if we want to support building on top of EESSI, this should just work.

It's (probably) quite easy to fix: the if condition in the hook should be more specific, so that it also checks whether the module being loaded is an EESSI CUDA module. Checking whether the module file sits somewhere under the /cvmfs/$EESSI_CVMFS_REPO/ prefix is probably enough.
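
To make the proposed condition concrete, here is a minimal sketch of the logic in Python. It is only an illustration: the real hook is part of the Lmod configuration and would need the equivalent test there, alongside whatever other conditions it already applies. The function names, the `eessi_prefix` parameter, and the `/cvmfs/example.repo` path are all hypothetical; only the local module path is taken from the output above.

```python
def is_under_prefix(module_file: str, prefix: str) -> bool:
    """Return True if module_file lies inside the directory tree rooted at prefix."""
    # Append a trailing slash so "/cvmfs/repo" does not match "/cvmfs/repo-other/...".
    return module_file.startswith(prefix.rstrip("/") + "/")


def should_block_cuda_load(module_fullname: str, module_file: str, eessi_prefix: str) -> bool:
    """Block only CUDA modules whose module file is shipped inside the EESSI prefix.

    Assumption: the hook knows the module's full name and the path of its module file.
    """
    simple_name = module_fullname.split("/", 1)[0]
    return simple_name == "CUDA" and is_under_prefix(module_file, eessi_prefix)


# Hypothetical EESSI prefix, for illustration only.
EESSI_PREFIX = "/cvmfs/example.repo"

# The local module from this issue: not under the prefix, so it should load.
local_cuda = "/sw/arch/RHEL8/EB_production/2022/modulefiles/system/CUDA/11.7.0.lua"
print(should_block_cuda_load("CUDA/11.7.0", local_cuda, EESSI_PREFIX))

# A hypothetical module file inside the prefix: this one is still blocked.
eessi_cuda = EESSI_PREFIX + "/modules/all/CUDA/11.7.0.lua"
print(should_block_cuda_load("CUDA/11.7.0", eessi_cuda, EESSI_PREFIX))
```

With this kind of check, only module files under the EESSI prefix would trigger the error, and local CUDA installations such as the one above would load normally.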
