Skip to content

SASS/PTX emission behavior #585

@manopapad

Description

@manopapad

I just ran some builds passing different CUDA architectures to install.py, here's what comes out (tested on the generated .cu.o file with cuobjdump):

install.py --arch volta produces:

legion .cu files: sass70, ptx70
legate.core .cu files: sass70

install.py --arch "60;70" produces:

legion .cu files: sass60, sass70, ptx70
legate.core .cu files: sass60, ptx60, sass70, ptx70

install.py --arch 60 produces:

legion .cu files: sass60, ptx60
legate.core .cu files: sass60, ptx60

Can we replicate the Legion build behavior in legate.core/cunumeric as well? I.e. include all passed GPU architectures, emit SASS+PTX for latest architecture, just SASS for the rest, and handle named architectures the same way as numbered.

Metadata

Metadata

Labels

category:improvementPR introduces an improvement and will be classified as such in release notes

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions