Skip to content

DisableUnrollLoops should ignore DisableLLVMLoopOpt for PTX#5900

Closed
steven-johnson wants to merge 25 commits intomasterfrom
srj/ptx-opt
Closed

DisableUnrollLoops should ignore DisableLLVMLoopOpt for PTX#5900
steven-johnson wants to merge 25 commits intomasterfrom
srj/ptx-opt

Conversation

@steven-johnson
Copy link
Contributor

It's not entirely clear why, but setting the DisableUnrollLoops flag to true in the PTX backend can produce Cuda code that will fail at runtime, at least for certain drivers (e.g.: some of the tests in correctness_gpu_dynamic_shared when running under NVidia v460.x).

It's not entirely clear why, but setting the `DisableUnrollLoops` flag to `true` in the PTX backend can produce Cuda code that will fail at runtime, at least for certain drivers (e.g.: some of the tests in correctness_gpu_dynamic_shared when running under NVidia v460.x).
@abadams
Copy link
Member

abadams commented Apr 20, 2021

This doesn't seem like the right fix. The code with loop opts disabled is much simpler and clearer, yet it crashes.

You said it was driver-dependent, right? Maybe this is a bug inside the cuda driver?

@abadams
Copy link
Member

abadams commented Apr 20, 2021

To be clearer, I think the reason this isn't the right fix is that I would like disable_llvm_loop_opt to work for the bodies of cuda kernels, because I think a lot of the time that generates better kernels.

@steven-johnson
Copy link
Contributor Author

You said it was driver-dependent, right? Maybe this is a bug inside the cuda driver?

Yeah, probably correct -- but this is in the standard nvidia (closed-source) linux driver v460 (which Ubuntu 20 installs by default), and it's been out for a while with no updates. Maybe we should file a bug?

@steven-johnson steven-johnson added the skip_buildbots Do not run buildbots on this PR. Must add before opening PR as we scan labels immediately. label May 14, 2021
@steven-johnson steven-johnson removed the skip_buildbots Do not run buildbots on this PR. Must add before opening PR as we scan labels immediately. label May 14, 2021
@steven-johnson steven-johnson added the skip_buildbots Do not run buildbots on this PR. Must add before opening PR as we scan labels immediately. label May 14, 2021
@steven-johnson
Copy link
Contributor Author

Definitely not the right fix, closing

@steven-johnson steven-johnson deleted the srj/ptx-opt branch May 26, 2021 16:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

skip_buildbots Do not run buildbots on this PR. Must add before opening PR as we scan labels immediately.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants