Improve upon error reporting in common #389

Closed
janekb04 wants to merge 2 commits into NVIDIA:main from janekb04:error-reporting

@janekb04
Collaborator

@janekb04 janekb04 commented Aug 18, 2023

Cherry picked from #377

  1. Makes the error-checking macros report the real source location of the error, rather than the unhelpful logging.h.
  2. Fixes a segfault when cuGetErrorString is called with the wrong number of arguments.
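
As a rough illustration of the first point (not the code from this PR): `__FILE__` and `__LINE__` expand where they are written, so a check implemented inside a helper in a header reports that header, while a macro that expands them at the call site reports the caller. The names `EXAMPLE_CHECK`, `throw_with_location`, and `failing_check_message` below are hypothetical and exist only for this sketch.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper: it receives the location as arguments. If it used
// __FILE__/__LINE__ itself, every error would point at this helper's file.
[[noreturn]] inline void throw_with_location(const char* file, int line,
                                             const std::string& message) {
  throw std::runtime_error(std::string(file) + ":" + std::to_string(line) +
                           ": " + message);
}

// Hypothetical macro: __FILE__ and __LINE__ are expanded at the call site,
// so the reported location is where the failing check was written.
#define EXAMPLE_CHECK(cond, message)                      \
  do {                                                    \
    if (!(cond)) {                                        \
      throw_with_location(__FILE__, __LINE__, (message)); \
    }                                                     \
  } while (false)

// Triggers a failing check and returns the resulting error text.
// line_out receives the line number on which the check was written.
inline std::string failing_check_message(int* line_out) {
  try {
    *line_out = __LINE__ + 1;  // the check sits on the next line
    EXAMPLE_CHECK(1 + 1 == 3, "arithmetic is broken");
  } catch (const std::runtime_error& e) {
    return e.what();
  }
  return "";
}
```

Calling `failing_check_message` yields a message of the form `<file>:<line>: arithmetic is broken`, where the line is the one holding the `EXAMPLE_CHECK` call rather than a line inside the helper.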

Signed-off-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Jan Bielak <jbielak@nvidia.com>
@timmoon10
Collaborator

These changes would fix #376. It's similar to #382, so I think we should incorporate these changes into that PR.

timmoon10 added a commit to timmoon10/TransformerEngine that referenced this pull request Aug 18, 2023
Incorporating changes from NVIDIA#389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>
@ksivaman
Member

ksivaman commented Aug 19, 2023

Can we close this in favor of #382? @timmoon10 @janekb04

@janekb04
Collaborator Author

I think so. It seems #382 was rebased on this.

@janekb04 janekb04 closed this Aug 21, 2023
timmoon10 added a commit that referenced this pull request Oct 24, 2023
* Do not include logging macros in installed C headers

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug logging macros

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug C++ tests

Use Google style for header includes.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Update CUDA driver macros

Incorporating changes from #389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Use core error checking macros in PyTorch extensions

Hack to get around macro redefinition warning.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Fix missing arg when getting CUDA driver error string

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Reuse logging header in frameworks

Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------

Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
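
For context on the "missing arg" fix above: the CUDA driver API declares `cuGetErrorString(CUresult error, const char** pStr)`, so a call has to pass both the error code and an output pointer. The sketch below is not the code from this PR. It uses stand-in types (`ExampleResult`, `example_get_error_string`, `describe`, all hypothetical) so that it compiles without the CUDA toolkit, and only shows the two-argument calling pattern with a null check on the returned string.

```cpp
#include <string>

// Stand-in result type so the sketch builds without the CUDA toolkit.
// The real driver API declares:
//   CUresult cuGetErrorString(CUresult error, const char** pStr);
enum ExampleResult : int { kExampleSuccess = 0, kExampleInvalidValue = 1 };

// Hypothetical stand-in with the same two-parameter shape: the error code
// goes in, and the description comes back through the output pointer.
inline ExampleResult example_get_error_string(ExampleResult error,
                                              const char** out) {
  switch (error) {
    case kExampleSuccess:
      *out = "no error";
      return kExampleSuccess;
    case kExampleInvalidValue:
      *out = "invalid argument";
      return kExampleSuccess;
  }
  // Unrecognized code: report failure and leave no string behind.
  *out = nullptr;
  return kExampleInvalidValue;
}

// Passes both arguments and guards against a null description before
// converting it to std::string.
inline std::string describe(ExampleResult error) {
  const char* text = nullptr;
  const ExampleResult status = example_get_error_string(error, &text);
  if (status != kExampleSuccess || text == nullptr) {
    return "unknown error";
  }
  return text;
}
```

With these stand-ins, `describe(kExampleInvalidValue)` returns `"invalid argument"`, and an unrecognized code falls back to `"unknown error"` instead of dereferencing a null pointer.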
@janekb04 janekb04 deleted the error-reporting branch August 4, 2025 16:35