Merged
39 commits
4871063
add
pggPL Mar 25, 2025
2040b35
weight workspace fix
pggPL Mar 25, 2025
6d60342
docs fix
pggPL Mar 25, 2025
0780661
file i forgot
pggPL Apr 1, 2025
f1a8e08
Merge remote-tracking branch 'upstream/main' into nvinspect_core
pggPL Apr 1, 2025
3db240f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 1, 2025
4ea29c8
fix
pggPL Apr 1, 2025
d39000c
lint fix
pggPL Apr 1, 2025
e727df1
Update transformer_engine/debug/pytorch/utils.py
pggPL Apr 2, 2025
9a8030e
setup fix
pggPL Apr 2, 2025
fb5b176
setup fix
pggPL Apr 2, 2025
b84f4e7
Update transformer_engine/pytorch/tensor/_internal/float8_tensor_base.py
pggPL Apr 2, 2025
c93afb7
all tensor types
pggPL Apr 2, 2025
c433066
fixes
pggPL Apr 2, 2025
85256b7
fixes
pggPL Apr 2, 2025
cc865b2
Merge remote-tracking branch 'upstream/main' into nvinspect_core
pggPL Apr 2, 2025
78db8c0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 2, 2025
348d4f4
fix
pggPL Apr 2, 2025
9446dff
Merge remote-tracking branch 'upstream/main' into nvinspect_core
pggPL Apr 7, 2025
2e1aa04
fixes
pggPL Apr 7, 2025
ef1ce89
fix
pggPL Apr 7, 2025
6205971
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 7, 2025
207c6a9
Merge branch 'main' into nvinspect_core
pggPL Apr 8, 2025
96120b4
removed check
pggPL Apr 8, 2025
d73128a
move error
pggPL Apr 8, 2025
0e04417
Merge branch 'main' into nvinspect_core
pggPL Apr 8, 2025
3ce9ede
Merge branch 'main' into nvinspect_core
pggPL Apr 9, 2025
9fccb57
_reset
pggPL Apr 9, 2025
6957da5
Update transformer_engine/pytorch/module/linear.py
pggPL Apr 14, 2025
e8b61f4
Merge remote-tracking branch 'upstream/main' into nvinspect_core
pggPL Apr 15, 2025
64332c4
name documentation
pggPL Apr 15, 2025
17d93fa
added blockwise quantizer
pggPL Apr 15, 2025
876e6bf
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
ff9d053
fix
pggPL Apr 15, 2025
9a2ffe2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
9eaf124
make debug option optional
pggPL Apr 15, 2025
650393e
Update transformer_engine/pytorch/tensor/quantized_tensor.py
pggPL Apr 15, 2025
b0d92c9
names fix
pggPL Apr 15, 2025
3aecae6
Merge branch 'main' into nvinspect_core
pggPL Apr 16, 2025
4 changes: 4 additions & 0 deletions setup.py
@@ -110,6 +110,10 @@ def setup_requirements() -> Tuple[List[str], List[str], List[str]]:
    if not bool(int(os.getenv("NVTE_RELEASE_BUILD", "0"))):
        if "pytorch" in frameworks:
            install_reqs.extend(["torch>=2.1"])
            install_reqs.append(
                "nvdlfw-inspect @"
                " git+https://github.com/NVIDIA/nvidia-dlfw-inspect.git@v0.1#egg=nvdlfw-inspect"
            )
            # Blackwell is not supported as of Triton 3.2.0, need custom internal build
            # install_reqs.append("triton")
            test_reqs.extend(["numpy", "torchvision", "prettytable", "PyYAML"])
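
Note on the added requirement: the two adjacent string literals are joined by Python into a single requirement string of the form `name @ url`, which pins the package to a Git tag rather than to a PyPI release. The sketch below only illustrates how that string decomposes. `split_direct_reference` is a hypothetical helper written for this note, not part of setup.py, pip, or setuptools, and it assumes the simple `name @ vcs+url@ref#fragment` shape seen in the diff.

```python
# Adjacent string literals are concatenated at compile time, so this is
# the single requirement string that the diff appends to install_reqs.
REQUIREMENT = (
    "nvdlfw-inspect @"
    " git+https://github.com/NVIDIA/nvidia-dlfw-inspect.git@v0.1#egg=nvdlfw-inspect"
)


def split_direct_reference(requirement):
    """Split 'name @ vcs+url@ref#fragment' into (name, repo_url, ref).

    Hypothetical helper for illustration only; it handles just the simple
    shape used in the diff and is not a general requirement parser.
    """
    # The first '@' separates the distribution name from the URL.
    name, _, url = requirement.partition("@")
    name, url = name.strip(), url.strip()
    # The last '@' in the URL separates the repository from the Git ref.
    repo_url, _, ref_and_fragment = url.rpartition("@")
    # Drop the '#egg=...' fragment to keep only the ref (here, a tag).
    ref = ref_and_fragment.split("#", 1)[0]
    return name, repo_url, ref


name, repo_url, ref = split_direct_reference(REQUIREMENT)
print(name, repo_url, ref)
```

Because the dependency is resolved from a Git URL, installing with the PyTorch framework selected presumably requires network access to GitHub at install time.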
11 changes: 11 additions & 0 deletions transformer_engine/debug/__init__.py
@@ -0,0 +1,11 @@
# Copyright (c) 2022-2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# See LICENSE for license information.

"""Top level package for numerical debugging."""

try:
    from . import pytorch
    from .pytorch.debug_state import set_weight_tensor_tp_group_reduce
except ImportError as e:
    pass
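
Note on the guarded import: the `try`/`except ImportError` block above makes the debug package importable even when its dependencies are missing, which appears to match the "make debug option optional" commit. One consequence is that a missing dependency fails silently. The sketch below shows the same optional-import pattern with an explicit availability check. `try_import` and the module names passed to it are hypothetical and are not part of Transformer Engine.

```python
import importlib


def try_import(module_name):
    """Return the imported module, or None if it cannot be imported.

    Hypothetical helper for illustration; it mirrors the try/except
    ImportError guard in the diff but exposes the outcome to the caller.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError:
        # ModuleNotFoundError is a subclass of ImportError, so a missing
        # optional dependency lands here instead of propagating.
        return None


# A stdlib module stands in for a dependency that is installed.
available = try_import("json")
# A deliberately nonexistent name stands in for a missing optional dependency.
missing = try_import("a_module_that_does_not_exist_12345")

# Callers can branch on availability instead of hitting a NameError later.
DEBUG_AVAILABLE = missing is not None
print(available is not None, DEBUG_AVAILABLE)
```

Recording the result in a flag such as `DEBUG_AVAILABLE` is one way to let later code report a clear error when a debug feature is requested without its dependency installed.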
3 changes: 3 additions & 0 deletions transformer_engine/debug/pytorch/__init__.py
@@ -0,0 +1,3 @@
# Copyright (c) 2022-2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# See LICENSE for license information.