-
Notifications
You must be signed in to change notification settings - Fork 79
Closed
Description
on A100 and H100
root@65627d6877e4:/opt/pytorch/nvfuser# bin/nvfuser_bench --benchmark_filter='NvFuserScheduler_Matmul_Manual/nvfuser_splitk_NT/M:1024/N:2048/K:4096/warps:8/stages:3/splitk_factor:5/smem_epilogue:1/manual_time'
......
terminate called after throwing an instance of 'nvfuser::nvfError'
what(): Fusion returns wrong results! The result tensor has shape [..., 1024,2048]. Mismatch happens at region result[...,526:783,0:328]
Exception raised from checkMatch at /opt/pytorch/nvfuser/benchmarks/cpp/matmul.cpp:104 (most recent call first):
frame #0: <unknown function> + 0x7e55f (0x5604dbe8b55f in bin/nvfuser_bench)
frame #1: <unknown function> + 0x6f653c (0x5604dc50353c in bin/nvfuser_bench)
frame #2: <unknown function> + 0x6fb670 (0x5604dc508670 in bin/nvfuser_bench)
frame #3: <unknown function> + 0x763ef8 (0x5604dc570ef8 in bin/nvfuser_bench)
frame #4: <unknown function> + 0x7434cb (0x5604dc5504cb in bin/nvfuser_bench)
frame #5: <unknown function> + 0x7450d5 (0x5604dc5520d5 in bin/nvfuser_bench)
frame #6: <unknown function> + 0x7456f9 (0x5604dc5526f9 in bin/nvfuser_bench)
frame #7: <unknown function> + 0x73482b (0x5604dc54182b in bin/nvfuser_bench)
frame #8: <unknown function> + 0x736c6e (0x5604dc543c6e in bin/nvfuser_bench)
frame #9: <unknown function> + 0x736f0e (0x5604dc543f0e in bin/nvfuser_bench)
frame #10: <unknown function> + 0x11acb4 (0x5604dbf27cb4 in bin/nvfuser_bench)
frame #11: <unknown function> + 0x29d90 (0x7f87a25d4d90 in /usr/lib/x86_64-linux-gnu/libc.so.6)
frame #12: __libc_start_main + 0x80 (0x7f87a25d4e40 in /usr/lib/x86_64-linux-gnu/libc.so.6)
frame #13: <unknown function> + 0x137355 (0x5604dbf44355 in bin/nvfuser_bench)
Aborted (core dumped)several other nvfuser_bench tests fail with similar error messages
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels