Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
107 commits
Select commit Hold shift + click to select a range
e605477
adding initial changes to master configs; adding initial updates to v…
cquil11 Nov 21, 2025
039cf15
adding new gb200 script
cquil11 Nov 21, 2025
f0d851d
adding integration to gb200 runner script and workflow files
cquil11 Nov 21, 2025
c967d83
revert and correct name of 1k1k scheduler workflow
cquil11 Nov 21, 2025
1b1b7a4
adding runners.yaml to workflow invocation
cquil11 Nov 21, 2025
1c03192
toJson on conc since it is now a list
cquil11 Nov 21, 2025
9fa8e92
correctly sending conc list to multnode
cquil11 Nov 21, 2025
fc77648
hotfix
cquil11 Nov 21, 2025
06ecd9c
correct env var to MAX batch size
cquil11 Nov 21, 2025
2401e65
set -x
cquil11 Nov 21, 2025
4104b15
debugging with dynmao fork
cquil11 Nov 21, 2025
d2a5c9f
debugging with dynmao fork pt 2
cquil11 Nov 21, 2025
9ca96b1
experiment
cquil11 Nov 21, 2025
379ccd4
adding separate script for launching
cquil11 Nov 24, 2025
cc24ba5
changing filenames
cquil11 Nov 24, 2025
cfd9890
ntasks per node
cquil11 Nov 24, 2025
6cf098c
making the spec-decoding output required
cquil11 Nov 24, 2025
34d8dc3
updating ntasks per node
cquil11 Nov 24, 2025
bccc2f1
test
cquil11 Nov 24, 2025
09b9ae5
test
cquil11 Nov 24, 2025
56ccdcd
conc list quoted
cquil11 Nov 24, 2025
a81e309
get rid of debug code
cquil11 Nov 24, 2025
6d818e5
testing support for dsr1
cquil11 Nov 24, 2025
9c8a245
testing support for dsr1 test
cquil11 Nov 24, 2025
5959a3c
testing support for dsr1 test
cquil11 Nov 24, 2025
a23315b
testing support for dsr1 test
cquil11 Nov 24, 2025
a809af2
testing support for dsr1 test
cquil11 Nov 24, 2025
dbc5a37
testing
cquil11 Nov 25, 2025
ef127b2
some changes to generate sweeps
cquil11 Nov 25, 2025
96171cf
testing and debugging
cquil11 Nov 25, 2025
faad1fc
adding new file code for sglang
cquil11 Nov 26, 2025
4b615da
adding new file code for sglang
cquil11 Nov 26, 2025
b25b730
changing file path
cquil11 Nov 26, 2025
4f50522
updating multinode fn hash
cquil11 Nov 26, 2025
3be1d1b
updating multinode fn hash
cquil11 Nov 26, 2025
a033f5d
dynamo trtllm to dynamo trt
cquil11 Nov 26, 2025
3f09a0e
changing process result
cquil11 Nov 26, 2025
a66597d
add is multinode
cquil11 Nov 26, 2025
0c22a96
bug fix
cquil11 Nov 27, 2025
6cec238
bug fix
cquil11 Nov 27, 2025
6ad9a8e
bug fix
cquil11 Nov 27, 2025
ea8ed1b
bug fix
cquil11 Nov 27, 2025
07f4af9
polishing
cquil11 Nov 27, 2025
01be2d5
polishing pt 2
cquil11 Nov 27, 2025
825712b
polishing pt 3
cquil11 Nov 27, 2025
05ddada
Merge branch 'main' into multinode-integration
cquil11 Nov 27, 2025
e9e7933
polishing pt 4
cquil11 Nov 27, 2025
8315ad6
Merge branch 'main' into multinode-integration
cquil11 Dec 1, 2025
95a05aa
fixing summarize.py
cquil11 Dec 1, 2025
da5e44c
polishing
cquil11 Dec 1, 2025
a865687
testing
cquil11 Dec 1, 2025
da73311
testing
cquil11 Dec 1, 2025
1227541
adding testing workflows
cquil11 Dec 1, 2025
0828c3e
adding testing workflows
cquil11 Dec 1, 2025
4c0dc18
adding testing workflows
cquil11 Dec 1, 2025
0a8a6c6
adding testing workflows
cquil11 Dec 1, 2025
4e98fe4
adding testing workflows
cquil11 Dec 1, 2025
7826b98
adding testing workflows
cquil11 Dec 1, 2025
3c1ce68
adding testing workflows
cquil11 Dec 1, 2025
15eda16
adding testing workflows
cquil11 Dec 1, 2025
ee065da
adding testing workflows
cquil11 Dec 1, 2025
cf583ae
adding testing workflows
cquil11 Dec 1, 2025
17c8fcb
adding testing workflows
cquil11 Dec 1, 2025
3994996
adding testing workflows
cquil11 Dec 1, 2025
b866178
adding testing workflows
cquil11 Dec 1, 2025
0e44f46
adding testing workflows
cquil11 Dec 1, 2025
8beb869
adding testing workflows
cquil11 Dec 1, 2025
2379449
adding tests
cquil11 Dec 1, 2025
9b3a680
adding tests
cquil11 Dec 1, 2025
64bcb03
Merge branch 'main' into multinode-integration
cquil11 Dec 1, 2025
8aab103
adding tests
cquil11 Dec 1, 2025
af3b8be
adding tests
cquil11 Dec 1, 2025
84feeda
adding tests
cquil11 Dec 1, 2025
e839277
adding tests
cquil11 Dec 1, 2025
37574dd
adding tests
cquil11 Dec 1, 2025
929ba0c
add updates for newest gb200 merge
cquil11 Dec 1, 2025
50f0af5
Merge branch 'main' into multinode-integration
cquil11 Dec 1, 2025
c95a3e4
add updates for newest gb200 merge pt 2
cquil11 Dec 2, 2025
5eb8ea1
move ntasks per node to framework level instead of runner level
cquil11 Dec 2, 2025
711783f
nexp hard coded to 1:
cquil11 Dec 2, 2025
0a19e43
add AMD configs to full sweep
cquil11 Dec 2, 2025
d041278
shut the line counter workflow up haha
cquil11 Dec 2, 2025
a604573
shut the line counter workflow up haha
cquil11 Dec 2, 2025
feae717
shut the line counter workflow up haha pt 2
cquil11 Dec 2, 2025
13a2761
updating testing logic
cquil11 Dec 2, 2025
8811b2d
add model prefix to label validator
cquil11 Dec 2, 2025
2a8626c
add more descriptive name to tests
cquil11 Dec 2, 2025
d126dca
update test for process results
cquil11 Dec 2, 2025
d82c9a6
Merge branch 'main' into multinode-integration
cquil11 Dec 2, 2025
7ee5bdb
add script mode
cquil11 Dec 3, 2025
f22cf47
fix bug
cquil11 Dec 3, 2025
efcb4e4
sglang: add fp8 8k1k and fp4 1k1k (#274)
ishandhanani Dec 4, 2025
21ec133
Revert "sglang: add fp8 8k1k and fp4 1k1k (#274)" (#283)
cquil11 Dec 4, 2025
228c8c9
Merge branch 'main' into multinode-integration
cquil11 Dec 4, 2025
0034119
Merge branch 'main' into multinode-integration
cquil11 Dec 4, 2025
1989874
Merge branch 'main' into multinode-integration
cquil11 Dec 5, 2025
ecc2025
get rid of ntasks per node required env var for sglang
cquil11 Dec 5, 2025
b8d6b23
bug fix
cquil11 Dec 5, 2025
caa7197
bug fix missing amd
cquil11 Dec 5, 2025
2d55f70
bug fix missing amd pt 2
cquil11 Dec 5, 2025
d7b36ea
Merge branch 'main' into multinode-integration
cquil11 Dec 5, 2025
3032a57
add served model name to summary
cquil11 Dec 5, 2025
34b257f
add served model name to summary pt 2
cquil11 Dec 5, 2025
38814e1
add served model name to summary pt 3
cquil11 Dec 5, 2025
54d0e42
fix max model len bug
cquil11 Dec 5, 2025
34870e3
add readme
cquil11 Dec 5, 2025
ca1c279
add image to json result
cquil11 Dec 5, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions .github/configs/amd-master.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@ dsr1-fp4-mi355x-sglang:
runner: mi355x
precision: fp4
framework: sglang
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand All @@ -27,6 +28,7 @@ dsr1-fp8-mi300x-sglang:
runner: mi300x
precision: fp8
framework: sglang
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand All @@ -48,6 +50,7 @@ dsr1-fp8-mi325x-sglang:
runner: mi325x
precision: fp8
framework: sglang
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand All @@ -69,6 +72,7 @@ dsr1-fp8-mi355x-sglang:
runner: mi355x
precision: fp8
framework: sglang
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand All @@ -90,6 +94,7 @@ gptoss-fp4-mi300x-vllm:
runner: mi300x
precision: fp4
framework: vllm
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand Down Expand Up @@ -120,6 +125,7 @@ gptoss-fp4-mi325x-vllm:
runner: mi325x
precision: fp4
framework: vllm
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand Down Expand Up @@ -150,6 +156,7 @@ gptoss-fp4-mi355x-vllm:
runner: mi355x
precision: fp4
framework: vllm
multinode: false
seq-len-configs:
- isl: 1024
osl: 1024
Expand Down
Loading
Loading