[NVIDIA] update vllm b200 image. TODO: add logic for docker runner. #3

Merged

kimbochen merged 3 commits into main from kepotdar/vllm-b200-update
Sep 3, 2025
Conversation

@kedarpotdar-nv
Collaborator

Updated the vLLM B200 runner image to ToT. This image will work for Hopper as well, but we want to try the B200 updates first.

TODO: apply changes to Hopper and B200 docker config.

@kedarpotdar-nv kedarpotdar-nv added the enhancement New feature or request label Sep 2, 2025
Comment thread on benchmarks/70b_b200_slurm.sh (Outdated)

FUSION_FLAG='{"pass_config":{"enable_fi_allreduce_fusion":true,"enable_attn_fusion":true,"enable_noop":true},"custom_ops":["+quant_fp8","+rms_norm"],"cudagraph_mode":"FULL_DECODE_ONLY","splitting_ops":[]}'

NO_PREFIX_CACHING_FLAG="--no-enable-prefix-caching"
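The benchmark script itself is not shown in this thread, but the FUSION_FLAG above is a JSON compilation config. A minimal sketch of how these variables might be checked and consumed follows; the `vllm serve` invocation (with the `-O` compilation-config option) is an assumption about the surrounding script, not code from this PR:

```shell
# Flag definitions as they appear in benchmarks/70b_b200_slurm.sh.
FUSION_FLAG='{"pass_config":{"enable_fi_allreduce_fusion":true,"enable_attn_fusion":true,"enable_noop":true},"custom_ops":["+quant_fp8","+rms_norm"],"cudagraph_mode":"FULL_DECODE_ONLY","splitting_ops":[]}'
NO_PREFIX_CACHING_FLAG="--no-enable-prefix-caching"

# Sanity-check that the compilation config is valid JSON before launching a run;
# a malformed config would otherwise fail only at server startup.
echo "$FUSION_FLAG" | python3 -m json.tool > /dev/null && echo "FUSION_FLAG is valid JSON"

# Assumed invocation shape (model name hypothetical, not from this PR):
# vllm serve meta-llama/Llama-3.1-70B -O "$FUSION_FLAG" $NO_PREFIX_CACHING_FLAG
```

Validating the JSON up front is cheap insurance in a Slurm script, where a quoting mistake in a long single-line JSON literal is easy to make and expensive to discover mid-job.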

This seems redundant


@kimbochen kimbochen left a comment


Thank you for the PR. Everything looks good except the NO_PREFIX_CACHING_FLAG.

@kedarpotdar-nv
Collaborator Author

Good catch, fixed!

@kimbochen kimbochen merged commit f41800f into main Sep 3, 2025
@kimbochen kimbochen deleted the kepotdar/vllm-b200-update branch September 3, 2025 01:35
@cquil11 cquil11 added the NVIDIA label Apr 8, 2026
@cquil11 cquil11 changed the title from "update vllm b200 image. TODO: add logic for docker runner." to "[NVIDIA] update vllm b200 image. TODO: add logic for docker runner." Apr 8, 2026
Labels

enhancement (New feature or request), NVIDIA

3 participants