[AMD] Use vLLM framework for DeepSeek R1 on MI325 and MI355 hardware by omirosh · Pull Request #104 · SemiAnalysisAI/InferenceX

omirosh · 2025-10-14T13:15:54Z

Please consider changing the framework for DeepSeek R1 to vLLM; it shows better performance than SGLang.
Here is also documentation for running DeepSeek with vLLM.

qcolombet · 2025-10-17T15:25:06Z

@cquil11, @functionstackx, I don't have permission to assign a reviewer, so I'm just tagging you both :).
I know you guys are figuring out the load on the CI before moving forward with the review.
Let me know what we can do to help.

functionstackx · 2025-10-17T17:05:02Z

@qcolombet yes, we are looking into it

@cquil11 is just trying to land a massive refactor PR first to reduce tech debt, and then we can look into this one

functionstackx · 2025-10-22T19:12:03Z

@merrymercy

functionstackx · 2026-01-05T14:29:08Z

@omirosh closing as this is a stale PR. Happy to take a look at AMD DeepSeek vLLM in addition to AMD's SGLang DeepSeek configs if the single-node setup uses a community vLLM image that supports FP4 and FP8

…ression

Upstream commit 52e697d (#108 "fix(nginx): raise file descriptor limit for nginx workers") prepends `ulimit -n 1048576 &&` to the nginx srun command. On clusters whose container inherits a sub-1M RLIMIT_NOFILE hard limit from slurmd/PAM, the bash builtin's setrlimit fails with EPERM (raising the hard rlimit needs CAP_SYS_RESOURCE in the init user namespace, which pyxis --container-remap-root does not grant). The `&&` short-circuits and nginx never starts; this was caught when re-running dsr1-fp4-gb200-dynamo-sglang.

Pin back to 698590e ("feat(config): cluster-wide default_bash_preamble for ulimits and the like (#104)"), the immediately prior commit, where nginx runs without the chained ulimit. Bump forward once upstream softens the ulimit to `|| true` or makes it opt-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
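The commit message above describes a hard failure when a process asks for more file descriptors than its hard limit allows. The following is a minimal Python sketch of the "best-effort" alternative it hints at: raise only the soft RLIMIT_NOFILE, clamp it to the inherited hard limit, and never abort on failure. The function name `raise_nofile_best_effort` is hypothetical and is not taken from the referenced repositories, whose actual fix is a shell `ulimit` call; this only illustrates the same idea using the standard `resource` module.

```python
import resource


def raise_nofile_best_effort(target: int) -> tuple[int, int]:
    """Try to raise the soft RLIMIT_NOFILE toward `target`; never raise an error.

    The hard limit is left untouched, because raising it needs extra
    privileges (the situation described in the commit message).
    Returns the (soft, hard) limits in effect afterwards.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    unlimited = resource.RLIM_INFINITY

    # Nothing to do if the soft limit is already high enough.
    if soft == unlimited or soft >= target:
        return soft, hard

    # Clamp the request to the hard limit inherited from the parent process.
    new_soft = target if hard == unlimited else min(target, hard)

    try:
        resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))
    except (ValueError, OSError):
        # Some platforms cap the value further; keep the current limits
        # instead of failing, analogous to appending `|| true` in shell.
        pass

    return resource.getrlimit(resource.RLIMIT_NOFILE)


if __name__ == "__main__":
    soft, hard = raise_nofile_best_effort(1048576)
    print(f"RLIMIT_NOFILE soft={soft} hard={hard}")
```

With this approach the caller continues to start its workload even when the requested limit is not reachable, and can log the limits it actually obtained.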

omirosh added 3 commits October 14, 2025 15:33

use vLLM for DeepSeek R1

e498f96

changes to dsr1-tmpl.yml to run vllm

8a40c40

port var

5c8630f

omirosh requested a review from a team as a code owner October 14, 2025 13:15

omirosh added 3 commits October 28, 2025 08:09

change to vllm for mi325, mi300, mi355

3c8f3de

vllm latest image to run dsr1

bbd5b13

325x slurm fix

002bf86

functionstackx closed this Jan 5, 2026

cquil11 added the AMD label Apr 8, 2026

cquil11 changed the title ~~Use vLLM framework for DeepSeek R1 on MI325 and MI355 hardware~~ [AMD] Use vLLM framework for DeepSeek R1 on MI325 and MI355 hardware Apr 8, 2026

omirosh wants to merge 6 commits into SemiAnalysisAI:main from omirosh:amd_dsr1


4 participants
