Skip to content

[Fix] Override data parallel size for accelerator configs#611

Open
YouNeedCryDear wants to merge 1 commit into
mainfrom
fix/data-parallel-override
Open

[Fix] Override data parallel size for accelerator configs#611
YouNeedCryDear wants to merge 1 commit into
mainfrom
fix/data-parallel-override

Conversation

@YouNeedCryDear
Copy link
Copy Markdown
Collaborator

What this PR does

Adds accelerator-specific runtime argument overrides for data parallel size, covering --dp-size, --dp, --data-parallel-size, and -dp. It also extends tensor and pipeline parallel overrides to handle -tp and -pp aliases.

Why we need it

tensorParallelismOverride.dataParallelSize is part of the ServingRuntime accelerator config API, but the runtime argument override logic only applied tensor and pipeline parallel sizes. As a result, accelerator configs could set dataParallelSize without changing the generated runtime command. This makes DP override behavior match TP and PP override behavior.

Fixes # (not filed)

How to test

env GOCACHE=/private/tmp/ome-go-build-cache go test ./pkg/controller/v1beta1/inferenceservice/components -count=1

Checklist

  • Tests added/updated (if applicable)
  • Docs updated (if applicable) - N/A
  • make test passes locally

@github-actions github-actions Bot added inferenceservice InferenceService controller changes controller Controller changes tests Test changes labels May 13, 2026
@YouNeedCryDear YouNeedCryDear marked this pull request as ready for review May 13, 2026 21:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

controller Controller changes inferenceservice InferenceService controller changes tests Test changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant