Skip to content

(ci)(recipe): Add DeepSeek-R1 FP4 TP4 validation and DS recipe for SGLang-ATOM#614

Open
zhuyuhua-v wants to merge 5 commits intomainfrom
yuhua/sgl-dsrecipe-fp4ci
Open

(ci)(recipe): Add DeepSeek-R1 FP4 TP4 validation and DS recipe for SGLang-ATOM#614
zhuyuhua-v wants to merge 5 commits intomainfrom
yuhua/sgl-dsrecipe-fp4ci

Conversation

@zhuyuhua-v
Copy link
Copy Markdown
Collaborator

@zhuyuhua-v zhuyuhua-v commented Apr 20, 2026

Motivation

  • add DeepSeek-R1-FP4 TP4 coverage to SGLang-ATOM accuracy flows, including nightly/manual validation and dashboard metadata, with a 0.85 GSM8K threshold
  • align the DeepSeek-R1-FP8 TP4 GSM8K threshold to 0.91 across the ATOM SGLang PR and nightly accuracy workflows to avoid data floating issues.
  • add recipes/sglang_atom/DeepSeek-R1.md in the same style as the vLLM-ATOM recipe, covering server launch, benchmarking, accuracy validation, and profiling usage

Signed-off-by: zhuyuhua-v <yuhzhu@amd.com>
@ZLkanyo009 ZLkanyo009 marked this pull request as ready for review April 21, 2026 07:50
qichu-yun
qichu-yun previously approved these changes Apr 21, 2026
wuhuikx
wuhuikx previously approved these changes Apr 22, 2026
valarLip
valarLip previously approved these changes Apr 23, 2026
@valarLip
Copy link
Copy Markdown
Collaborator

image still wip?

Copilot AI review requested due to automatic review settings April 23, 2026 06:21
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds DeepSeek-R1 FP4 (MXFP4 weights) TP4 accuracy coverage to the ATOM SGLang CI/validation flows and documents how to run/benchmark/validate DeepSeek-R1 using the SGLang-ATOM backend.

Changes:

  • Add DeepSeek-R1 FP4 TP4 (MXFP4 checkpoint) to PR CI accuracy matrix and to nightly/manual accuracy validation matrix.
  • Align DeepSeek-R1 FP8 TP4 GSM8K accuracy threshold from 0.92 to 0.91 across workflows and dashboard model metadata.
  • Add an SGLang-ATOM DeepSeek-R1 recipe covering server launch, benchmarking, profiling, and GSM8K validation.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.

File Description
recipes/sglang_atom/DeepSeek-R1.md New SGLang-ATOM DeepSeek-R1 recipe (launch, benchmark, profiling, lm-eval).
.github/workflows/atom-sglang-test.yaml Updates PR CI accuracy threshold and adds DeepSeek-R1 FP4 TP4 to the matrix.
.github/workflows/atom-sglang-accuracy-validation.yaml Adds manual toggle + nightly coverage for DeepSeek-R1 FP4 TP4; aligns FP8 TP4 threshold.
.github/benchmark/sglang_models_accuracy.json Adds/updates dashboard metadata for the two DeepSeek-R1 TP4 accuracy entries (thresholds, baseline fields).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread .github/benchmark/sglang_models_accuracy.json Outdated
Comment thread recipes/sglang_atom/DeepSeek-R1.md
Comment thread recipes/sglang_atom/DeepSeek-R1.md
Comment thread .github/workflows/atom-sglang-test.yaml Outdated
Comment thread .github/workflows/atom-sglang-accuracy-validation.yaml Outdated
Signed-off-by: zhuyuhua-v <yuhzhu@amd.com>
@zhuyuhua-v zhuyuhua-v dismissed stale reviews from wuhuikx, valarLip, and qichu-yun via 91f30ab April 23, 2026 09:18
@zhuyuhua-v zhuyuhua-v marked this pull request as draft April 24, 2026 05:24
@zhuyuhua-v zhuyuhua-v marked this pull request as ready for review April 24, 2026 05:26
Copilot AI review requested due to automatic review settings April 24, 2026 05:26
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread .github/workflows/atom-sglang-test.yaml
Comment thread .github/workflows/atom-sglang-accuracy-validation.yaml
Comment thread .github/workflows/atom-sglang-accuracy-validation.yaml
Comment thread .github/benchmark/sglang_models_accuracy.json
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants