enable tp for benchmark #43750
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
@remi-or, please help review, thanks very much.
Hi @remi-or, any thoughts on enabling TP and CP in the benchmark tool?
remi-or left a comment:
Some nits, but otherwise lgtm. Have you tried out the benchmarking with TP? And if so, how were the results? I am curious what applications you are targeting this for 🙂 !
Actually, I would like to leverage this benchmark tool on XPU for a broader set of models, so I need to run bigger models (like the MoE series). One card cannot run such models due to memory limitations, so I need to run with EP and TP across multiple cards. TP and EP support in the kernels path is also on our radar.
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
Ok, this looks good! I would like to test on my end before merging, will do so soon. My question was: did you manage to run the benchmarker in a distributed setting on your end? Or is this a small change needed for that but not enough to enable the feature? Thanks
Yes, I tested on my side using `torchrun --nproc-per-node 2 run_benchmarks.py --enable-tp`; this change is enough to enable the feature.
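For readers unfamiliar with the launch flow above: `torchrun` spawns one process per device and sets environment variables (`WORLD_SIZE`, `RANK`) that the script can read to decide its tensor-parallel degree. The sketch below is a minimal, hypothetical illustration of how an `--enable-tp` flag might be wired to those variables; it is not the PR's actual implementation, and the function and dict keys are illustrative names only.

```python
# Hypothetical sketch: wiring an --enable-tp flag to torchrun's env vars.
# NOT the PR's actual code; parse_tp_config and its keys are made up.
import argparse
import os


def parse_tp_config(argv=None):
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--enable-tp",
        action="store_true",
        help="shard the model across all ranks launched by torchrun",
    )
    args = parser.parse_args(argv)
    # torchrun exports WORLD_SIZE and RANK into each spawned process;
    # default to a single-process run when they are absent.
    world_size = int(os.environ.get("WORLD_SIZE", "1"))
    rank = int(os.environ.get("RANK", "0"))
    tp_size = world_size if args.enable_tp else 1
    return {"tp_size": tp_size, "rank": rank}


if __name__ == "__main__":
    # Without torchrun, WORLD_SIZE defaults to 1, so tp_size is 1
    # whether or not the flag is passed.
    print(parse_tp_config([]))
    print(parse_tp_config(["--enable-tp"]))
```

Under `torchrun --nproc-per-node 2`, each of the two processes would see `WORLD_SIZE=2`, so `tp_size` becomes 2 when `--enable-tp` is passed.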
Enable TP in benchmark_v2 to ensure large models can run.