Remove unused parameters and improve add_tensor_parallel_hooks_to_module #44768
Merged
michaelbenayoun merged 4 commits into huggingface:main on Apr 9, 2026
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
3outeille reviewed on Mar 24, 2026
    except NotImplementedError as e:
        print(
            f"Trying to prepare {layer_name}, but it's not supported. Corresponding module: {module} Fix it's TP plan: {e}"
        )
    modules2names = {v: k for k, v in dict(model.named_modules()).items()}
Member

add_tensor_parallel_hooks_to_module is called inside for name, module in model.named_modules(): in distribute_model(), so isn't it a bit costly to create this dict at every iteration?
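To make the concern concrete, here is a minimal sketch (not the actual transformers code; add_hooks_sketch is a hypothetical stand-in): rebuilding the reverse mapping inside a function that itself runs once per module makes the loop quadratic in the number of modules.

```python
import torch.nn as nn

def add_hooks_sketch(model: nn.Module, layer_name: str) -> None:
    # Rebuilding this dict re-walks every module of the model, so calling
    # this once per module costs O(n_modules ** 2) overall.
    modules2names = {module: name for name, module in model.named_modules()}
    assert modules2names  # placeholder for the real hook-attaching logic

model = nn.Sequential(*[nn.Linear(8, 8) for _ in range(4)])
for name, module in model.named_modules():  # n iterations ...
    add_hooks_sketch(model, name)           # ... each doing O(n) work
```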
Member (Author)

It's OK imo: it is only called when we have a broken TP plan, not on every call.
We can also have a cache for this if you prefer.
Member (Author)

Addressed it by caching.
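For reference, one possible shape of such a cache — a hedged sketch with hypothetical names (_MODULE_NAMES, module_to_name), not the helper actually merged; note that a later commit in this PR replaced the cache with an explicit parameter:

```python
from weakref import WeakKeyDictionary
import torch.nn as nn

# One module -> name mapping per model; WeakKeyDictionary lets the entry
# disappear when the model itself is garbage-collected.
_MODULE_NAMES: WeakKeyDictionary = WeakKeyDictionary()

def module_to_name(model: nn.Module, module: nn.Module) -> str:
    # Build the reverse mapping at most once per model, then reuse it.
    if model not in _MODULE_NAMES:
        _MODULE_NAMES[model] = {m: n for n, m in model.named_modules()}
    return _MODULE_NAMES[model].get(module, "<unknown>")
```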
3outeille approved these changes on Apr 9, 2026
sirzechs66 pushed a commit to sirzechs66/transformers that referenced this pull request on Apr 18, 2026

Remove unused parameters and improve add_tensor_parallel_hooks_to_module (huggingface#44768)

* fix: remove unused parameters and improve add_tensor_parallel_hooks_to_module
* feat: cache module -> name mappings in hooks
* fix: do not use cache, use explicit parameter

Co-authored-by: Ferdinand Mom <47445085+3outeille@users.noreply.github.com>
The function add_tensor_parallel_hooks_to_module has unused parameters. In this PR we remove:

* tp_plan, which is not used.
* parameter_name, which is not used.
* layer_name, which is only used for logging purposes; we can infer it when that logging does happen. Inferring it is a bit more costly, but since the error path is not supposed to be hit, it is OK to proceed like that imo (a sketch of this lazy lookup follows the list).
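A minimal sketch of that lazy lookup, assuming a hypothetical prepare_module as a stand-in for the real TP preparation: the reverse mapping is built only on the (unexpected) error path, so the common case pays nothing.

```python
import torch.nn as nn

def prepare_module(model: nn.Module, module: nn.Module) -> None:
    try:
        # Stand-in for the real TP preparation, which may raise when the
        # module has no usable tensor-parallel plan.
        raise NotImplementedError("no TP plan for this module")
    except NotImplementedError as e:
        # Infer the layer name only now, when we actually need it for logging.
        modules2names = {m: n for n, m in model.named_modules()}
        layer_name = modules2names.get(module, "<unknown>")
        print(f"Trying to prepare {layer_name}, but it's not supported: {e}")

model = nn.Sequential(nn.Linear(8, 8))
prepare_module(model, model[0])
```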