
Nit about model_accepts_loss_kwargs for loss #35113

Closed

ArthurZucker wants to merge 1 commit into main from nit-ga-condition

Conversation

@ArthurZucker Collaborator

What does this PR do?

There was a typo in #34915; since the tests were passing, I did not pay attention to it. Thanks to @techkang for reporting it.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker Collaborator Author

import inspect

from transformers import Wav2Vec2Model

# Check whether the forward signature has a **kwargs (VAR_KEYWORD) parameter.
sig = inspect.signature(Wav2Vec2Model.forward)
print(any(p.kind == inspect.Parameter.VAR_KEYWORD for p in sig.parameters.values()))

This returns False. But in the torch example, the unwrapped model's forward does seem to accept loss_kwargs.
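For contrast, here is the same inspect check applied to two toy forwards (hypothetical stand-ins, not the real Wav2Vec2 signature) to show exactly what flips the result:

```python
import inspect

# Hypothetical stand-in: no **kwargs in the signature.
def forward_plain(self, input_values, attention_mask=None):
    pass

# Hypothetical stand-in: a **loss_kwargs catch-all is present.
def forward_with_kwargs(self, input_ids, labels=None, **loss_kwargs):
    pass

def accepts_loss_kwargs(fn):
    # True iff the signature has a **kwargs-style (VAR_KEYWORD) parameter.
    sig = inspect.signature(fn)
    return any(p.kind == inspect.Parameter.VAR_KEYWORD for p in sig.parameters.values())

print(accepts_loss_kwargs(forward_plain))        # → False
print(accepts_loss_kwargs(forward_with_kwargs))  # → True
```

So the check only looks at the forward signature; any `**kwargs` parameter, whatever its name, makes it return True.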
cc @muellerzr

@techkang Contributor

techkang commented Dec 6, 2024

This PR still confuses me, as I mentioned in the previous PR.
There are two ways to use num_items_in_batch to fix the GA loss issue:

  1. Use num_items_in_batch in the loss function defined by the model. In this case, model_accepts_loss_kwargs is True.
  2. The model doesn't have a loss function, or the user has a self-defined loss function (compute_loss_func).

However, this PR makes the second method invalid.
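A minimal sketch of the two paths (all names here are hypothetical illustrations, not the Trainer's actual code), assuming the GA fix works by dividing a summed loss by num_items_in_batch:

```python
# Way 1: the model's own forward consumes num_items_in_batch via **kwargs
# and normalizes the summed loss itself.
def model_forward_with_loss(summed_loss, label_count, **loss_kwargs):
    n = loss_kwargs.get("num_items_in_batch")
    if n is not None:
        return summed_loss / n        # GA-correct normalization
    return summed_loss / label_count  # fallback: per-batch mean

# Way 2: a user-supplied loss function normalizes outside the model,
# so the model's forward never needs to accept loss kwargs.
def user_compute_loss(summed_loss, num_items_in_batch=None):
    if num_items_in_batch is not None:
        return summed_loss / num_items_in_batch
    return summed_loss

print(model_forward_with_loss(12.0, 4, num_items_in_batch=3))  # → 4.0
print(user_compute_loss(12.0, num_items_in_batch=3))           # → 4.0
```

The point of contention: gating on the forward signature alone covers way 1, while way 2 needs num_items_in_batch even when the forward has no **kwargs.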

@ArthurZucker Collaborator Author

Hey!

Use num_items_in_batch in loss function defined by model. In this case, model_accepts_loss_kwargs is True.

No, model_accepts_loss_kwargs only depends on the model's forward pass, not on the loss function defined by the model. If the model's forward accepts **kwargs, we pass num_items_in_batch; otherwise we cannot.
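The dispatch being described can be sketched like this (hypothetical helper names; the real Trainer logic is more involved):

```python
import inspect

# Hypothetical forwards: one rejects extra kwargs, one catches them.
def forward_no_kwargs(x):
    return x

def forward_kwargs(x, **loss_kwargs):
    return x, loss_kwargs.get("num_items_in_batch")

def call_forward(forward, inputs, num_items_in_batch):
    # Only pass num_items_in_batch when the signature has a **kwargs
    # (VAR_KEYWORD) parameter; otherwise it would raise a TypeError.
    sig = inspect.signature(forward)
    accepts = any(p.kind == inspect.Parameter.VAR_KEYWORD
                  for p in sig.parameters.values())
    if accepts:
        return forward(**inputs, num_items_in_batch=num_items_in_batch)
    return forward(**inputs)

print(call_forward(forward_no_kwargs, {"x": 1}, 8))  # → 1
print(call_forward(forward_kwargs, {"x": 1}, 8))     # → (1, 8)
```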

The model doesn't have loss function or user has self-defined loss function, which is compute_loss_func.
However, this PR will make the second method invalid.

Even if the user has a self-defined loss function, if the forward pass does not support num_items_in_batch, we ought not to pass kwargs. I might be confusing this with the Trainer's compute_loss_func; in that case, I'll let @muellerzr take a look.

@muellerzr Contributor

@ArthurZucker I disagree with 2: it was explicitly designed that way to catch the case where the model doesn't have this figured out yet, so the user can/should handle it manually themselves instead.
