Fix AttributeError: add num_hidden_layers property to T5GemmaConfig#40920

Closed
avchauzov wants to merge 2 commits into huggingface:main from avchauzov:fix-t5gemma-num-hidden-layers

Conversation

@avchauzov
Contributor

What does this PR do?

This PR fixes an AttributeError that occurs when using T5Gemma models with Seq2SeqTrainer. The issue arises because DynamicCache in cache_utils.py expects config.num_hidden_layers to exist, but T5GemmaConfig doesn't have this attribute.

Changes:

  • Added num_hidden_layers property to T5GemmaConfig that returns encoder.num_hidden_layers
  • Added test to verify the property works correctly and is read-only

Problem: When training T5Gemma models with Seq2SeqTrainer, the code fails with:
AttributeError: 'T5GemmaConfig' object has no attribute 'num_hidden_layers'

Solution: Follow the pattern used by other encoder-decoder models (ProphetNet, Funnel) by adding a computed property that returns the encoder's layer count.

Fixes #40901
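The pattern described above can be sketched as follows. This is a minimal standalone illustration, not the actual transformers source; `EncoderConfig` and the simplified `T5GemmaConfig` here are stand-ins for the real nested config classes:

```python
class EncoderConfig:
    """Simplified stand-in for the nested encoder sub-config."""

    def __init__(self, num_hidden_layers: int):
        self.num_hidden_layers = num_hidden_layers


class T5GemmaConfig:
    """Simplified stand-in: exposes the encoder's layer count read-only."""

    def __init__(self, encoder: EncoderConfig):
        self.encoder = encoder

    @property
    def num_hidden_layers(self) -> int:
        # Callers such as DynamicCache expect `config.num_hidden_layers`;
        # delegate to the encoder sub-config instead of storing a duplicate.
        return self.encoder.num_hidden_layers

    @num_hidden_layers.setter
    def num_hidden_layers(self, value):
        # Setting the top-level attribute would be ambiguous for an
        # encoder-decoder config, so reject it explicitly.
        raise NotImplementedError(
            "Set `encoder.num_hidden_layers` and `decoder.num_hidden_layers` separately."
        )


config = T5GemmaConfig(EncoderConfig(num_hidden_layers=12))
print(config.num_hidden_layers)  # -> 12
```

A read-only property keeps the encoder sub-config as the single source of truth while satisfying callers that probe the flat attribute.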

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @SunMarc

(T5Gemma model configuration and Seq2SeqTrainer integration)

- Add num_hidden_layers property that returns encoder.num_hidden_layers
- Fixes issue with Seq2SeqTrainer and DynamicCache expecting config.num_hidden_layers
- Resolves #40901
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: t5gemma

Member

@SunMarc SunMarc left a comment


Thanks! Left a comment and pinging @gante for the generation part.

Comment on lines +619 to +620
encoder_num_hidden_layers=self.model_tester.encoder_num_hidden_layers,
decoder_num_hidden_layers=self.model_tester.decoder_num_hidden_layers,
Member


not sure what you are trying to do here as it doesn't exist

Comment on lines +193 to +194
encoder_num_hidden_layers=self.encoder_num_hidden_layers,
decoder_num_hidden_layers=self.decoder_num_hidden_layers,
Member


same here

Comment on lines +330 to +341
@property
def num_hidden_layers(self) -> int:
    """Number of hidden layers in the encoder."""
    return self.encoder.num_hidden_layers

@num_hidden_layers.setter
def num_hidden_layers(self, value):
    """Set number of hidden layers in the encoder."""
    raise NotImplementedError(
        "This model does not support setting `num_hidden_layers`. "
        "Please set `encoder.num_hidden_layers` and `decoder.num_hidden_layers` separately."
    )
Member


not sure if we want this, maybe it will be better to just fix the generation to get the right attribute ?

@SunMarc SunMarc requested a review from gante September 17, 2025 16:29
Contributor

@gante gante left a comment


Thank you for opening the PR @avchauzov, but the fix is already addressed in #40939 🤗

The actual fix consists of removing a bad overwrite and then fixing a few subtle bugs :)

@SunMarc
Member

SunMarc commented Sep 18, 2025

Nice !

@avchauzov
Contributor Author

@gante Thank you! I'm closing this PR then!

@avchauzov avchauzov closed this Sep 21, 2025
@avchauzov avchauzov deleted the fix-t5gemma-num-hidden-layers branch September 21, 2025 00:46

Development

Successfully merging this pull request may close these issues.

Cannot fine-tune T5Gemma with Seq2SeqTrainer

3 participants