
Fix: add num_hidden_layers property to T5GemmaConfig and add test for use_cache#41077

Open
priyankabolem wants to merge 1 commit into huggingface:main from priyankabolem:fix/t5gemma-use-cache

Conversation

@priyankabolem

This PR fixes a bug in T5GemmaConfig: the configuration did not expose num_hidden_layers, which generation/cache utilities require when use_cache=True.
• Added num_hidden_layers property to T5GemmaConfig.
• Ensured fallback to decoder’s layer count if not set explicitly.
• Added a unit test (test_generation_t5gemma.py) to verify generation runs successfully with cache enabled.
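The fix described above can be sketched as a delegating property. This is a minimal, self-contained illustration of the pattern, not the actual transformers code; the stand-in class names (`DecoderConfigSketch`, `T5GemmaConfigSketch`) and the default layer count are assumptions for demonstration only.

```python
# Sketch of the PR's approach: expose num_hidden_layers on the composite
# encoder-decoder config by falling back to the decoder sub-config's layer
# count when no explicit value is set. Stand-in classes, not transformers code.

class DecoderConfigSketch:
    def __init__(self, num_hidden_layers=8):
        self.num_hidden_layers = num_hidden_layers


class T5GemmaConfigSketch:
    def __init__(self, decoder=None, num_hidden_layers=None):
        self.decoder = decoder or DecoderConfigSketch()
        self._num_hidden_layers = num_hidden_layers

    @property
    def num_hidden_layers(self):
        # Fallback to the decoder's layer count if not set explicitly,
        # so cache utilities reading config.num_hidden_layers succeed.
        if self._num_hidden_layers is not None:
            return self._num_hidden_layers
        return self.decoder.num_hidden_layers


cfg = T5GemmaConfigSketch()
print(cfg.num_hidden_layers)  # → 8 (delegated to the decoder sub-config)
```

With this property in place, code that blindly reads `config.num_hidden_layers` (as the cache-initialization path does) works on the composite config without needing to know about the encoder/decoder split.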

Testing
• Added test_generate_use_cache_works_for_t5gemma.
• Verified test passes locally with pytest.

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: t5gemma

@priyankabolem
Author

Local pytest for t5gemma passes; the failing CI jobs appear unrelated to this change. Please advise if any changes are needed.

@ydshieh
Collaborator

ydshieh commented Sep 23, 2025

Hi @priyankabolem Could you share an example (or a test in the library) that is failing prior to this PR?

@priyankabolem
Author

Hi @ydshieh, thanks for the feedback!

I ran the test locally before and after the fix to confirm the issue:

  • On main (before fix): test_generate_use_cache_works_for_t5gemma fails with AttributeError: 'T5GemmaConfig' object has no attribute 'num_hidden_layers'.
  • On my branch (after fix): the same test passes.

This shows that the fix (adding num_hidden_layers to T5GemmaConfig) resolves the issue.
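The failure mode on main can be reproduced in miniature without transformers installed. This is an illustrative stand-in, not the real config or cache code; `ConfigWithoutLayers` and `init_cache` are hypothetical names for demonstration.

```python
# Self-contained sketch of the pre-fix failure: a cache utility reads
# config.num_hidden_layers, but the config never exposes that attribute.

class ConfigWithoutLayers:
    # No num_hidden_layers attribute, mirroring T5GemmaConfig on main.
    pass


def init_cache(config):
    # Hypothetical stand-in for a cache utility needing the layer count.
    return [None] * config.num_hidden_layers


try:
    init_cache(ConfigWithoutLayers())
except AttributeError as exc:
    print(type(exc).__name__)  # → AttributeError
```

Adding a `num_hidden_layers` property (as in this PR) removes the AttributeError at the source rather than patching each call site.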
