Skip to content

Fix OOM for gemma4 31B on tot for gemma4_31b.yaml #1927

@athitten

Description

@athitten

Describe the bug

After #1904 got merged, main is running into OOM while running gemma4_31b.yaml which uses just fsdp + ac (does not use any parallelism). Creating this issue to investigate this further.

Steps/Code to reproduce bug

Please list minimal steps or code snippet for us to be able to reproduce the bug.

A helpful guide on on how to craft a minimal bug report http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports.

Expected behavior

A clear and concise description of what you expected to happen.

Additional context

Add any other context about the problem here.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions