Describe the bug
After #1904 got merged, main is running into OOM while running gemma4_31b.yaml which uses just fsdp + ac (does not use any parallelism). Creating this issue to investigate this further.
Steps/Code to reproduce bug
Please list minimal steps or code snippet for us to be able to reproduce the bug.
A helpful guide on on how to craft a minimal bug report http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports.
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here.
Describe the bug
After #1904 got merged, main is running into OOM while running gemma4_31b.yaml which uses just fsdp + ac (does not use any parallelism). Creating this issue to investigate this further.
Steps/Code to reproduce bug
Please list minimal steps or code snippet for us to be able to reproduce the bug.
A helpful guide on on how to craft a minimal bug report http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports.
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here.