Skip to content

fix: gradient should be averaged instead of summed across mbs#86

Merged
parthchadha merged 5 commits intomainfrom
pchadha/loss-scaling
Mar 27, 2025
Merged

fix: gradient should be averaged instead of summed across mbs#86
parthchadha merged 5 commits intomainfrom
pchadha/loss-scaling

Commits

Commits on Mar 27, 2025