Skip to content

Merge branch 'main' into pchadha/loss-scaling

ff02b5b
Select commit
Loading
Failed to load commit list.
Merged

fix: gradient should be averaged instead of summed across mbs #86

Merge branch 'main' into pchadha/loss-scaling
ff02b5b
Select commit
Loading
Failed to load commit list.