cp: fix: grad norm calculation for dtensor v2 (1693) into r0.5.0 #1696
Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
📝 Walkthrough: The policy worker now multiplies the loss by dp_size and cp_size before backpropagation to compensate for FSDP gradient reduction, so that each rank's gradient contribution is correct. Corresponding gradient norm validation checks were added to the test metrics.
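The scaling described in the walkthrough can be illustrated with a toy simulation. This is a minimal sketch in pure Python, not the code from this PR: it assumes each rank's loss is already normalized by the global sample count and that the gradient reduction averages over dp_size * cp_size ranks. The helpers `local_grad` and `mean_reduce` and the sample data are hypothetical.

```python
# Toy model: loss = sum_i (w * x_i - y_i)^2 / global_count, sharded across ranks.
# Assumption: the cross-rank reduction averages gradients, so a globally
# normalized loss ends up divided by the number of ranks once too often.

def local_grad(w, samples, global_count):
    # d/dw of this rank's share of the globally normalized loss
    return sum(2.0 * (w * x - y) * x for x, y in samples) / global_count

def mean_reduce(grads):
    # Hypothetical stand-in for a mean-style gradient reduction across ranks
    return sum(grads) / len(grads)

dp_size, cp_size = 2, 2
world = dp_size * cp_size
data = [(1.0, 2.0), (2.0, 3.0), (3.0, 5.0), (4.0, 4.0)]
shards = [data[i::world] for i in range(world)]  # one sample per rank
w = 0.5
n = len(data)

# Single-process reference gradient over all samples
reference = local_grad(w, data, n)

# Without scaling, the averaged gradient is too small by a factor of `world`
unscaled = mean_reduce([local_grad(w, s, n) for s in shards])

# Scaling the loss by dp_size * cp_size scales each local gradient by the
# same factor (the gradient is linear in the loss), undoing the averaging
scaled = mean_reduce([world * local_grad(w, s, n) for s in shards])

print(reference, unscaled, scaled)
```

Under these assumptions the scaled result matches the single-process gradient, so a gradient norm computed from it also matches; the unscaled result is smaller by a factor of dp_size * cp_size.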
🧬 Code graph analysis (1): nemo_rl/models/policy/workers/dtensor_policy_worker_v2.py (1)
…VIDIA-NeMo#1696) Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com> Co-authored-by: Hemil Desai <hemild@nvidia.com>
Summary by CodeRabbit
Bug Fixes
Tests