cp: feat: Add deepseek flops tracker (1250) into r0.4.0#1309
cp: feat: Add deepseek flops tracker (1250) into r0.4.0#1309chtruong814 merged 1 commit intor0.4.0from
feat: Add deepseek flops tracker (1250) into r0.4.0#1309Conversation
📝 WalkthroughWalkthroughAdds DeepSeek-V3 support to FLOPs tracking by extending config-to-formula mapping and importing its formula. Updates unit tests to include a DeepSeek-V3 case with expected FLOPs. No public API changes. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
participant U as Caller
participant FT as FLOPsTracker
participant CFG as convert_config_to_flops_config
participant F as flops_formulas
U->>FT: count_flops(model_type="deepseek_v3", config)
FT->>CFG: convert_config_to_flops_config(config)
CFG->>CFG: Build FLOPSConfig (MoE/LoRA/QK/V dims)
CFG-->>FT: FLOPSConfig + model_type="deepseek_v3"
FT->>F: deepseekv3(FLOPSConfig)
F-->>FT: computed_flops
FT-->>U: computed_flops
note over FT,F: New dispatch branch for deepseek_v3
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes Possibly related PRs
Suggested labels
Suggested reviewers
Pre-merge checks and finishing touches❌ Failed checks (3 warnings)
✅ Passed checks (1 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro 📒 Files selected for processing (2)
🧰 Additional context used📓 Path-based instructions (2)**/*.py📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
nemo_rl/**/*.py📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
🧬 Code graph analysis (1)nemo_rl/utils/flops_tracker.py (1)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)
🔇 Additional comments (3)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
beep boop [🤖]: Hi @guyueh1 👋,
Summary by CodeRabbit
New Features
Tests