Skip to content

Merge branch 'main' into aroshanghias/fix-vlm-grpo-flaky-metric-check

9d3e4bf
Select commit
Loading
Failed to load commit list.
Open

fix: use smoothed reward metric for VLM GRPO CLEVR convergence tests #2015

Merge branch 'main' into aroshanghias/fix-vlm-grpo-flaky-metric-check
9d3e4bf
Select commit
Loading
Failed to load commit list.