Skip to content

Add DPO to test

e1519d1
Select commit
Loading
Failed to load commit list.
Merged

Direct Preference Optimization (DPO) style rewards #99

Add DPO to test
e1519d1
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs