# kto
Here are 4 public repositories matching this topic...
🚀 Optimize preferences effectively with ORPO, a framework for monolithic preference optimization without a reference model.
data reinforcement-learning medical human-pose-estimation gpt lora privacy-preserving ppo dpo huggingface kto low-resolution-images model-averaging llm generative-ai rlhf qwen medicalgpt
Updated Feb 2, 2026 - Python