R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation
vulnerability-detection knowledge-distillation reasoning large-language-models reinforcement-learning-from-ai-feedback
Updated Aug 5, 2025 - Python