First serious capability suite: small, realistic maintenance tasks for evaluating coding-agent harness configurations and producing an AI-readable agent capability report.
No due date • 9/9 issues closed