Skip to content

如何评估SFT训练性能 #122

@2402227288

Description

@2402227288

对于SFT训练,除了可以看他的mean_token_accuracy是否稳定0.9左右,怎么判断是否训练好了,能否进行强化学习了,是否需要通过pass@k这种形式进行进一步的测验呢,想请教下作者当时是如何评估的

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions