Skip to content

Add unit tests for ClippedPGLossFn (especially for upcoming vocab parallel path). #145

@SahilJain314

Description

@SahilJain314

Current unit tests only cover the NLL loss. Should also have tests for ClippedPGLossFn (superset of REINFORCE, PPO, GRPO, DAPO losses). Upcoming PR adds vocab parallel to the loss function as well

Metadata

Metadata

Assignees

Labels

testingRelated to testing

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions