Conversation
Signed-off-by: Brian Yu <bxyu@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
i think eventually these should make their way into examples/configs/recipes/llm, which enforces that there must be a nightly test for each yaml config
i also think it would be good to have one base penguin config, e.g. examples/configs/grpo_penguin.yaml, and then have all the other configs derive from it. since the environment is slightly different b/c it's penguin and not our openmathinstruct ones, you should probably update the pre-commit hook here:
https://github.com/NVIDIA-NeMo/RL/blob/main/.pre-commit-config.yaml#L71-L73
so that it also enforces minimization of these recipes
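A rough sketch of what such an added pre-commit entry could look like. This is only illustrative: the hook id, script path, and file pattern below are hypothetical and would need to match the repo's actual pre-commit config and checker script, not copied from them:

```yaml
# Hypothetical sketch of a local pre-commit hook entry; ids and paths
# are placeholders, not the repo's real values.
- repo: local
  hooks:
    - id: check-penguin-recipe-minimization   # illustrative id
      name: Enforce minimized penguin recipe configs
      entry: python tools/check_recipe_minimization.py   # hypothetical script
      language: system
      files: ^examples/configs/recipes/llm/grpo_penguin_.*\.yaml$
```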
wdyt about examples/penguin/run_grpo_penguin.py -> examples/run_grpo_penguin.py? i kind of think it should live higher up so it's more easily discoverable.
also related: wdyt about moving your configs like examples/penguin/grpo_workbench_qwen3_4binstruct.yaml to examples/configs/recipes/llm/grpo_penguin_workbench_qwen3_4binstruct.yaml?
maybe this should move to tests/functional/run_penguin_single_node_sanity_tests.sh and then be added to https://github.com/NVIDIA-NeMo/RL/blob/main/tests/functional/L1_Functional_Tests_GPU.sh so it gets auto-picked up by our CI. you'll probably need to guard this test, though, so that it is skipped when penguin isn't installed
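A minimal sketch of such a guard at the top of the sanity-test script. It assumes penguin is importable as a Python package when installed; the probe would need to be adjusted to however penguin availability is actually detected in this repo:

```shell
# Probe for penguin; assumes it is importable as a Python package.
if python3 -c "import penguin" >/dev/null 2>&1; then
    PENGUIN_AVAILABLE=1
else
    PENGUIN_AVAILABLE=0
fi

if [ "$PENGUIN_AVAILABLE" -eq 0 ]; then
    # In the real script you would `exit 0` here so CI records a clean skip.
    echo "penguin not installed; skipping penguin sanity tests"
else
    echo "running penguin sanity tests"
fi
```

Exiting 0 on the skip path keeps the L1 functional suite green on runners without penguin, while runners that have it still execute the test.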
btw, does this use 8 GPUs? we only have 2 A100 GPUs in the CI
```python
# generation_config["max_new_tokens"]
penguin_environment = task_to_env["penguin"]
results = ray.get(penguin_environment.run_rollouts.remote(penguin_rows))
```
is it possible to statically type the return, e.g. results: SomeStruct, so the IDE can help us understand how to work with this object and support go-to-definition? it's hard, just from reading the code, to know what to expect inside "results"
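One way to address this is a small dataclass for the rollout results. This is only a sketch: `RolloutResult`, its fields, and `summarize` are hypothetical names for illustration, and the actual Penguin rollout schema may differ:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class RolloutResult:
    """Hypothetical typed container for one rollout's output.

    Field names are illustrative; the real Penguin return schema may differ.
    """
    reward: float
    response_text: str
    metadata: dict[str, Any] = field(default_factory=dict)

def summarize(results: list[RolloutResult]) -> float:
    """Mean reward across rollouts. With a typed result, the IDE can
    autocomplete `.reward` and go-to-definition on RolloutResult."""
    return sum(r.reward for r in results) / len(results)

# Example usage with dummy data:
results = [
    RolloutResult(reward=1.0, response_text="a"),
    RolloutResult(reward=0.0, response_text="b"),
]
print(summarize(results))  # 0.5
```

Even if `run_rollouts` keeps returning dicts over the Ray boundary, converting to a dataclass at the call site gives readers one place to go-to-definition on.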
Closed in favor of #1450
What does this PR do?
Add a one line overview of what this PR aims to accomplish.
Issues
List issues that this PR closes (syntax):
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
Additional Information