Rendering correctness test determinism by huidongc · Pull Request #5353 · isaac-sim/IsaacLab

huidongc · 2026-04-22T06:14:04Z

Description

For the rendering correctness test, we don't really need the env.reset() call to fill the camera output buffers. Instead, if we remove env.reset(), the camera output buffers will be filled on the invocation of camera.data:

    @property
    def data(self) -> CameraData:
        # update sensors if needed
        self._update_outdated_buffers()
        # return the data
        return self._data

This means we can remove env.reset() calls in the rendering correctness test to avoid non-deterministic initial pose. Articulation bodies will be at their default pose for rendering for all combos. With this removal I can set the max pixel diff threshold to smaller (stricter) values:

    "cartpole": 1.0,      # decreased from 2.0
    "shadow_hand": 3.0,   # decreased from 8.0
    "dexsuite_kuka": 4.0, # decreased from 10.0

The test will become more sensitive to capture rendering changes but hopefully it can still tolerate minor pixel noise.

Fixes # (issue)

Type of change

Test-only change

Checklist

I have read and understood the contribution guidelines
I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

…nistic initial poses

greptile-apps · 2026-04-22T06:16:22Z

Greptile Summary

This PR improves rendering correctness test determinism by removing the per-fixture env_cfg.seed and env.reset(seed=_ENV_SEED) calls (which introduced non-determinism by perturbing state before capture), centralizing per-environment pixel-difference thresholds in _MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME, and regenerating the golden images under the new deterministic baseline. The registered-task list is refactored from a flat list of strings to a list of (task_id, env_name) tuples so each task can pick the correct threshold.

Confidence Score: 5/5

Safe to merge — changes are test-only and tighten correctness thresholds under a more deterministic baseline.

All findings are P2 style suggestions. The core logic change (removing seed/reset calls and centralising thresholds) is sound and well-motivated. Golden images have been regenerated. No production code is affected.

No files require special attention; all changes are confined to the test suite and its golden images.

Important Files Changed

Filename	Overview
source/isaaclab_tasks/test/test_rendering_correctness.py	Removes env seed and reset calls from all fixtures; centralizes pixel-diff thresholds via `_MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME`; refactors `_RENDER_CORRECTNESS_TASK_IDS` to tuples; tightens thresholds. Minor KeyError risk if a new env_name tuple is added without updating the threshold dict.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Test fixture created\ne.g. cartpole_env] --> B[env constructed\nno seed / no reset]
    B --> C[Camera.data.output accessed\nbuffers updated on access]
    C --> D[_validate_camera_outputs called\nwith env_name threshold lookup]
    D --> E{env_name in\n_MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME?}
    E -- Yes --> F[Compare against golden image\nusing per-env threshold]
    E -- No --> G[KeyError at runtime]
    F --> H{diff_pct <= threshold?}
    H -- Pass --> I[Test passes]
    H -- Fail --> J[pytest.fail with diff details]

_{Reviews (2): Last reviewed commit: "smaller (stricter) threshold" | Re-trigger Greptile}

greptile-apps · 2026-04-22T06:16:26Z

        camera_outputs_nested_dict = _collect_camera_outputs(env)
        num_camera_outputs = len(camera_outputs_nested_dict)


Camera data may be empty without a prior reset

gym.make() constructs the environment but, per the Gymnasium contract, sensor/observation data is not guaranteed to be populated until the first reset() call. If Isaac Sim defers filling camera buffers to reset(), then _collect_camera_outputs will return an empty dict and the assertion on line 794 (assert num_camera_outputs == 1) will fail with a confusing message rather than indicating the root cause.

The same pattern was removed from the other three env fixtures (shadow_hand_env, cartpole_env, dexsuite_kuka_allegro_lift_env). If those fixtures also rely on camera data being populated at construction time, any future environment that defers rendering until reset() will silently break here.

this is fine. we will check whether camera outputs exist and contain non-zero pixels inside _validate_camera_outputs.

Camera output buffers will be updated on invocation of Camera.data():

@property def data(self) -> CameraData: # update sensors if needed self._update_outdated_buffers() # return the data return self._data

Got it, thanks for clarifying! I see now that _validate_camera_outputs has the assertion assert len(camera_outputs) > 0 plus the per-tensor non-zero check, so empty or blank outputs will fail clearly rather than silently passing. That addresses my concern. 👍

Thank you for the clarification! You're right — the Camera.data property's lazy update via _update_outdated_buffers() ensures buffers are populated on access, making the explicit reset() unnecessary for camera data retrieval in this context.

This is a nice pattern that decouples buffer population from the reset lifecycle. My concern about the Gymnasium contract was misplaced here since Isaac Lab's camera sensor implementation handles this internally. 👍

huidongc · 2026-04-22T08:31:02Z

+# needs to be large enough to tolerate minor rendering noise while small enough to catch unexpected changes.
+_MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME = {
+    "cartpole": 1.0,
+    "shadow_hand": 3.5,


it was 8.0 before

huidongc · 2026-04-22T08:31:36Z

+# The value is set case by case based on the screen space taken up by the env in camera output images. It
+# needs to be large enough to tolerate minor rendering noise while small enough to catch unexpected changes.
+_MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME = {
+    "cartpole": 1.0,


it was 2.0 before

huidongc · 2026-04-22T08:32:03Z

+_MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME = {
+    "cartpole": 1.0,
+    "shadow_hand": 3.5,
+    "dexsuite_kuka": 4.5,


it was 10.0 before

isaaclab-review-bot

🤖 Isaac Lab Review Bot

Summary

This PR removes env.reset() calls from rendering correctness tests to eliminate non-deterministic initial articulation poses. Instead, camera buffers are filled via the lazy _update_outdated_buffers() mechanism when accessing camera.data. This allows tighter pixel difference thresholds (cartpole: 2.0→1.0, shadow_hand: 8.0→3.0, dexsuite_kuka: 10.0→4.0). The implementation is correct and well-reasoned.

Architecture Impact

Changes are self-contained to the test file. No cross-module impact — the _MAX_DIFFERENT_PIXELS_PERCENTAGE_BY_ENV_NAME dictionary centralizes thresholds and is only consumed within this test module.

Implementation Verdict

Ship it — Clean implementation with good code organization. The approach of relying on default articulation poses instead of randomized reset states is sound for rendering correctness tests.

Test Coverage

This PR is the test. The changes improve test reliability by removing a source of non-determinism. The lowered thresholds make the tests more sensitive to actual rendering regressions while the removal of env.reset() eliminates false positives from initial pose variance. The 71 updated golden images reflect the new deterministic default poses.

CI Status

CI shows 3 test failures, but none are caused by this PR:

Isaac-Cartpole-RGB-TheiaTiny-v0 failures in test_environments.py and test_environments_with_stage_in_memory.py — these are Theia vision model tests, unrelated to rendering correctness tests
Training benchmark failures (duration/reward thresholds) — flaky performance benchmarks, not related to this PR

The rendering correctness tests themselves (shard 3/3) passed ✅

Findings

🔵 Improvement: test_rendering_correctness.py:775-777 — Consider documenting the env_name mapping rationale
The _RENDER_CORRECTNESS_TASK_IDS list now includes env_name mappings (e.g., Isaac-Repose-Cube-Shadow-Vision-Direct-v0 maps to shadow_hand). A brief comment explaining why Shadow Hand is used for the Repose-Cube task would help future maintainers understand this is based on screen space coverage, not task similarity.

Overall this is a well-thought-out improvement that makes the tests more deterministic and the thresholds more meaningful. The code organization with the centralized threshold dictionary is cleaner than the previous inline values.

# Description For the rendering correctness test, we don't really need the `env.reset()` call to fill the camera output buffers. Instead, if we remove `env.reset()`, the camera output buffers will be filled on the invocation of camera.data: ```python @Property def data(self) -> CameraData: # update sensors if needed self._update_outdated_buffers() # return the data return self._data ``` This means we can remove `env.reset()` calls in the rendering correctness test to avoid non-deterministic initial pose. Articulation bodies will be at their default pose for rendering for all combos. With this removal I can set the max pixel diff threshold to smaller (stricter) values: ```python "cartpole": 1.0, # decreased from 2.0 "shadow_hand": 3.0, # decreased from 8.0 "dexsuite_kuka": 4.0, # decreased from 10.0 ``` The test will become more sensitive to capture rendering changes but hopefully it can still tolerate minor pixel noise. Fixes # (issue)  ## Type of change - Test-only change ## Checklist - [x] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [x] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there

remove env.reset() in the test because it would introduce non-determi…

f1457f8

…nistic initial poses

github-actions Bot added the isaac-lab Related to Isaac Lab team label Apr 22, 2026

greptile-apps Bot reviewed Apr 22, 2026

View reviewed changes

huidongc marked this pull request as draft April 22, 2026 06:17

central dict for pixel difference threshold

c308e1b

huidongc force-pushed the rendering-correctness-test-determinism branch from 55d512c to c308e1b Compare April 22, 2026 08:27

huidongc commented Apr 22, 2026

View reviewed changes

smaller (stricter) threshold

cbf9abb

huidongc marked this pull request as ready for review April 22, 2026 09:15

huidongc requested a review from pbarejko April 22, 2026 10:10

isaaclab-review-bot Bot reviewed Apr 22, 2026

View reviewed changes

pbarejko approved these changes Apr 22, 2026

View reviewed changes

huidongc merged commit 481d7ee into isaac-sim:develop Apr 22, 2026
28 of 31 checks passed

huidongc deleted the rendering-correctness-test-determinism branch April 22, 2026 22:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rendering correctness test determinism#5353

Rendering correctness test determinism#5353
huidongc merged 3 commits into
isaac-sim:developfrom
huidongc:rendering-correctness-test-determinism

huidongc commented Apr 22, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented Apr 22, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot Apr 22, 2026

Uh oh!

huidongc Apr 22, 2026

Uh oh!

huidongc Apr 22, 2026

Uh oh!

isaaclab-review-bot Bot Apr 22, 2026

Uh oh!

isaaclab-review-bot Bot Apr 22, 2026

Uh oh!

Uh oh!

huidongc Apr 22, 2026

Uh oh!

huidongc Apr 22, 2026

Uh oh!

huidongc Apr 22, 2026

Uh oh!

isaaclab-review-bot Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		camera_outputs_nested_dict = _collect_camera_outputs(env)
		num_camera_outputs = len(camera_outputs_nested_dict)

Conversation

huidongc commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Checklist

Uh oh!

greptile-apps Bot commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

greptile-apps Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

huidongc Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

huidongc Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

isaaclab-review-bot Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

isaaclab-review-bot Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

huidongc Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

huidongc Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

huidongc Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

isaaclab-review-bot Bot left a comment

Choose a reason for hiding this comment

🤖 Isaac Lab Review Bot

Summary

Architecture Impact

Implementation Verdict

Test Coverage

CI Status

Findings

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

huidongc commented Apr 22, 2026 •

edited

Loading

greptile-apps Bot commented Apr 22, 2026 •

edited

Loading