Align JEPA supervision with leakage-free future targets by mehulsuresh · Pull Request #8 · ginwind/VLA-JEPA

mehulsuresh · 2026-04-23T20:14:15Z

Summary

This PR adds an opt-in JEPA supervision path for lerobot_datasets that constructs future-shifted target clips and encodes them separately from the context clip.

When datasets.vla_data.video_target_shift_steps is 0, behavior is unchanged.

When video_target_shift_steps is set to one encoder tubelet, the dataloader emits:

a shorter context clip for the predictor input
a future-shifted target clip with the same encoded temporal length for supervision

VLA_JEPA.forward() then encodes those clips separately instead of deriving both context and target states from a single encoded video window.

Paper Alignment

The paper describes VLA-JEPA as "leakage-free state prediction" and says future frames should be used only as supervision targets, not as inputs to the learner (paper, HTML).

This PR moves the implementation toward that setup:

the predictor input comes from a context clip
the supervision target comes from a separate future-shifted clip
the default single-pass path remains unchanged unless video_target_shift_steps is enabled

Add optional two-pass JEPA target clips

089ad66

mehulsuresh changed the title ~~Add optional two-pass JEPA target clips~~ Align JEPA supervision with leakage-free future targets Apr 23, 2026

mehulsuresh marked this pull request as ready for review April 23, 2026 21:15

ginwind force-pushed the main branch from 416a81f to a847cb0 Compare May 1, 2026 12:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Align JEPA supervision with leakage-free future targets#8

Align JEPA supervision with leakage-free future targets#8
mehulsuresh wants to merge 1 commit intoginwind:mainfrom
mehulsuresh:codex/upstream-jepa-two-pass

mehulsuresh commented Apr 23, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mehulsuresh commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Paper Alignment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mehulsuresh commented Apr 23, 2026 •

edited

Loading