refactor: split train and val dataset in preference dataset by yuki-97 · Pull Request #1763 · NVIDIA-NeMo/RL

yuki-97 · 2026-01-13T04:00:22Z

Closes #1050.

Split the train and validation datasets in the built-in preference datasets, which unblocks multiple-dataset support.
Unify the built-in datasets under nemo_rl/data/datasets/preference_datasets/ into a similar format.
Move setup_preference_data to nemo_rl/data/utils.py and reuse it.

Usage

data:
  # other data settings, see `examples/configs/sft.yaml` for more details
  ...
  # dataset settings
  train:
    # this dataset will override prompt_key and use the default values for other vars
    data_path: /path/to/local/train_dataset.jsonl  # local file or hf_org/hf_dataset_name (HuggingFace)
    prompt_key: context
    split: train  # used for HuggingFace datasets
  validation:
    # this dataset will use the default values for other vars except data_path
    data_path: /path/to/local/val_dataset.jsonl
  default:
    # the values below are used as defaults when a dataset does not specify them
    dataset_name: BinaryPreferenceDataset
    prompt_key: prompt
    chosen_key: chosen
    rejected_key: rejected
    prompt_file: null
    system_prompt_file: null
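
The override behaviour described in the comments above can be sketched in a few lines of Python. This is only an illustration of the intended semantics, not the code in this PR: `resolve_dataset_config` is a hypothetical helper, and the assumption is that a per-dataset section simply overrides matching keys from `default`.

```python
from copy import deepcopy
from typing import Any, Optional


def resolve_dataset_config(data_cfg: dict[str, Any], split: str) -> Optional[dict[str, Any]]:
    """Return the settings for `split`, with missing keys filled from `default`.

    Hypothetical helper: it assumes a plain key-by-key override.
    """
    split_cfg = data_cfg.get(split)
    if split_cfg is None:
        # e.g. `validation: null` means there is no dataset for this split
        return None
    resolved = deepcopy(data_cfg.get("default") or {})
    resolved.update(split_cfg)  # per-dataset values win over the defaults
    return resolved


# The same settings as the YAML above, written as a Python dict.
data_cfg = {
    "train": {
        "data_path": "/path/to/local/train_dataset.jsonl",
        "prompt_key": "context",
        "split": "train",
    },
    "validation": {"data_path": "/path/to/local/val_dataset.jsonl"},
    "default": {
        "dataset_name": "BinaryPreferenceDataset",
        "prompt_key": "prompt",
        "chosen_key": "chosen",
        "rejected_key": "rejected",
        "prompt_file": None,
        "system_prompt_file": None,
    },
}

train_cfg = resolve_dataset_config(data_cfg, "train")
val_cfg = resolve_dataset_config(data_cfg, "validation")
print(train_cfg["prompt_key"], val_cfg["prompt_key"])
```

Under these assumptions the train dataset reads prompts from `context`, while the validation dataset falls back to `prompt`.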

Migration Guide

For datasets loaded from a local JSONL file or from HuggingFace (BinaryPreferenceDataset and PreferenceDataset)

# old
data:
  dataset_name: BinaryPreferenceDataset
  train_data_path: <PathToTrainingDataset>  # e.g., /path/to/local/dataset.jsonl or hf_org/hf_dataset_name (HuggingFace)
  val_data_path: <PathToValidationDataset>
  prompt_key: <PromptKey>  # default is "prompt"
  chosen_key: <ChosenKey>  # default is "chosen"
  rejected_key: <RejectedKey>  # default is "rejected"
  train_split: <TrainSplit>  # default is None; used for HuggingFace datasets
  val_split: <ValSplit>  # default is None; used for HuggingFace datasets

# new
data:
  # other data settings, see `examples/configs/sft.yaml` for more details
  ...
  # dataset settings
  train:
    # this dataset will override prompt_key and use the default values for other vars
    data_path: /path/to/local/train_dataset.jsonl  # local file or hf_org/hf_dataset_name (HuggingFace)
    prompt_key: context
    split: train  # used for HuggingFace datasets
  validation:
    # this dataset will use the default values for other vars except data_path
    data_path: /path/to/local/val_dataset.jsonl
  default:
    # the values below are used as defaults when a dataset does not specify them
    dataset_name: BinaryPreferenceDataset
    prompt_key: prompt
    chosen_key: chosen
    rejected_key: rejected
    prompt_file: null
    system_prompt_file: null
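
For configs with many flat keys, the old-to-new mapping above could be applied mechanically. The following is a rough sketch under stated assumptions, not a tool shipped with this PR: `migrate_legacy_data_config` is a hypothetical name, shared keys are assumed to belong under `default`, and a missing validation path is assumed to map to `validation: null`.

```python
from typing import Any, Optional

# Keys shared by both splits in the old flat layout (assumed to move under `default`).
SHARED_KEYS = ("dataset_name", "prompt_key", "chosen_key", "rejected_key")


def migrate_legacy_data_config(old: dict[str, Any]) -> dict[str, Any]:
    """Convert an old flat `data` section into the train/validation/default layout."""
    remaining = dict(old)  # work on a copy so the caller's dict is untouched
    default = {k: remaining.pop(k) for k in SHARED_KEYS if k in remaining}

    def build_split(path_key: str, split_key: str) -> Optional[dict[str, Any]]:
        path = remaining.pop(path_key, None)
        split = remaining.pop(split_key, None)
        if path is None:
            return None  # no dataset configured for this split
        cfg: dict[str, Any] = {"data_path": path}
        if split is not None:
            cfg["split"] = split  # only meaningful for HuggingFace datasets
        return cfg

    train = build_split("train_data_path", "train_split")
    validation = build_split("val_data_path", "val_split")
    # Any other data settings are passed through unchanged.
    return {**remaining, "train": train, "validation": validation, "default": default}


old_cfg = {
    "dataset_name": "BinaryPreferenceDataset",
    "train_data_path": "/path/to/local/train_dataset.jsonl",
    "val_data_path": "/path/to/local/val_dataset.jsonl",
    "prompt_key": "context",
    "train_split": None,
    "val_split": None,
}
new_cfg = migrate_legacy_data_config(old_cfg)
print(new_cfg)
```

Because the old layout had a single `prompt_key` for both splits, the sketch places it under `default` rather than under `train`.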

For built-in datasets that need changes

HelpSteer3

# old
data:
  dataset_name: HelpSteer3

# new
data:
  train:
    dataset_name: HelpSteer3
    split: train
  validation:
    dataset_name: HelpSteer3
    split: validation

Tulu3Preference

# old
data:
  dataset_name: Tulu3Preference

# new
data:
  train:
    dataset_name: Tulu3Preference
  validation: null
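
Old-style configs like the ones above can be recognised by their flat top-level keys, so a loader could fail early with a pointer to this guide. The sketch below is illustrative only: the function names and the error text are hypothetical and are not the message added in this PR.

```python
from typing import Any

# Top-level keys that only exist in the old flat layout (per the guide above).
LEGACY_KEYS = ("dataset_name", "train_data_path", "val_data_path", "train_split", "val_split")


def find_legacy_keys(data_cfg: dict[str, Any]) -> list[str]:
    """Return the old-style keys found at the top level of the `data` section."""
    return [key for key in LEGACY_KEYS if key in data_cfg]


def check_data_config(data_cfg: dict[str, Any]) -> None:
    """Raise a ValueError if the config still uses the old flat layout."""
    legacy = find_legacy_keys(data_cfg)
    if legacy:
        raise ValueError(
            f"Old-style data config keys found: {', '.join(legacy)}. "
            "Move dataset settings under data.train / data.validation / data.default."
        )


# New-style configs pass silently, including `validation: null`.
check_data_config({"train": {"dataset_name": "Tulu3Preference"}, "validation": None})

# An old-style config is rejected.
try:
    check_data_config({"dataset_name": "HelpSteer3"})
except ValueError as err:
    print(err)
```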

Test Result
Nightly tests are all good.

algo	result
rm
dpo

Signed-off-by: Yuki Huang <yukih@nvidia.com>

github-actions bot added the Documentation label (Improvements or additions to documentation) Jan 13, 2026

yuki-97 linked an issue Jan 13, 2026 that may be closed by this pull request

Decouple train and eval dataset #1050

Closed

yuki-97 mentioned this pull request Jan 13, 2026

Refactor dataset module for train #909

Open

yuki-97 changed the base branch from yukih/split-train-val-dataset to main January 13, 2026 08:22

yuki-97 added the CI:L1 label (Run doctests, unit tests, and functional tests) Jan 13, 2026

yuki-97 force-pushed the yukih/split-train-val-dataset-preference branch from 0923975 to 2fb1777 Compare January 13, 2026 11:32

yuki-97 force-pushed the yukih/split-train-val-dataset-preference branch from 2fb1777 to 6086b51 Compare January 13, 2026 14:38

yuki-97 force-pushed the yukih/split-train-val-dataset-preference branch from 6086b51 to 994a15f Compare January 13, 2026 15:26

terrykong marked this pull request as ready for review January 21, 2026 23:16

terrykong requested review from a team as code owners January 21, 2026 23:16

yuki-97 mentioned this pull request Feb 3, 2026

Cleanup default_task_data_spec in AllTaskProcessedDataset #1867

Open

yuki-97 force-pushed the yukih/split-train-val-dataset-preference branch from eb00be0 to ba45d87 Compare February 3, 2026 05:51

terrykong enabled auto-merge (squash) February 3, 2026 06:11

terrykong approved these changes Feb 3, 2026

yuki-97 added 15 commits February 3, 2026 22:09

update BinaryPreferenceDataset and PreferenceDataset

d1161f7

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update helpsteer3 and tulu3

76e3120

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update run rm/dpo

f557b17

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update doc

9c514ef

Signed-off-by: Yuki Huang <yukih@nvidia.com>

split PreferenceDatasetConfig

09cae25

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update config

c768c1e

Signed-off-by: Yuki Huang <yukih@nvidia.com>

fix test

53c30e4

Signed-off-by: Yuki Huang <yukih@nvidia.com>

reuse setup_preference_data

c1d43ca

Signed-off-by: Yuki Huang <yukih@nvidia.com>

fix unit test

bf03742

Signed-off-by: Yuki Huang <yukih@nvidia.com>

add migrate guide message

77f6b6c

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update doc

bb76878

Signed-off-by: Yuki Huang <yukih@nvidia.com>

fix val_data_paths

ab17575

Signed-off-by: Yuki Huang <yukih@nvidia.com>

change to use basename

8b99888

Signed-off-by: Yuki Huang <yukih@nvidia.com>

lint

28a45e6

Signed-off-by: Yuki Huang <yukih@nvidia.com>

update task_name

5a9ab6f

Signed-off-by: Yuki Huang <yukih@nvidia.com>

yuki-97 force-pushed the yukih/split-train-val-dataset-preference branch from ba45d87 to 5a9ab6f Compare February 3, 2026 14:09

terrykong merged 15 commits into main from yukih/split-train-val-dataset-preference
