Skip to content

Questions Regarding Residual RL Fine-Tuning Process and Checkpoints #9

@winter12279

Description

@winter12279

Hi Compass Team,

Thank you for the great work and your previous support!

We are currently planning to fine-tune the model using residual RL with our environments and embodiments, as recommended in the GitHub repository:

The generalist policy uses one-hot embodiment encoding and may not generalize perfectly to unseen embodiment types. For best results with new embodiment types, we recommend fine-tuning with residual RL first.

To move forward, we’d like to clarify a few points:

  1. Could you guys provide detailed instructions or documentation on how to perform the residual RL fine-tuning process?
  2. What .ckpt files or related resources are required for fine-tuning? Would be great if you could point us to the right ones.

We’d really appreciate any guidance or materials you could share to help us get started.

Thanks again!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions