Questions Regarding Residual RL Fine-Tuning Process and Checkpoints

Hi Compass Team,

Thank you for the great work and your previous support!

We are currently planning to fine-tune the model using residual RL with our environments and embodiments, as recommended in the GitHub repository:

> The generalist policy uses one-hot embodiment encoding and may not generalize perfectly to unseen embodiment types. For best results with new embodiment types, we recommend fine-tuning with residual RL first.


To move forward, we’d like to clarify a few points:
1. Could you guys provide detailed instructions or documentation on how to perform the residual RL fine-tuning process?
2. What .ckpt files or related resources are required for fine-tuning? Would be great if you could point us to the right ones.

We’d really appreciate any guidance or materials you could share to help us get started.

Thanks again!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions Regarding Residual RL Fine-Tuning Process and Checkpoints #9

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions Regarding Residual RL Fine-Tuning Process and Checkpoints #9

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions