Hi Compass Team,
Thank you for the great work and your previous support!
We are currently planning to fine-tune the model using residual RL with our environments and embodiments, as recommended in the GitHub repository:
> The generalist policy uses one-hot embodiment encoding and may not generalize perfectly to unseen embodiment types. For best results with new embodiment types, we recommend fine-tuning with residual RL first.
To move forward, we’d like to clarify a few points:
- Could you provide detailed instructions or documentation on how to perform the residual RL fine-tuning process?
- Which .ckpt files or related resources are required for fine-tuning? It would be great if you could point us to the right ones.
We’d really appreciate any guidance or materials you could share to help us get started.
Thanks again!