Skip to content

This repo implements mutualistic reward shaping in multi-agent reinforcement learning (MARL) to enhance robot cooperation. Tested on CartPendulum, ShadowHand, and Mobile Manipulation, it improves stability, convergence, and coordination. Includes code, results, and documentation. Contributions welcome! πŸš€

License

Notifications You must be signed in to change notification settings

Cyber-physical-Systems-Lab/RewMARL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

RewMARL

This repo implements mutualistic reward shaping in multi-agent reinforcement learning (MARL) to enhance robot cooperation. Tested on CartPendulum, ShadowHand, and Mobile Manipulation, it improves stability, convergence, and coordination. Includes code, results, and documentation. Contributions welcome! πŸš€

This repository contains code for Investigating Symbiosis in Robotic Ecosystems: A Case Study for Multi-Robot Reinforcement Learning Reward Shaping.

Code is available now!

βš™οΈ Installation & Usage

This project has been tested on Ubuntu 22.04 using Isaac Sim 4.5.0 or 4.2.0.

βœ… Prerequisites

Install Isaac Sim and IsaacLab by following the IsaacLab pip installation guide.

Make sure the following files are placed correctly:

IsaacLab/
β”œβ”€β”€ source/
β”‚   β”œβ”€β”€ isaaclab_assets/
β”‚   β”‚   └── isaaclab_assets/
β”‚   β”‚       └── robots/
β”‚   β”‚           └── mobile_franka.py
β”‚   └── isaaclab_tasks/
β”‚       └── isaaclab_tasks/
β”‚           └── direct/
β”‚               β”œβ”€β”€ cart_pendulum/
β”‚               β”œβ”€β”€ shadow_hand/
β”‚               └── mobile_franka/

πŸš€ Example: Training MobileFranka

To train the MobileFranka multi-agent task using MAPPO:

./isaaclab.sh -p scripts/reinforcement_learning/skrl/train.py --algorithm MAPPO --task=MobileFrankaMARL #--headless

Example Environments

CartPendulum ShadowHand MobileFranka
CartPendulum ShadowHand MobileFranka
Cooperative Balancing: Multiple agents control different aspects of the double cart-pendulum system, requiring coordinated actions to maintain balance. Shadow Hand Object Passing: Multiple agents controlling different finger groups of the dexterous hand, collaborating through shared rewards to manipulate and pass objects with precision. Mobile Manipulation: Combining base movement and arm control agents that benefit from shared reward signals to perform coordinated navigation and manipulation tasks.

Key Features

  • Bio-Inspired Reward Shaping: Implements a formal symbiosis model to enhance cooperation in MARL.
  • Symbiotic Interaction Taxonomy: Categorizes agent interactions as mutualism, commensalism, and parasitism.
  • Improved Learning in Complex Tasks: Enhances stability, convergence, and variance reduction in high-dimensional environments.

🧠 Citation

If you use this code or find the idea useful, please consider citing our work:

@inproceedings{niu2025symbiosis,
  title={Investigating Symbiosis in Robotic Ecosystems: A Case Study for Multi-Robot Reinforcement Learning Reward Shaping},
  author    = {Xuezhi Niu and Didem GΓΌrdΓΌr Broo},
  booktitle = {the 2025 9th International Conference on Robotics and Automation Sciences (ICRAS)},
  year      = {2025},
  publisher = {IEEE}
}

HelloCat

About

This repo implements mutualistic reward shaping in multi-agent reinforcement learning (MARL) to enhance robot cooperation. Tested on CartPendulum, ShadowHand, and Mobile Manipulation, it improves stability, convergence, and coordination. Includes code, results, and documentation. Contributions welcome! πŸš€

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages