PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
-
Updated
Nov 15, 2021 - Python
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Generative Adversial Imitation Learning (GAIL) an inverse reinforcement learning algorithm
Minimal implementation of Proximal Policy Optimization (PPO) in PyTorch
Interactive imitation learning in Atari Gymnasium environments (e.g. Space Invaders) using human-in-the-loop demonstrations. The project explores different policy architectures (NN, CNN, ViT) and investigates GAIL for high-dimensional visual control. Developed as part of the Social Robotics (MU5EEH15 – 2025/2026) university course.
Add a description, image, and links to the gail-ppo topic page so that developers can more easily learn about it.
To associate your repository with the gail-ppo topic, visit your repo's landing page and select "manage topics."