Learn Reinforcement Learning

I will be developing this tutorial for everyone who is interested to tryout reinforcement learning through doing hands-on projects. We will start simple and eventually will build up the complexity of the project. I plan to cover some topics in OpenAI Gym and simulation in Gazebo and Isaac Sim using ROS2. Please forgive any mistakes that you find. The project structure is a bit messy since I'm also in the process of figuring it out. You're more than welcome to put issues and pull requests for improving this repository. Let's grow together!

Sections

Setup

Follow these commands for cloning and trying out this repository.

Let's first clone the repository using this command:

git clone git@github.com:ashvin-a/Learn-RL.git

Create a virtual environment for installing the dependencies.

python3 -m venv env

Utilise the environment. For a Windows machine, run:

.\env\Scripts\activate

And for Linux/Mac, run:

source env/bin/activate

Now, let's install the dependencies.

pip install -r requirements.txt

Yay! Now you've completed the setup! You can try running the trained Cartpole agent by running:

python src/simulation/simulation/cartpole/cartpole_test.py

Projects

1. Cartpole Agent

We will be using a simple agent, i.e, CartPole-V1, for this project. Here, the goal will be to balance the stick like an inverted pendulum. You could check out src/simulation/simulation/cartpole/cartpole.py for training the agent. You could try out the model by running cartpole_test.py.

2. Walker2D

This is a little more complicated. We now have multiple joints, and we have a lot more states and actions for this agent. Let's first train the agent to stay alive and keep hoping.

Here, for the simplicity of training, we will be using Proximal Policy Optimization(PPO) for training the agent since it's a "policy-based" method. (Btw, Q-learning is a value-based method.) You can check more about these in here.

Before trying it out, make sure to run this command since I've updated the dependencies:

pip install -r requirements.txt

In the src/simulation/simulation/walker2d directory, we have 4 scripts - two are for training the policy, and two for testing each of them out. For the demo, I trained the policy using train_walker_1.py, and the model is saved in walker2d_policy_1.zip. train_walker_2.py is an attempt to create a custom environment where we could fine-tune the reward function to include constraints to speed so that the walker will move slowly. It is completed, but the policy that was generated from that training has some issue that needs to be resolved. In the meantime, you could try out train_walker_1.py for training and test_walker_1.py for trying out the walk policies.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learn Reinforcement Learning

Sections

Setup

Projects

1. Cartpole Agent

2. Walker2D

About

Uh oh!

Releases

Packages

Languages

ashvin-a/Learn-RL

Folders and files

Latest commit

History

Repository files navigation

Learn Reinforcement Learning

Sections

Setup

Projects

1. Cartpole Agent

2. Walker2D

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages