Skip to content

TRPO example that reproduces the "Deep Reinforcement Learning that Matters" paper#449

Merged
muupan merged 24 commits intochainer:masterfrom
muupan:trpo-mujoco-matters
Sep 6, 2019
Merged

TRPO example that reproduces the "Deep Reinforcement Learning that Matters" paper#449
muupan merged 24 commits intochainer:masterfrom
muupan:trpo-mujoco-matters

Conversation

@muupan
Copy link
Copy Markdown
Member

@muupan muupan commented May 1, 2019

Merge #446 #448 first.

This PR adds a script to reproduce the TRPO results in "Deep Reinforcement Learning that Matters" (http://arxiv.org/abs/1709.06560).

As you can see from the tables in README, our results are competitive with theirs. https://github.com/muupan/chainerrl/tree/trpo-mujoco-matters/examples/mujoco/reproduction/trpo

@muupan
Copy link
Copy Markdown
Member Author

muupan commented Aug 27, 2019

/test

@pfn-ci-bot
Copy link
Copy Markdown
Collaborator

Successfully created a job for commit b0b017a:

@muupan muupan changed the title [WIP] TRPO example that reproduces the "Deep Reinforcement Learning that Matters" paper TRPO example that reproduces the "Deep Reinforcement Learning that Matters" paper Aug 27, 2019
@muupan muupan requested a review from toslunar August 27, 2019 06:31

# Normalize observations based on their empirical mean and variance
obs_normalizer = chainerrl.links.EmpiricalNormalization(
obs_space.low.size, clip_threshold=5)
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't find clip_threshold=5 in the paper, but 5 seems sufficiently large.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

muupan and others added 3 commits September 6, 2019 16:08
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
It should not affect the behavior as the default value of the argument
is also 0.01.
@muupan
Copy link
Copy Markdown
Member Author

muupan commented Sep 6, 2019

/test

@pfn-ci-bot
Copy link
Copy Markdown
Collaborator

Successfully created a job for commit 8762890:

@muupan muupan requested a review from toslunar September 6, 2019 07:21
Copy link
Copy Markdown
Member

@toslunar toslunar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@muupan
Copy link
Copy Markdown
Member Author

muupan commented Sep 6, 2019

/test

@pfn-ci-bot
Copy link
Copy Markdown
Collaborator

Successfully created a job for commit dfdcf5d:

@muupan muupan merged commit 63dad78 into chainer:master Sep 6, 2019
@muupan muupan deleted the trpo-mujoco-matters branch September 6, 2019 11:54
@muupan muupan added the example label Feb 6, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants