Skip to content

Comments

batch normalization : moving average with different batch size#5975

Open
matthieu637 wants to merge 3 commits intoBVLC:masterfrom
matthieu637:upstream
Open

batch normalization : moving average with different batch size#5975
matthieu637 wants to merge 3 commits intoBVLC:masterfrom
matthieu637:upstream

Conversation

@matthieu637
Copy link

This formulation of decay moving average is more accurate if the batch size is changing over time.

@Noiredd
Copy link
Member

Noiredd commented Nov 2, 2017

I have restarted the tests that failed prior to #5973, it passes now. Anyone with better understanding of batch norm to state their opinion of this?

PS: Are you planning to change the GPU version too?

@matthieu637
Copy link
Author

More information :
If the batch size is fixed, as usually in supervised learning, it won't change the result.
In case of multiple batch size, for instance in reinforcement learning, it will be a little more accurate : small batches have less impact than bigger.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants