Put the acc_data in a new syncedmemory block by sonack · Pull Request #6141 · BVLC/caffe

sonack · 2017-12-30T08:33:17Z

While using the Accuracy layer in the training phase, I noticed something odd: the loss kept increasing and the gradients were much larger than normal.
An easy way to reproduce this is to comment out the include phase restriction of the accuracy layer in caffe/examples/cifar10/cifar10_quick_train_test.prototxt and train with train_quick.sh.
After a laborious review of the source code, I found that this line in accuracy_layer.cu is the cause. Its comment says: "Since this memory is not used for anything, we use it here to avoid having to allocate new GPU memory to accumulate intermediate results in the kernel."
That assumption is wrong. When a blob feeds both a loss layer and an accuracy layer, Caffe inserts a Split layer to duplicate it. During the backward pass, the diff of the split copy that goes to the accuracy layer is never overwritten, so it should always stay zero. Here, however, its values are replaced with the accuracy intermediates, which makes the gradient coming out of the Split layer much larger.

I modified this line to allocate a new SyncedMemory object to store acc_data, which makes it independent of bottom[0]'s diff field and fixes the issue above.
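
The mechanism described above can be illustrated with a small standalone sketch. It does not use the Caffe API: `ToyBlob`, `split_backward`, `accuracy_scratch_in_diff`, and `accuracy_separate_scratch` are hypothetical names made up for this example, and the "predictions" are simply class indices stored in `data`. It only assumes, as the description says, that the Split layer's backward pass adds up the diffs of its copies.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-in for a blob: a data buffer plus a same-sized diff (gradient) buffer.
// Hypothetical type for illustration only, not Caffe's Blob.
struct ToyBlob {
  std::vector<float> data;
  std::vector<float> diff;
  explicit ToyBlob(std::size_t n) : data(n, 0.0f), diff(n, 0.0f) {}
};

// Split-style backward: the gradient of the shared input is the
// element-wise sum of the diffs of the two copies.
std::vector<float> split_backward(const ToyBlob& to_loss, const ToyBlob& to_acc) {
  assert(to_loss.diff.size() == to_acc.diff.size());
  std::vector<float> grad(to_loss.diff.size());
  for (std::size_t i = 0; i < grad.size(); ++i) {
    grad[i] = to_loss.diff[i] + to_acc.diff[i];
  }
  return grad;
}

// Problematic pattern: per-sample hits are written into bottom.diff as scratch
// space and left there, so a later split_backward() picks them up as gradient.
float accuracy_scratch_in_diff(ToyBlob& bottom, const std::vector<int>& labels) {
  assert(labels.size() == bottom.data.size());
  if (labels.empty()) return 0.0f;
  float correct = 0.0f;
  for (std::size_t i = 0; i < labels.size(); ++i) {
    bottom.diff[i] = (static_cast<int>(bottom.data[i]) == labels[i]) ? 1.0f : 0.0f;
    correct += bottom.diff[i];
  }
  return correct / static_cast<float>(labels.size());
}

// Pattern proposed in this PR, in toy form: the intermediates live in their
// own buffer, so bottom.diff is never touched and stays zero.
float accuracy_separate_scratch(const ToyBlob& bottom, const std::vector<int>& labels) {
  assert(labels.size() == bottom.data.size());
  if (labels.empty()) return 0.0f;
  std::vector<float> scratch(labels.size(), 0.0f);
  float correct = 0.0f;
  for (std::size_t i = 0; i < labels.size(); ++i) {
    scratch[i] = (static_cast<int>(bottom.data[i]) == labels[i]) ? 1.0f : 0.0f;
    correct += scratch[i];
  }
  return correct / static_cast<float>(labels.size());
}
```

With the separate buffer, `split_backward` returns exactly the loss branch's diff. With the scratch-in-diff variant, every correctly classified sample adds 1 to its gradient entry, which matches the inflated gradients seen during training.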

Noiredd · 2018-01-03T09:18:35Z

This problem was first spotted in #5981 and has a temporary workaround in #5987. How is your PR different from it? Let's pick one to work on rather than waste energy developing two solutions to the same problem simultaneously.

sonack added 2 commits December 30, 2017 16:18

Put the acc_data in a new syncedmemory block

fb0c599

Update accuracy_layer.cu

df308ed

shelhamer mentioned this pull request Jan 29, 2018

Clear Scratch Diffs to Prevent Contaminating Backward through Splits #6202

Merged

shelhamer closed this in #6202 Jan 29, 2018

sonack wants to merge 2 commits into BVLC:master from sonack:patch-1
