Conversation
|
@Yangqing @shelhamer @jeffdonahue @longjon @sergeyk
|
I'm for it
|
I agree that the accuracy should not be coupled to the multinomial logistic loss like it is now. The first step I see to fix this would be to update the example prototxts. Factoring out the argmax might also be desirable, but I'd also be fine with leaving that out of this PR to keep things simple.
Changed all val.prototxt in examples to add a LossLayer to compute loss in Test
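For illustration, here is a minimal sketch (not taken from this PR's diff) of what the tail of a val/test prototxt looks like once the accuracy and the loss are computed by separate layers. It uses present-day Caffe prototxt syntax; the bottom blob names `ip2` and `label` are placeholders:

```
# Hypothetical tail of a val.prototxt: accuracy and loss are separate layers.
layer {
  name: "accuracy"
  type: "Accuracy"
  bottom: "ip2"
  bottom: "label"
  top: "accuracy"
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"   # explicit loss layer, added so Test still reports a loss
  bottom: "ip2"
  bottom: "label"
  top: "loss"
}
```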
|
@sergeyk @longjon @shelhamer can each one of you test one of the examples to make sure they still work for you?
|
I'm for the separation. I agree with @longjon that splitting the accuracy and multinomial logistic loss is enough for now, provided that we update our prototxt. I'll test LeNet and the MNIST autoencoder and report back. The rest should be fine since the changes are the same, but someone else should check CaffeNet and AlexNet. Thanks for clearing up the design, Sergio!
|
A nice follow-up, not in this PR but in the future, would be to add a
|
@sguada I tested LeNet and the MNIST autoencoder, and checked that the CIFAR examples don't crash. (@jeffdonahue I noticed the consolidated LeNet example crashes, seemingly because of LevelDB lock contention. This was on OS X. I have some recollection of a similar issue cropping up in a data layer in the past -- I don't think the train net and the test-on-train net can exist concurrently and read the same LevelDB.)
|
I think it actually should work, but maybe the OSX implementation of leveldb is buggy? leveldb is built to handle reads from multiple threads within the same process; just not from multiple processes. See http://stackoverflow.com/questions/9177598/multiple-instances-of-a-leveldb-database-at-the-same-time |
|
It seems OS X leveldb is buggy then. I remember re-scoping tests to initialize / free the leveldb between tests because otherwise it crashed on OS X like it does now with this example. Perhaps another reason to adopt lmdb. |
|
Oh, now that I actually look at the leveldb documentation [1], it seems that the threads need to be sharing the same `leveldb::DB` object.

[1] http://leveldb.googlecode.com/svn/trunk/doc/index.html
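To make the documented constraint concrete, here is a small standalone C++ sketch (not Caffe code) where two threads read concurrently through one shared `leveldb::DB` handle inside a single process, which is the supported pattern; the database path and key below are made up:

```cpp
#include <cassert>
#include <string>
#include <thread>

#include "leveldb/db.h"

int main() {
  // Open the database once; the resulting DB* may be shared by threads in
  // this process. A second process opening the same path would hit the LOCK file.
  leveldb::DB* db = nullptr;
  leveldb::Options options;
  options.create_if_missing = true;  // so the sketch runs even without existing data
  leveldb::Status status =
      leveldb::DB::Open(options, "/tmp/example-leveldb", &db);  // hypothetical path
  assert(status.ok());

  auto reader = [db]() {
    std::string value;
    // Concurrent Get() calls on the same DB object need no external locking.
    db->Get(leveldb::ReadOptions(), "00000000", &value);  // hypothetical key
  };
  std::thread t1(reader), t2(reader);
  t1.join();
  t2.join();

  delete db;
  return 0;
}
```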
|
|
@shelhamer thanks for doing some extra testing with this new PR.
|
@sguada Oh, I actually took a different approach and gave it two top blobs max, so it can report the loss and the actual softmax output too: shelhamer@adab413. What do you think? I could also move it to
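As a rough illustration of the two-top idea (a sketch in present-day prototxt syntax, not the linked commit itself; blob names are placeholders), a loss layer declared with an optional second top can expose the softmax probabilities alongside the scalar loss:

```
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "ip2"
  bottom: "label"
  top: "loss"   # scalar loss, as before
  top: "prob"   # optional second top: the softmax output itself
}
```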
|
@shelhamer that's even better; maybe someone wants to get both things. I think it is a good idea to move it there. I have added you to my fork, so you should be able to push it now.
|
I have tested the examples and they seem fine. I will push my fix and the transplant of

As a follow-up, how does everyone feel about introducing a
|
Thanks @shelhamer for the merge and final retouches. |
Split accuracy and loss
|
How to download these 20 files?
This PR separates the loss from the accuracy. This will allow using different losses while computing the accuracy.

Once the `AccuracyLayer` doesn't compute the loss, one needs to add a `LossLayer` if one wants to compute it.

There are two options for the `AccuracyLayer`. Right now it does an ArgMax on the data to compute the predicted label and then compares it with the expected label; that could be removed, letting the `ArgMaxLayer` do it. Another option would be to create the `ArgMaxLayer`, and maybe a `LossLayer`, within the `AccuracyLayer` and then connect them.
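For the first option, a hedged sketch of what "let the `ArgMaxLayer` do that" could look like in today's prototxt syntax (blob names are placeholders; comparing the predicted labels against the expected `label` blob would still need a layer that consumes both):

```
# Hypothetical: factor the argmax out into its own layer.
layer {
  name: "predicted_label"
  type: "ArgMax"
  bottom: "prob"
  top: "predicted_label"
  argmax_param { top_k: 1 }
}
```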