This repository was archived by the owner on Jan 7, 2025. It is now read-only.


Changes to layer exclusion prefixes and Softmax-related code [DON'T MERGE] #623

Closed
lukeyeager wants to merge 6 commits into NVIDIA:master from lukeyeager:layer-exclusion-prefixes

Conversation

@lukeyeager (Member)

Close #605, close #335

NOTE: this will break networks which used to work in DIGITS

Changes:

  1. Add layer exclusion to classification nets (originally in #605, "Layer exclusion naming convention applied to classification nets [DON'T MERGE YET]"); see the sketch after this list
  2. Stop automatically creating Softmax layers in the deploy prototxt when SoftmaxWithLoss layers were present in the original prototxt (since you can do it manually with prefixes now)
  3. Only set inner_product_param.num_output when it was unset in the original prototxt
  4. Update standard networks to use the new features
    1. Prune useless GoogLeNet layers from deploy prototxt (Automatically prune GoogleNet useless branches in the deploy #335)
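For illustration only, a rough Python sketch of what the layer-name prefix convention amounts to when the deploy network is generated. This is not the code in this PR; it assumes pycaffe's caffe_pb2 bindings, and the helper name is hypothetical.

from caffe.proto import caffe_pb2

def strip_non_deploy_layers(net_param):
    # Hypothetical helper: return a copy of net_param (a caffe_pb2.NetParameter)
    # with layers whose names carry the 'train_' or 'val_' prefix removed,
    # since a deploy network does not need them.
    deploy = caffe_pb2.NetParameter()
    deploy.CopyFrom(net_param)
    del deploy.layer[:]
    for layer in net_param.layer:
        if not layer.name.startswith(('train_', 'val_')):
            deploy.layer.add().CopyFrom(layer)
    return deploy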

TODO:

  • Upgrade old internal network definitions
  • Documentation

@lukeyeager force-pushed the layer-exclusion-prefixes branch 2 times, most recently from a074edb to 5a635ca (March 9, 2016)
@lukeyeager (Member, Author)

Fixed tests and fixed a bug

@gheinrich (Contributor)

If the Softmax layer is omitted from the deploy network, the results look a bit funny:
[screenshot: lenet-without-softmax]
Can we raise an error if a classification network is created without a Softmax layer in (at least) the deploy network? Or, if we want to allow users to omit it, we could detect this and display the inference results as scores rather than probabilities. I'd vote for making it mandatory, though.
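For illustration, a minimal sketch of such a check (a hypothetical helper, not part of this PR), assuming the generated deploy prototxt is on disk and pycaffe's caffe_pb2 bindings are available:

from caffe.proto import caffe_pb2
from google.protobuf import text_format

def check_deploy_has_softmax(deploy_prototxt_path):
    # Hypothetical check: raise if a classification deploy network has no Softmax layer.
    net = caffe_pb2.NetParameter()
    with open(deploy_prototxt_path) as f:
        text_format.Merge(f.read(), net)
    if not any(layer.type == 'Softmax' for layer in net.layer):
        raise ValueError('Classification networks must include a Softmax layer '
                         'in the deploy network; without one, inference results '
                         'are unnormalized scores rather than probabilities.')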


gheinrich and others added 5 commits on March 10, 2016. The commit messages:

  • In case someone hits a problem like the one mentioned in NVIDIA#601 for a classification network.
  • Once Caffe implements input layers and phase control from Python, we should be able to remove those workarounds.
  • Important changes:
      * Only set inner_product_param.num_output when it was unset
    Also:
      * Stop trying to calculate network outputs (this was only needed for setting inner_product_param.num_output)
      * Ensure that layer prefix exclusion works the same way for both generic and classification nets
      * A little refactoring to make the generic and classification code paths more similar
  • Makes network specification more explicit
  • Explain how inner_product_param.num_output gets filled in automatically when missing (see the sketch below)
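As a rough illustration of the num_output behaviour described in these commits (a hypothetical helper, not the actual DIGITS code; net_param is a parsed caffe_pb2.NetParameter and num_classes comes from the dataset):

def fill_num_output(net_param, num_classes):
    # Fill in inner_product_param.num_output from the dataset's class count,
    # but only for InnerProduct layers where the user left it unset.
    for layer in net_param.layer:
        if layer.type == 'InnerProduct' and \
                not layer.inner_product_param.HasField('num_output'):
            layer.inner_product_param.num_output = num_classes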
@lukeyeager (Member, Author)

Can we raise an error if a classification network is created without a Softmax layer in (at least) the deploy network?

Yes, that makes sense for classification networks. Whenever we get around to merging "classification" and "generic" we can do something more clever. But that's the right solution for now.

@lukeyeager (Member, Author)

As I'm thinking more about this, should we ask users to differentiate between Train/Val/Deploy using layer.include.stage instead of layer.name prefixes? Since stages aren't fully supported yet, we'd still need to parse the network definition and convert it into train_val.prototxt and deploy.prototxt for now. But we could get users to start learning the stage syntax now, and remove the internal hackery once Caffe fully supports stages.

Example of an all-in-one network specified using layer.include.stage:
https://github.com/lukeyeager/caffe/blob/ed2621d775/python/caffe/test/test_net.py#L155-L206

PR giving greater stage support to Caffe:
BVLC/caffe#3736

# Using layer name prefixes
layer {
  name: "train_loss"
  type: "SoftmaxWithLoss"
}
layer {
  name: "deploy_softmax"
  type: "Softmax"
}
# Using stages
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  include { stage: "train" }
  include { stage: "val" }
}
layer {
  name: "softmax"
  type: "Softmax"
  include { stage: "deploy" }
}
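For comparison, a rough sketch (not part of this PR, and simplified: it ignores phase, level and exclude rules) of how an all-in-one definition using include.stage could be filtered into a deploy network until Caffe supports stages natively:

from caffe.proto import caffe_pb2

def filter_layers_by_stage(net_param, stage):
    # Keep only layers whose include rules list the given stage.
    # A layer with no include rules is kept for every stage.
    filtered = caffe_pb2.NetParameter()
    filtered.CopyFrom(net_param)
    del filtered.layer[:]
    for layer in net_param.layer:
        if not layer.include or any(stage in rule.stage for rule in layer.include):
            filtered.layer.add().CopyFrom(layer)
    return filtered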

@lukeyeager force-pushed the layer-exclusion-prefixes branch from 5a635ca to 96fe5d5 (March 10, 2016)
@lukeyeager changed the title from "Changes to layer exclusion prefixes and Softmax-related code" to "Changes to layer exclusion prefixes and Softmax-related code [DON'T MERGE]" (March 10, 2016)
@gheinrich (Contributor)

That sounds like a great idea. Is the stage syntax already established somewhere? I haven't found definitions of the train, val and deploy stages in Caffe.

@lukeyeager (Member, Author)

Yeah, I think that's what we need to do. I'm working on it now.

The number of ways you can specify the NetState is somewhat staggering.

https://github.com/NVIDIA/caffe/blob/v0.14.2/src/caffe/proto/caffe.proto#L330-L337
https://github.com/NVIDIA/caffe/blob/v0.14.2/src/caffe/proto/caffe.proto#L258-L274

DIGITS definitely can't support all of those options. We'll have to keep it to a subset.
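To make that concrete, Caffe decides whether an include/exclude rule applies roughly as follows (a simplified Python rendering of Net::StateMeetsRule, not DIGITS code; state is a caffe_pb2.NetState and rule is a caffe_pb2.NetStateRule):

def state_meets_rule(state, rule):
    # A rule matches only if every field it sets agrees with the current NetState.
    if rule.HasField('phase') and rule.phase != state.phase:
        return False
    if rule.HasField('min_level') and state.level < rule.min_level:
        return False
    if rule.HasField('max_level') and state.level > rule.max_level:
        return False
    if any(s not in state.stage for s in rule.stage):
        return False
    if any(s in state.stage for s in rule.not_stage):
        return False
    return True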

@lukeyeager (Member, Author)

Closing in favor of #628. I decided to make a new PR since it's such a different approach.
