
Enhance memory data layer #3589

Closed
junshi15 wants to merge 1 commit into BVLC:master from junshi15:MemoryDataLayer

Conversation

@junshi15

This PR enhances the memory data layer with:

Forward_gpu() is added to avoid an unneeded data copy
parameter "share_in_parallel" for controlling whether data is shared among parallel solvers (default: true)
parameter "source" for specifying the location of the data source, e.g. an HDFS URI

This PR introduces an optional parameter for LayerParameter:

"source_class" ... name of the custom class for parsing the input data source

@junshi15 junshi15 force-pushed the MemoryDataLayer branch 2 times, most recently from 57da235 to 796d996 Compare January 27, 2016 13:34
optional LossParameter loss_param = 101;

// Specify name of source class
optional string source_class = 143;

This should be a field in MemoryDataParameter, not LayerParameter if I understand this patch.
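For illustration, a hedged sketch of what that suggested placement might look like; the existing MemoryDataParameter fields are shown for context, and the tag number 5 is an assumption for illustration, not part of this patch:

```protobuf
message MemoryDataParameter {
  optional uint32 batch_size = 1;
  optional uint32 channels = 2;
  optional uint32 height = 3;
  optional uint32 width = 4;
  // Hypothetical placement of the field proposed in this PR;
  // tag number chosen for illustration only.
  optional string source_class = 5;
}
```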


Agree. Also, this parameter (along with source) needs some more explanation, as right now they seem unused in Caffe. Why are we adding these?


This pull request is motivated by making Caffe work on Spark (via CaffeOnSpark). source_class was used by CaffeOnSpark to decide which particular Scala class to use for handling a given data source. See CaffeOnSpark code.

Our original idea was to support custom sources for data layers beyond MemoryDataLayer. For example, to support NLP use cases, we have introduced a VWDataLayer. That's why we propose an optional field on LayerParameter. By having source_class in LayerParameter, we will enable CaffeOnSpark users to introduce customized parsers for different data layers.

FYI, VWDataParameter is defined as below:
message VWDataParameter {
  optional string source = 1;
  optional uint32 batch_size = 2;
  optional bool shuffle = 3 [default = false];
  message TopLayer {
    optional string name = 1;
    optional string type = 2;
    optional uint32 channels = 3;
    optional uint32 height = 4;
    optional uint32 width = 5;
    optional string vw_namespace = 6;
  }
  repeated TopLayer top = 4;
  optional bool sparse = 5 [default = false];
  optional bool memory_input = 6 [default = false];
}
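As a rough illustration of how such a layer might be configured, a prototxt fragment could look like the sketch below. The layer type name "VWData", the parameter name vw_data_param, and the HDFS path are all assumptions for illustration, not confirmed by this PR:

```protobuf
layer {
  name: "train_data"
  type: "VWData"        # hypothetical type name
  top: "data"
  top: "label"
  vw_data_param {       # hypothetical parameter name
    source: "hdfs://namenode/path/to/dataset"
    batch_size: 64
    shuffle: true
    top { name: "data" channels: 1 height: 1 width: 100 }
    top { name: "label" channels: 1 height: 1 width: 1 }
  }
}
```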

@longjon
Contributor

longjon commented Mar 1, 2016

Comments as noted; also, the logic here does not seem quite right.

In this patch we step through device memory in GPU mode, and host memory in CPU mode. But what if we give this layer a host pointer in GPU mode? (Or, should this even be allowed, a device pointer in CPU mode?) It looks like this patch will break things, even though the former case, at least, works fine as-is (and is a pretty common case!).

So it seems like the action of Forward should really depend on the pointer type, not the mode.

(And given that this is not totally straightforward, it would be nice to have a test to check these cases.)

@junshi15
Author

junshi15 commented Mar 4, 2016

Regarding the last comment, we assume the memory layer gets a host pointer in CPU mode (hence Forward_cpu) and a device pointer in GPU mode (hence Forward_gpu). Is this a good assumption? Maybe we should add a few assertions to check that.

@longjon
Contributor

longjon commented Mar 5, 2016

Regarding the last comment, we assume the memory layer gets a host pointer in CPU mode (hence Forward_cpu) and a device pointer in GPU mode (hence Forward_gpu). Is this a good assumption?

No. As noted above, this assumption is violated in practice. The way I've most commonly used MemoryDataLayer is with a host pointer (the only supported option, right now) in GPU mode. That's desirable in the common case where some data already exists in host memory, and we want to run a GPU computation on it without bothering to copy all of it to device (perhaps because it won't fit, which is why we're batching through it with this layer in the first place). I don't think it's reasonable to break that use case, and I don't see any reason why the layer can't do the right thing for either mode and either pointer type.

@junshi15
Author

junshi15 commented Mar 7, 2016

Can we assume the following:

  1. Host memory is always allocated by CaffeMallocHost(), not by an arbitrary malloc().
  2. In CPU mode, this layer does not take a device pointer, i.e. Forward_cpu() won't handle device pointers.

If those are OK, then we can use cudaPointerGetAttributes() to detect the pointer type in GPU mode.
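A minimal sketch of that detection, assuming a CUDA-enabled build; the function name is made up here, and the pre/post CUDA 10 attribute field naming is handled with a version guard. This is an untested sketch, not a proposed implementation:

```cpp
#include <cuda_runtime.h>

// Sketch: classify a pointer as device or host memory.
// Host allocations that CUDA does not know about make
// cudaPointerGetAttributes return an error, which we clear
// and interpret as "host pointer".
bool IsDevicePointer(const void* ptr) {
  cudaPointerAttributes attr;
  if (cudaPointerGetAttributes(&attr, ptr) != cudaSuccess) {
    cudaGetLastError();  // reset the sticky error state
    return false;        // not known to CUDA: treat as host memory
  }
#if CUDART_VERSION >= 10000
  return attr.type == cudaMemoryTypeDevice;
#else
  return attr.memoryType == cudaMemoryTypeDevice;
#endif
}
```

Forward could then dispatch on this result rather than on the mode, which would preserve the host-pointer-in-GPU-mode use case described above.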

@shelhamer
Member

shelhamer commented Apr 14, 2017

Closing according to #5528. Thanks for the effort to improve this troubled layer all the same.

@shelhamer shelhamer closed this Apr 14, 2017
