Feed the ImageDataLayer with OpenCV images directly from memory by kloudkl · Pull Request #251 · BVLC/caffe

kloudkl · 2014-03-23T07:31:43Z

The discussion of #196 led to the conclusion that the ImagesLayer (#120) by @sguada is more suitable for ingesting raw images. Following @Yangqing's suggestion, this PR extends ImagesLayer to accept images in OpenCV format, which computer vision researchers and engineers use very commonly.

kloudkl · 2014-03-25T00:03:17Z

Rebased and ready to be merged.

kloudkl · 2014-04-03T06:07:05Z

@shelhamer, this has been rebased and tested again. Please invite a reviewer. Thanks!

shelhamer · 2014-04-04T04:27:20Z

@sguada could you review this since you reviewed #196 and have been working on the data layers?

@kloudkl could you squash 713bb18, 148dbdb, and 6354a65 to clean up the history? Just a minor style point.

kloudkl · 2014-04-04T10:02:20Z

Squashed and tested.

shelhamer · 2014-05-02T22:25:42Z

@longjon is this superseded by #294? Thanks for the work even if so @kloudkl.

longjon · 2014-05-02T22:42:34Z

@shelhamer, this PR understands two things which #294 does not:

OpenCV datatypes
std::vectors and non-contiguous data

If this functionality is desired in caffe, it probably makes the most sense to generalize #294 in some way.

onauparc · 2014-06-23T16:13:10Z

@kloudkl could you resolve conflicts? I'd like to give it a try.

kloudkl · 2014-06-24T04:53:02Z

@onauparc, you can try it now.

shelhamer · 2014-06-24T05:28:30Z

@longjon I've heard a desire for OpenCV input come up a few times now. Could you review this and decide whether to merge or guide adapting this into a generalization of #294? Thanks!

kloudkl · 2014-06-24T05:53:25Z

The ImageDataLayer does a series of necessary pre-processing steps. The more general solution would probably have to wait until @Yangqing's new design is finalized.

longjon · 2014-06-25T01:54:19Z

@shelhamer (and others): If I read correctly, this patch modifies ImageDataLayer by implementing OpenCVImageToDatum, which duplicates ReadImageToDatum, omitting the file reading step.

This all seems a bit convoluted; we start with a cv::Mat, which is row-major storage, then convert to a protobuf format, then back to a row-major blob! This is true both for this patch and for the existing ImageDataLayer. (I guess there is an implicit transposition, but that should be abstracted anyway).
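The implicit transposition mentioned above can be sketched without OpenCV or Caffe. This is only an illustration under stated assumptions: `InterleavedImage` is a hypothetical stand-in for a continuous 8-bit cv::Mat (rows stored one after another, channels interleaved per pixel), and `ToPlanarFloat` is a made-up helper name, not a function from this patch. It reorders H x W x C bytes into the C x H x W float layout that one image occupies inside a blob.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a continuous 8-bit cv::Mat: pixels are stored
// row by row with channels interleaved (H x W x C), e.g. B,G,R,B,G,R,...
struct InterleavedImage {
  int height;
  int width;
  int channels;
  std::vector<std::uint8_t> pixels;  // size = height * width * channels
};

// Reorder interleaved H x W x C bytes into planar C x H x W floats,
// i.e. the layout of a single image inside an N x C x H x W blob.
std::vector<float> ToPlanarFloat(const InterleavedImage& img) {
  const std::size_t expected =
      static_cast<std::size_t>(img.height) * img.width * img.channels;
  assert(img.pixels.size() == expected);  // buffer must match the shape

  std::vector<float> planar(expected);
  for (int h = 0; h < img.height; ++h) {
    for (int w = 0; w < img.width; ++w) {
      for (int c = 0; c < img.channels; ++c) {
        // Source index: channel varies fastest.
        const std::size_t src =
            (static_cast<std::size_t>(h) * img.width + w) * img.channels + c;
        // Destination index: column varies fastest, channel slowest.
        const std::size_t dst =
            (static_cast<std::size_t>(c) * img.height + h) * img.width + w;
        planar[dst] = static_cast<float>(img.pixels[src]);
      }
    }
  }
  return planar;
}
```

For a 1 x 2 image with three channels and bytes 1,2,3,4,5,6, the result is 1,4,2,5,3,6: each channel becomes its own plane. Mean subtraction, scaling, and cropping are deliberately left out of this sketch.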

I think this should be folded into @Yangqing's plan at #407 (comment). Of course, probably neither this layer nor MemoryDataLayer needs prefetching, but it doesn't hurt. This layer would be implemented by simply stepping over a vector<cv::Mat>, similar to the way MemoryDataLayer is implemented. ImageDataLayer would be implemented similarly, except the cv::Mats would be read from disk. ReadImageToDatum would go away.

One can read from cv::Mats today, using MemoryDataLayer, either updating the data blob-at-a-time or copying all data into contiguous memory. Yes, one has to do the preprocessing that DataLayer currently does oneself, but since we seem to be moving in the direction of preprocessing outside the data layer (which I totally support), it might be worthwhile to just implement desired in-network preprocessing using existing (or new) layers.
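The "copying all data into contiguous memory" option can be sketched as follows. This is a rough illustration, not code from this PR: `PackBatch` is a hypothetical helper, and it assumes every image has already been converted to floats in C x H x W order and shares the same shape. The resulting buffer is the kind of contiguous array one would then hand to MemoryDataLayer, together with a matching label array; the exact call and its constraints should be checked against the Caffe version in use.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Copy separately allocated images (each already C x H x W floats) into one
// contiguous N x C x H x W buffer. image_size is C * H * W.
std::vector<float> PackBatch(const std::vector<std::vector<float> >& images,
                             std::size_t image_size) {
  std::vector<float> batch(images.size() * image_size);
  for (std::size_t n = 0; n < images.size(); ++n) {
    // All images must share one shape, otherwise the offsets are wrong.
    assert(images[n].size() == image_size);
    std::copy(images[n].begin(), images[n].end(),
              batch.begin() + static_cast<std::ptrdiff_t>(n * image_size));
  }
  return batch;
}
```

Packing once up front trades extra memory for a single pointer that stays valid while the network runs; the blob-at-a-time alternative avoids the copy of the whole dataset but requires updating the input before each forward pass.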

shelhamer · 2014-06-27T21:18:37Z

@longjon totally agree. @Yangqing's data layer refactoring will be the foundation to bring this all together, separate pre-processing, and make our lives easier.

bhack · 2014-07-23T10:53:09Z

@longjon I agree that this is not efficient. So, do we need to wait for #407?

kloudkl · 2014-08-09T11:15:43Z

In the latest ImageDataLayer, the OpenCV images in the Mat format are transformed into Blobs directly and the source parameter doesn't have to be set anymore (#499 (comment)).

So far, the data layer refactoring project has made no observable progress. This PR is still the only available implementation that automatically transforms OpenCV images and passes them into the network. Would you please review this again and decide whether to merge or not?

shelhamer · 2014-08-09T17:24:04Z

@longjon please review and decide on merge. Thanks for the rebase and alternative data layer work @kloudkl.

bhack · 2014-08-22T09:16:00Z

@kloudkl I think that #954, which was merged into dev, impacts this PR.

bhack · 2014-08-22T12:35:25Z

I think this is one of those cases where a newer PR imposes rebasing work on an older PR only because it was reviewed faster. I know that, as in other open source projects, reviews are done on a best-effort basis, but I think we could adopt a small policy to incentivize PR maintenance, especially when PRs stay open for months and affect one another.

kloudkl · 2014-08-27T11:23:07Z

@bhack, now that #954 extracted the duplicated data transformations out of the old data layers, the purpose of this PR can be better achieved with the new DataTransformer and the MemoryDataLayer as @shelhamer suggested in #941. So this should be closed.

bhack · 2014-08-27T12:11:57Z

@kloudkl If you want to close this, can you add a short doc on how to interface an OpenCV Mat with MemoryDataLayer?

kloudkl · 2014-08-29T03:52:40Z

Development continues in #995.

This was referenced Mar 23, 2014

Add memory data layer to pass data directly into the network #196

Closed

Add WARPLossLayer and gradient check test cases #126

Closed

shelhamer assigned sguada Apr 4, 2014

bhack mentioned this pull request Jun 17, 2014

how to make a prediction in C++ #499

Closed

shelhamer assigned longjon and unassigned sguada Jun 24, 2014

kloudkl changed the title ~~Feed ImagesLayer with OpenCV images directly from memory~~ Feed the ImageDataLayer with OpenCV images directly from memory Jun 24, 2014

shuokay mentioned this pull request Jun 26, 2014

ImageDataParameter has no field specify whether input images are color or grayscale #538

Closed

kloudkl added 9 commits August 9, 2014 18:59

Adapt ReadImageToDatum into OpenCVImageToDatum, fix Size bug and test it

cf99705

Make it possible to directly pass images from memory into ImagesLayer

e95e71b

Rebase and change the ImagesLayer::Forward/Backward signatures

7d4bc98

Check image data param has source before joining or creating pthread

f2318d2

Resolve the rebase merge conflicts to catch up with the bleeding edge

910ada8

Fix the compilation errors & warnings due to rebasing

f7e4987

Add utility function to directly convert OpenCV image into Blob

1f06343

Add io function to directly read OpenCV image to Blob

fab0725

Directly convert OpenCV image to Blob in the ImageDataLayer

fd0ffe3

Explicitly config if the ImageDataLayer gets data from disk or memory

28fe152

The ImageDataLayer can also be set up with data blob from memory

db7e720

kloudkl mentioned this pull request Aug 17, 2014

boost::python vs. cython and Python interface preprocessing profiling and improvement #941

Closed

sh1r0 added a commit to sh1r0/caffe that referenced this pull request Aug 24, 2014

merge caffe PR BVLC#251 to feed ImageDataLayer from memory

b5e8e17

kloudkl closed this Aug 27, 2014

6 participants