Rewrite crop layer GPU implementation by erictzeng · Pull Request #5548 · BVLC/caffe

erictzeng · 2017-04-19T01:36:05Z

The crop layer currently in Caffe is really slow on GPU. For example, in fcn8s on a 500x375 image, the final crop layer alone takes 8.3ms out of a 65.1ms forward pass (12.7%)!

This seems to be because the original GPU implementation is a fairly faithful reproduction of the CPU version. The CPU version is a series of recursive calls that eventually delegate to caffe_copy to copy a contiguous portion of the crop. The original GPU version is thus a similar series of recursive calls that eventually delegate to a CUDA kernel. This ends up being horribly inefficient in practice, since we are forced to sync after each copy, and we do a large number of copies (one for each leaf of the recursion tree).
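To make the cost of the recursive structure concrete, here is a minimal CPU-side sketch in C++. It is not the Caffe source: `crop_recursive` and its parameters are hypothetical names, `std::copy` stands in for caffe_copy, and a row-major layout is assumed. It returns the number of leaf copies so the count is easy to see.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper (not the Caffe implementation): copies the crop window
// from src into dst one contiguous innermost run at a time, and returns how
// many separate copies were issued. Shapes must be non-empty and row-major.
std::size_t crop_recursive(const std::vector<float>& src,
                           const std::vector<int>& src_shape,
                           const std::vector<int>& dst_shape,
                           const std::vector<int>& offsets,
                           std::vector<float>& dst,
                           std::size_t axis = 0,
                           std::size_t src_base = 0,
                           std::size_t dst_base = 0) {
  const std::size_t last = src_shape.size() - 1;
  if (axis == last) {
    // Leaf of the recursion tree: one contiguous run along the last axis.
    const float* from = src.data() + src_base + offsets[axis];
    std::copy(from, from + dst_shape[axis], dst.data() + dst_base);
    return 1;
  }
  std::size_t copies = 0;
  for (int i = 0; i < dst_shape[axis]; ++i) {
    // Extend the row-major index prefix by one axis for both blobs.
    const std::size_t s =
        (src_base + offsets[axis] + i) * src_shape[axis + 1];
    const std::size_t d = (dst_base + i) * dst_shape[axis + 1];
    copies += crop_recursive(src, src_shape, dst_shape, offsets, dst,
                             axis + 1, s, d);
  }
  return copies;
}
```

The number of leaf copies is the product of all output dimensions except the last, so if each leaf becomes its own kernel launch followed by a sync, the overhead grows with the size of the blob.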

This PR rewrites the GPU implementation to do the entire operation in a single kernel call. Under the same conditions as before, the new implementation takes 0.3ms for a forward pass, which is roughly a 28x speedup. In practice, the speedup depends on the size of the input, with the largest gains on the largest input blobs.
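The single-call idea can be sketched on the CPU as an index mapping: each output element works out its own source index, so no recursion is needed. This is an illustration under assumptions, not the kernel from this PR. `crop_src_index` and `crop_flat` are hypothetical names, the layout is assumed row-major, and the loop in `crop_flat` stands in for what would be one thread per element on the GPU.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: maps a flat index into the cropped output to the flat
// index of the matching input element (row-major layout assumed).
std::size_t crop_src_index(std::size_t dst_index,
                           const std::vector<int>& src_shape,
                           const std::vector<int>& dst_shape,
                           const std::vector<int>& offsets) {
  std::size_t src_index = 0;
  std::size_t src_stride = 1;
  // Peel coordinates off the flat index from the innermost axis outward.
  for (std::size_t k = src_shape.size(); k-- > 0;) {
    const std::size_t extent = static_cast<std::size_t>(dst_shape[k]);
    const std::size_t coord = dst_index % extent;
    dst_index /= extent;
    src_index += (coord + static_cast<std::size_t>(offsets[k])) * src_stride;
    src_stride *= static_cast<std::size_t>(src_shape[k]);
  }
  return src_index;
}

// Stand-in for a single kernel launch: every output element is independent,
// so on a GPU each iteration of this loop could be handled by one thread.
void crop_flat(const std::vector<float>& src,
               const std::vector<int>& src_shape,
               const std::vector<int>& dst_shape,
               const std::vector<int>& offsets,
               std::vector<float>& dst) {
  for (std::size_t i = 0; i < dst.size(); ++i) {
    dst[i] = src[crop_src_index(i, src_shape, dst_shape, offsets)];
  }
}
```

Because the work is expressed per output element, the whole crop maps onto one launch regardless of how many axes are cropped.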

I think this should be good to go. Let me know if anything seems off.

shelhamer · 2017-05-04T23:55:48Z

Thanks for the speed-up, Eric!

erictzeng added 2 commits April 18, 2017 18:24

Rewrite crop cuda kernel

33f8612

Fix crop layer lint errors

cd1696d

shelhamer merged commit 7d3f8a7 into BVLC:master May 4, 2017

nagadomi mentioned this pull request Nov 22, 2018

An amazing new model, "CUNET", has been released upstream lltcggie/waifu2x-caffe#132

Closed

shelhamer merged 2 commits into BVLC:master from erictzeng:crop
