Fallback to different cuDNN algorithm when under memory pressure #2211
shelhamer merged 1 commit into BVLC:master
Conversation
CUDNN_CONVOLUTION_FWD_PREFER_FASTEST requires a lot of GPU memory, which may not always be available. Add a fallback path that uses CUDNN_CONVOLUTION_FWD_ALGO_IMPLICIT_GEMM when the allocation fails.
I can confirm this works for models like CaffeNet where #2038 runs out of memory.
Will this always rule out the cuDNN GEMM convolution? At least in the Caffe GEMM convolution the workspace is the kernel dimensions (kernel_h * kernel_w * channels) times the output dimensions (height_out_ * width_out_), as in https://github.com/BVLC/caffe/blob/master/src/caffe/layers/base_conv_layer.cpp#L147, although in the cuDNN implementation I suppose the workspace could be just the input data, so the +1 allows it here.
Yes, my understanding is that the workspace size depends only on the input data size.
The intent is that GEMM convolution will still be chosen if possible. However, in practice, we expect that there won't be enough memory available in many use cases.
Thanks for the fix @nsubtil! Please address my inline comments for merge.

It sounds like no further code changes are actually required here. If you're satisfied with my replies, I think this should be ready to merge. Thanks!
Fallback to different cuDNN algorithm when under memory pressure; fix #2197
CUDNN_CONVOLUTION_FWD_PREFER_FASTEST requires a lot of GPU memory, which may
not always be available. Add a fallback path that uses
CUDNN_CONVOLUTION_FWD_ALGO_IMPLICIT_GEMM when the allocation fails.