Conversation

@ryudrigo ryudrigo commented Sep 5, 2022

And polishes memory-efficient attention in general.

Enables e.g. 1024px generation on 8 GB.

Inspired by comments by @Doggettx on #117

Currently, attn_step is set to 1. If you want more speed and less memory efficiency, you'd have to change that in ldm/modules/attention.py, line 153.

@basujindal I didn't want to change the CLI or gradio commands; if you'd like attn_step to be exposed as a parameter, I can modify the PR.
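For anyone reading along, here is a rough sketch of the slicing idea (my paraphrase, not the actual code in ldm/modules/attention.py; the function name and the reading of attn_step as "query rows processed per slice" are assumptions):

```python
import torch

def sliced_attention(q, k, v, attn_step=1):
    # q, k, v: (batch, seq_len, dim); attn_step = query rows per slice (assumed meaning)
    scale = q.shape[-1] ** -0.5
    n = q.shape[1]
    out = torch.empty_like(q)
    for start in range(0, n, attn_step):
        end = min(start + attn_step, n)
        # attention scores for this slice only: (batch, end - start, seq_len)
        scores = torch.einsum('bid,bjd->bij', q[:, start:end] * scale, k)
        # softmax over keys, then weighted sum of values, for just these rows
        out[:, start:end] = torch.einsum('bij,bjd->bid', scores.softmax(dim=-1), v)
    return out
```

The point is that only one slice of the attention matrix exists in memory at a time, which is what lets larger resolutions fit on 8 GB.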

TheEnhas commented Sep 6, 2022

Probably a good idea to leave it at 1 IMO, since this fork seems to be about having the best memory efficiency at the cost of speed, which is great for low-VRAM GPUs, big generations, or just general use without worrying about OOM errors (I have an 8 GB 3060 Ti and mostly use this for the latter reason). And there's still turbo mode on top of that for only 1 GB more VRAM; I almost always use it and it's much faster.

@basujindal
Owner

Hi, thanks a lot for creating a pull request for these changes. Before I merge, can you please remove the changes to inpaint_gradio.py? If I'm not wrong, it's the old inpaint file from before the changes in the last commit. Thanks!

Author

ryudrigo commented Sep 6, 2022

I didn't notice I was leaving the new inpainting out. Thanks! Just corrected it.

Author

ryudrigo commented Sep 6, 2022

Please don't merge just yet -- I need to uncomment the mask code and test it

@rockerBOO

I have tested this with txt2img at 1024x1024 on a 1080 8 GB and it works great.

@basujindal basujindal merged commit d154155 into basujindal:main Sep 7, 2022
Author

ryudrigo commented Sep 7, 2022

Still working on that masking code as part of a larger inpainting PR.

So far, the code I commented out is not used by the inpainting script, so it won't make a difference. But if anyone uses this in another repo, please look at the linked issue (#129) and at the code.

@TingTingin

Do you adjust the attention steps up or down, and does it have to be an int or can it be a float?

Author

ryudrigo commented Sep 7, 2022

> Do you adjust the attention steps up or down, and does it have to be an int or can it be a float?

Not sure if I understood the question, but I'll try to answer as best I can. I introduced the parameter att_steps, which has to be an int. You can test it if you want to check, but from what I've seen there is not much reason to use a value greater than 1; the extra delay at 1 is very small.
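If you want to put numbers on that for your own GPU, something like the following (reusing the sliced_attention sketch from the PR description above; the sizes are illustrative, not measurements from this PR) shows the speed side of the trade-off:

```python
import time
import torch

# assumes the sliced_attention sketch from the earlier comment is in scope
q = k = v = torch.randn(1, 4096, 64)  # ~64x64 latent -> 4096 tokens (illustrative)

for step in (1, 16, 256):
    t0 = time.time()
    sliced_attention(q, k, v, attn_step=step)
    print(f"attn_step={step}: {time.time() - t0:.3f}s")
```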

TingTingin commented Sep 7, 2022

Sorry for not being clear, I was referring to the quote below. If it has to be an int, I guess it can only go up, so you already answered.

> Currently, attn_step is set to 1. If you want more speed and less memory efficiency, you'd have to change that in ldm/modules/attention.py, line 153.

Author

ryudrigo commented Sep 7, 2022

Oh, all right! I tested it more and found that the speed improvement is really small (less than 10%), so I'd just leave it at 1.

@TingTingin

Yeah on my system it didn't seem to show any significant change either

@remybonnav

I cannot find your modified attention.py. It seems that your stable-diffusion repo is offline.
