Conversation

@martinobettucci

No description provided.

Doggettx and others added 22 commits September 5, 2022 09:26
Only the model needs to be wrapped with PromptGuidanceModelWrapper now, followed by a call to prepare_prompts. Samplers no longer need to be changed; for example, see the changes in txt2img.py.

Special format inside prompts:

[sentence1:sentence2:step] will swap sentence1 for sentence2 at step
[sentence:step] will add sentence at step
[:sentence:step] will remove sentence at step
[sentence] will add sentence at step 0 (only useful for negative prompts)

When a sentence starts with - it is treated as a negative prompt.

{scale:step} will switch to the given guidance scale at step; this does not work if the initial guidance scale was 1.0.
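A minimal sketch of how such bracketed edit directives could be parsed. The function names, the returned tuple layout, and the choice of which sentence stays in the initial prompt are assumptions for illustration, not the PR's actual implementation:

```python
import re

# Hypothetical parser for the prompt-edit syntax described above.
# Matches [a:b:step], [a:step], [:a:step], and [a].
EDIT_RE = re.compile(r"\[([^\[\]]*)\]")
# Matches {scale:step}, e.g. {12.5:15}.
SCALE_RE = re.compile(r"\{([0-9.]+):(\d+)\}")

def parse_edits(prompt):
    """Return (cleaned_prompt, edits); each edit is (action, old, new, step).

    A sentence starting with '-' would be treated as a negative prompt;
    that is not handled here.
    """
    edits = []

    def handle(match):
        parts = match.group(1).split(":")
        if len(parts) == 3 and parts[0] == "":
            # [:sentence:step] removes sentence at step; assumed to be
            # present in the prompt until then.
            edits.append(("remove", parts[1], None, int(parts[2])))
            return parts[1]
        if len(parts) == 3:
            # [s1:s2:step] swaps s1 for s2 at step; s1 is active initially.
            edits.append(("swap", parts[0], parts[1], int(parts[2])))
            return parts[0]
        if len(parts) == 2:
            # [sentence:step] adds sentence at step.
            edits.append(("add", parts[0], None, int(parts[1])))
            return ""
        # [sentence] adds sentence at step 0.
        edits.append(("add", parts[0], None, 0))
        return ""

    cleaned = EDIT_RE.sub(handle, prompt).strip()
    return cleaned, edits

def parse_scale_switches(prompt):
    """Return (cleaned_prompt, [(scale, step), ...]) for {scale:step} markers."""
    switches = [(float(s), int(t)) for s, t in SCALE_RE.findall(prompt)]
    return SCALE_RE.sub("", prompt).strip(), switches
```

A sampler wrapper could then consult the parsed edits at each denoising step and re-encode the conditioning whenever an edit's step is reached.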
Changed the attention code to the approach used in diffusers
* Update attention.py

Run attention in a loop to allow for much higher resolutions (over 1920x1920 on a 3090)

* Update attention.py

Correction to comment

* Update attention.py

* Fixed memory handling for model.decode_first_stage

Better memory handling for model.decode_first_stage so it no longer crashes after rendering reaches 100%

* Fixed free memory calculation

Old version gave incorrect free memory results, causing crashes in edge cases.

* Set model to half

Set model to half in txt2img and img2img for less memory usage.

* Commented out debug info

Forgot to comment out debug info

* Raise error when steps too high

Technically you could run at higher step counts as long as the resolution is divisible by the steps, but you would run into memory issues later on anyway.

* Added max. res info to memory exception

* Reverted in place tensor functions back to CompVis version

Improves performance and is no longer needed.

* Missed one function to revert

* Update README.md

* Performance boost and fix sigmoid for higher resolutions

Significant performance boost at higher resolutions when running in auto_cast or half mode

on 3090 went from 1.13it/s to 1.63it/s at 1024x1024

Will also allow for higher resolutions due to the sigmoid fix and halved memory usage

Co-authored-by: Doggettx <Doggettpm@protonmail.com>
Co-authored-by: Doggettx <110817577+Doggettx@users.noreply.github.com>
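The "run attention in a loop" commit above computes attention over the queries in slices, so the full scores matrix never has to exist at once and peak memory stays bounded even at resolutions over 1920x1920. A pure-Python sketch of the idea; the slicing scheme and function names are illustrative assumptions, not the PR's code (which operates on GPU tensors and picks the slice count from available memory):

```python
import math

def softmax(row):
    # Numerically stable softmax over one row of scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    s = sum(exps)
    return [e / s for e in exps]

def attention(q, k, v):
    """Plain attention: for each query row, softmax(q . k^T) . v."""
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) for kj in k]
        weights = softmax(scores)
        out.append([sum(w * vj[d] for w, vj in zip(weights, v))
                    for d in range(len(v[0]))])
    return out

def sliced_attention(q, k, v, slice_size):
    """Same result, but processes the queries in slices of `slice_size` rows.

    The scores held at any moment are slice_size x len(k) instead of
    len(q) x len(k) -- the memory/speed trade the commit describes.
    """
    out = []
    for start in range(0, len(q), slice_size):
        out.extend(attention(q[start:start + slice_size], k, v))
    return out
```

Because each query row attends independently, slicing changes only peak memory, never the output, which is why the optimization is safe at any resolution.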
prompt2prompt: easier to implement, more options
Signed-off-by: Martino Bettucci <martinobettucci@users.noreply.github.com>
@martinobettucci martinobettucci deleted the martinobettucci-patch-1 branch October 26, 2022 17:16