Enabling gradient checkpointing for VAE #2536

aandyw · 2023-03-02T21:17:32Z

PR for issue #1883

src/diffusers/models/unet_2d_condition.py

src/diffusers/models/vae.py

patrickvonplaten

Let's try to only add gradient checkpointing for the blocks - only there we can really save memory. It doesn't help much to add it to individual layers, such as self.conv_in

@williamberman could you also have a look?
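As a rough illustration of the block-level approach suggested above, the sketch below wraps only the multi-layer blocks in torch.utils.checkpoint.checkpoint and leaves the single input convolution alone. ToyEncoder and its gradient_checkpointing flag are hypothetical stand-ins, not the diffusers VAE code, and use_reentrant=False assumes a reasonably recent PyTorch.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


class ToyEncoder(nn.Module):
    """Hypothetical stand-in for an encoder; not the diffusers implementation."""

    def __init__(self, channels: int = 8, num_blocks: int = 3):
        super().__init__()
        self.conv_in = nn.Conv2d(3, channels, kernel_size=3, padding=1)
        self.blocks = nn.ModuleList(
            [
                nn.Sequential(
                    nn.Conv2d(channels, channels, kernel_size=3, padding=1),
                    nn.SiLU(),
                    nn.Conv2d(channels, channels, kernel_size=3, padding=1),
                )
                for _ in range(num_blocks)
            ]
        )
        self.gradient_checkpointing = False

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Checkpointing a single layer saves essentially nothing, because its
        # input has to be kept either way, so conv_in runs normally.
        x = self.conv_in(x)
        for block in self.blocks:
            if self.training and self.gradient_checkpointing:
                # Activations inside the block are dropped after the forward
                # pass and recomputed during the backward pass.
                x = checkpoint(block, x, use_reentrant=False)
            else:
                x = block(x)
        return x


torch.manual_seed(0)
model = ToyEncoder().train()
sample = torch.randn(2, 3, 16, 16)

out_plain = model(sample)            # regular forward pass
model.gradient_checkpointing = True
out_ckpt = model(sample)             # checkpointed forward pass
out_ckpt.sum().backward()            # backward recomputes each block
```

Both forward passes should give the same output; only the memory/compute trade-off during backward differs.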

aandyw · 2023-03-08T18:37:33Z

> Let's try to only add gradient checkpointing for the blocks - only there we can really save memory. It doesn't help much to add it to individual layers, such as self.conv_in
>
> @williamberman could you also have a look?

Got it. Thanks for the feedback.

patrickvonplaten · 2023-03-13T10:11:26Z

examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py

    if args.gradient_checkpointing:
        unet.enable_gradient_checkpointing()
+        vae.enable_gradient_checkpointing()
+    else:


Note that gradient checkpointing doesn't necessarily mean that the model is training

How exactly is fine-tuning the VAE done in train_text_to_image?

It seems like the issue #1883 wants to enable gradient checkpointing specifically for training VAE independently so should we add in some sort of argument?

src/diffusers/models/modeling_utils.py

src/diffusers/models/vae.py

src/diffusers/models/autoencoder_kl.py

patrickvonplaten

The PR is in much better shape now, thanks!

Could you try to revert the reformatting of unrelated functions and then let's make sure to always freeze the unet and vae of train_text_to_image just like before. Note that gradient checkpointing still helps even if the weights are frozen :-)

Good job! Think we're on the final stretch now

aandyw · 2023-03-13T17:28:52Z

> The PR is in much better shape now, thanks!
>
> Could you try to revert the reformatting of unrelated functions and then let's make sure to always freeze the unet and vae of train_text_to_image just like before. Note that gradient checkpointing still helps even if the weights are frozen :-)
>
> Good job! Think we're on the final stretch now

Thanks for the feedback! I really appreciate it.

Apologies for the formatting. I'm using black and it seems to really want these functions formatted this way. I might have missed this, but is there a recommended linter/formatter to use?

patrickvonplaten · 2023-03-13T19:40:31Z

@pie31415 could you try to use the newest black version, i.e.:

diffusers/setup.py

Line 84 in 4ae54b3

"black~=23.1",

- does this correspond to the version you have? :-)
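One way to answer the version question locally is sketched below. It reads the installed black version through importlib.metadata and checks it against the "~=23.1" pin quoted above. The helper names are hypothetical, and the comparison is a simplified take on the compatible-release rule that only looks at the numeric part of the version.

```python
import re
from importlib import metadata


def release_tuple(version: str) -> list:
    """Extract the leading numeric release part, e.g. '23.1.0' -> [23, 1, 0]."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return [int(part) for part in match.group(0).split(".")]


def satisfies_compatible_release(installed: str, pinned: str) -> bool:
    """Simplified check for `~=pinned`: same prefix, and not older than the pin."""
    have = release_tuple(installed)
    want = release_tuple(pinned)
    prefix = want[:-1]  # for "23.1" the prefix is [23]
    return have[: len(prefix)] == prefix and have >= want


def installed_black_version():
    """Return the installed black version string, or None if black is absent."""
    try:
        return metadata.version("black")
    except metadata.PackageNotFoundError:
        return None


version = installed_black_version()
if version is None:
    print("black is not installed in this environment")
else:
    ok = satisfies_compatible_release(version, "23.1")
    print(f"black {version} satisfies ~=23.1: {ok}")
```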

aandyw · 2023-03-13T23:04:18Z

Fixed the formatting issue. It was caused by CRLF line endings.

However, some of the reformatting still comes from black (format on save), e.g.:

tile = z[:, :, i : i + self.tile_latent_min_size, j : j + self.tile_latent_min_size]

becomes

tile = z[
    :,
    :,
    i : i + self.tile_latent_min_size,
    j : j + self.tile_latent_min_size,
]
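For the CRLF problem mentioned in this comment, a small standard-library sketch of detecting and normalizing line endings might look like the following. The helper names and the demo file are hypothetical. In a real checkout, a Git line-ending setting or an editor option is likely the more usual fix.

```python
import tempfile
from pathlib import Path


def has_crlf(path: Path) -> bool:
    """Return True if the file contains any CRLF line ending."""
    return b"\r\n" in path.read_bytes()


def convert_to_lf(path: Path) -> bool:
    """Rewrite the file with LF endings; return True if anything changed."""
    data = path.read_bytes()
    fixed = data.replace(b"\r\n", b"\n")
    if fixed == data:
        return False
    path.write_bytes(fixed)
    return True


# Demonstration on a throwaway file with Windows-style line endings.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "demo.py"
    demo.write_bytes(b"x = 1\r\ny = 2\r\n")
    before = has_crlf(demo)
    changed = convert_to_lf(demo)
    after = demo.read_bytes()
```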

patrickvonplaten · 2023-03-14T15:59:40Z

examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py

    if args.gradient_checkpointing:
        unet.enable_gradient_checkpointing()
+        vae.enable_gradient_checkpointing()
+        vae.requires_grad_(True)


Suggested change

vae.requires_grad_(True)

I think this should help even without requires_grad_, since gradients are still passed down.
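The point about frozen weights can be sketched with a toy setup: a trainable module feeds a frozen block, the frozen block is checkpointed, and the gradient still reaches the trainable weights. The module names here are hypothetical and this is not the train_text_to_image code. use_reentrant=False assumes a reasonably recent PyTorch.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint

torch.manual_seed(0)

# Trainable module whose output flows through a frozen block.
upstream = nn.Linear(8, 8)
frozen_block = nn.Sequential(nn.Linear(8, 32), nn.SiLU(), nn.Linear(32, 8))
frozen_block.requires_grad_(False)  # weights frozen, as in the discussion

x = torch.randn(4, 8)


def upstream_grad(use_checkpoint: bool) -> torch.Tensor:
    """Run forward/backward and return the gradient of the trainable weight."""
    upstream.zero_grad()
    h = upstream(x)
    if use_checkpoint:
        # The frozen block's activations are recomputed during backward
        # instead of being stored during forward.
        out = checkpoint(frozen_block, h, use_reentrant=False)
    else:
        out = frozen_block(h)
    out.sum().backward()
    return upstream.weight.grad.clone()


grad_plain = upstream_grad(use_checkpoint=False)
grad_ckpt = upstream_grad(use_checkpoint=True)
```

The frozen block receives no parameter gradients in either case, but the backward pass still has to go through it, which is where checkpointing can trade compute for memory.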

src/diffusers/models/vae.py

patrickvonplaten

Cool this now looks good to me!

Last thing we should do is to add a simple test that makes sure that gradient checkpointing works. Maybe you can get some inspiration from:

diffusers/tests/models/test_models_unet_2d_condition.py

Line 125 in 07a0c1c

def test_gradient_checkpointing(self):

and add a similar test here:
https://github.com/huggingface/diffusers/blob/main/tests/models/test_models_vae.py

After that we can merge I think :-)
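A minimal sketch of what such a check could look like, assuming a toy stack of blocks rather than the real VAE: two copies with identical weights are run with and without checkpointing, and the loss and parameter gradients are compared. The function names and tolerance are hypothetical and not taken from the diffusers test suite.

```python
import copy

import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


def forward_blocks(blocks, x, use_checkpointing: bool):
    """Run the blocks in sequence, optionally checkpointing each one."""
    for block in blocks:
        if use_checkpointing:
            x = checkpoint(block, x, use_reentrant=False)
        else:
            x = block(x)
    return x


def check_gradient_checkpointing(atol: float = 1e-6) -> bool:
    torch.manual_seed(0)
    blocks = nn.ModuleList(
        [nn.Sequential(nn.Linear(8, 8), nn.SiLU()) for _ in range(3)]
    )
    blocks_ckpt = copy.deepcopy(blocks)  # second copy with identical weights
    x = torch.randn(4, 8)
    target = torch.randn(4, 8)

    loss = ((forward_blocks(blocks, x, False) - target) ** 2).mean()
    loss.backward()
    loss_ckpt = ((forward_blocks(blocks_ckpt, x, True) - target) ** 2).mean()
    loss_ckpt.backward()

    # Checkpointing should change neither the loss nor any gradient.
    assert torch.allclose(loss, loss_ckpt, atol=atol)
    pairs = zip(blocks.named_parameters(), blocks_ckpt.named_parameters())
    for (name, param), (_, param_ckpt) in pairs:
        assert torch.allclose(param.grad, param_ckpt.grad, atol=atol), name
    return True


result = check_gradient_checkpointing()
```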

williamberman · 2023-03-16T08:54:15Z

wow super cool :) I learned how our gradient checkpointing works from this PR 😁 will approve once test is written

aandyw · 2023-03-16T19:33:15Z

The CI tests for macOS seem to be failing. Is there any way to fix this at the moment?

williamberman · 2023-03-17T20:59:24Z

macos tests are unrelated, looks great

patrickvonplaten · 2023-03-21T13:07:09Z

Good job @pie31415

* updated black format
* update black format
* make style format
* updated line endings
* update code formatting
* Update examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/vae.py
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/vae.py
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* added vae gradient checkpointing test
* make style

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Will Berman <wlbberman@gmail.com>

aandyw changed the title ~~Enabling gradient checkpointing for VAE~~ [WIP] Enabling gradient checkpointing for VAE Mar 2, 2023

aandyw changed the title ~~[WIP] Enabling gradient checkpointing for VAE~~ Enabling gradient checkpointing for VAE Mar 2, 2023

aandyw marked this pull request as ready for review March 2, 2023 23:20

aandyw changed the title ~~Enabling gradient checkpointing for VAE~~ [WIP] Enabling gradient checkpointing for VAE Mar 3, 2023

aandyw changed the title ~~[WIP] Enabling gradient checkpointing for VAE~~ Enabling gradient checkpointing for VAE Mar 3, 2023

aandyw changed the title ~~Enabling gradient checkpointing for VAE~~ [WIP] Enabling gradient checkpointing for VAE Mar 3, 2023

patrickvonplaten reviewed Mar 7, 2023

patrickvonplaten requested a review from williamberman March 7, 2023 10:28

aandyw commented Mar 8, 2023

aandyw requested review from patrickvonplaten and removed request for williamberman March 10, 2023 16:04