Skip to content

Conversation

@mh-dm
Copy link
Contributor

@mh-dm mh-dm commented Sep 10, 2022

Tested on nvidia eGPU setup so YMMV with the default half precision math. Speed from 1.69it/s to 1.89it/s and max VRAM from 4.44G to 3.37G for generating 512x512 images. Measured after applying a separate PR #484 Move model.half() before model.to(device)

…ax VRAM

Tested on nvidia eGPU setup so YMMV with the default half precision math.
Speed from 1.69it/s to 1.89it/s and max VRAM from 4.44G to 3.37G for generating 512x512 images. Measured after applying a separate PR #484 Move model.half() before model.to(device)
@tildebyte
Copy link
Contributor

If this is identical to #495, please close this one. Only bugfixes should target 'main'.

@lstein lstein closed this Sep 11, 2022
@mh-dm mh-dm deleted the float16 branch September 12, 2022 10:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants