-
Notifications
You must be signed in to change notification settings - Fork 6.7k
Description
Currently, using Stable Diffusion with huggingface/diffusers often fails due to VRAM shortage, even when using a 16GB GPU, and even when trying to make fairly humble generations, like an image with a 16:9 aspect ratio (which is important for making videos).
Basujinal's fork uses drastically less VRAM, allowing larger images to be generated.
However, his code is nigh impenetrable.
Thus, a developer either has to choose between two flawed options:
- Opting for easy development using huggingface's philosophy of clean and simple code, but being plagued by CUDA out-of-memory failures.
- Opting for being free of VRAM worries, but be reduced to glacially slow development, due to having to rely on buggy and esoteric code.
This sucks! If someone were to replicate basujinal's "optimized" SD modifications, using Hugging Face's Contribution philosophy, we could have the best of both worlds: clean and easy development and no VRAM worries.
I'd put my money on VRAM shortages being the primary reason why devs choose not to use huggingface/diffusers. This is tragic, since I think clean code led by Hugging Face is the future. We need a Hugging Face "optimized" pipeline.