You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Feb 7, 2025. It is now read-only.
To maximise the shape that can go through the VAE or VQ-VAE, sometimes we must pick size shapes that result in latent space shapes that cannot go through the LDM for not being multiple of 2**(num levels on the unet).
To overcome this, a solution is to pad the VAE latent space and crop it back after sampling before passing it to the VAE - VQVAE. Having a simple MONAI transform on the inferer would be enough.