RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Efficiency, High-Resolution Image Generation
🔥🔥🔥 RepLDM is a training-free method for higher-resolution image generation, enabling the 8k image generation! You can freely adjust the richness of colors and details in the generated image through attention guidance.
- SDXL Based
- Text to Image
- [ √ ] RepLDM
- [ √ ] FreeScale + AttentionGuidance
- +ControlNet
- [ √ ] RepLDM
- [___] FreeScale + AttentionGuidance
- Text to Image
- FLUX Based
- Text to Image
- [___] RepLDM
- Text to Image
- SD3 Based
- Text to Image
- [___] RepLDM
- Text to Image
- [___] Web UI
conda create -n repldm python=3.9
conda activate repldm
pip install -e .
TODO
TODO
RepLDM enables the rapid synthesis of high-quality, high-resolution images without the need for further training.
It consists of two stages:
- Synthesizing high-quality images at the training resolution using Attention Guidance.
- Generating finer high-resolution images through pixel upsampling and "diffusion-denoising" loop.
Attention Guidance enables the generation of images with more vivid colors and richer details, as shown in the figure below.
Attention Guidance can be used in conjunction with plugins such as ControlNet to achieve an enhanced visual experience, as illustrated in the figure below.
Attention Guidance allows users to freely adjust the level of detail and color richness in an image according to their preferences, simply by modifying the attention guidance scale, as shown in the figure below.
Attention Guidance computes layout-enhanced representations using a training-free self-attention (TFSA) mechanism and leverages them to strengthen layout consistency:
where
The implementation in the main branch includes some modifications based on the original version. If you want to compare with the original method reported in the paper, please refer to the code in the base branch.
@inproceedings{caorepldm,
title={RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Efficiency, High-Resolution Image Generation},
author={Cao, Boyuan and Ye, Jiaxin and Wei, Yujie and Shan, Hongming},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems}
}





