Possible Improvements for Current Repository

Upon thorough examination of the repository, we believe it could be enriched by the introduction of additional features. These enhancements aim to augment the repository's functionality and extend the available modules for the MONAI user community. The suggested enhancements are detailed below.

- We propose the development of varied conditional encoder modules, as depicted in the original latent-diffusion [repository](https://github.com/CompVis/latent-diffusion/blob/main/ldm/modules/encoders/modules.py), for the generation of N-Dimensional medical images. The prospective supplementary modules are outlined as follows:
  - ClassEmbedder
  - TransformerEmbedder
  - BERTTokenizer
  - BERTEmbedder
  - SpatialRescaler
  - FrozenCLIPTextEmbedder
  - FrozenClipImageEmbedder, etc.
  - Furthermore, it is crucial to incorporate comprehensive tutorials for each newly implemented encoder.
- Consider refining the implementation of latent-diffusion to accommodate various condition types. Currently, it exclusively supports ["cross-attention"](https://github.com/Project-MONAI/GenerativeModels/blob/main/generative/networks/nets/diffusion_model_unet.py#L1000). We propose the inclusion of two or more additional options to enhance the system's capabilities.
  1. [concat](https://github.com/CompVis/latent-diffusion/blob/main/ldm/models/diffusion/ddpm.py#L1405-L1407)
  2. [hybrid](https://github.com/CompVis/latent-diffusion/blob/main/ldm/models/diffusion/ddpm.py#L1411-L1414), etc.
- Suggested improvements relating to GPU.
  - Inclusion of activation checkpointing for memory optimization along with associated tutorials.
  - Integration of distributed model training, accompanied by relevant tutorials.
- Introduction of PyTorch `ConvTranspose` support in the decoder to prevent `int32` limitation on `torch.nn.functional.interpolate` for large tensors.
- We propose offering pre-trained diffusion model weights, accessible via the Cloud, for user integration within their specific applications, accompanied by a comprehensive demo or tutorial for ease of use.

We express keen interest in proceeding with comprehensive discussions concerning any of the items outlined above.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Possible Improvements for Current Repository #410

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Possible Improvements for Current Repository #410

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions