3D medical image classification repository

_{Copyright German Cancer Research Center (DKFZ) and contributors. Please make sure that your usage of this code is in compliance with its license.}

Welcome to this 3D medical image classification repository. The repository builds up on the IMAGE CLASSIFICATION FRAMEWORK BY HELMHOLTZ IMAGING. This repository was extended to allow fine-tuning checkpoints from this repository: nnssl.

Installation

Requirements

Install the requirements in a virtual environment by:

pip install -r requirements.txt

You might need to adapt the cuda versions for torch and torchvision. Find a torch installation guide for your system here.

Dataset preprocessing

Currently, preprocessing is highly dataset- and user-dependent. However in this file you can find examples of how a dataset can be preprocessed.

For the SSL3D challenge we will resample all images towards a 1mm target spacing and then crop the center of the image with an 160 cubic block.

Including other datasets

For including your own dataset follow these steps:

In the dataset directory create a new file that implements the torch dataset class for your data. See example.
Additionally, create the DataModule for your dataset by writing a class that inherits from BaseDataModule. Write the init and setup functions for your dataset. The dataloaders are already defined by the BaseDataModule. An example could look like this:
```
from .base_datamodule import BaseDataModule

class CustomDataModule(BaseDataModule):
  def __init__(self, **params):
      super(CustomDataModule, self).__init__(**params)

  def setup(self, stage: str):
      self.train_dataset = YourCustomPytorchDataset(
          data_path=self.data_path,
          split="train",
          transform=self.train_transforms,
      )
      self.val_dataset = YourCustomPytorchDataset(
          data_path=self.data_path,
          split="val",
          transform=self.test_transforms,
      )
```
Note that the __init__ function takes **params and passes them to the super init. By doing so the attributes self.data_path, self.train_transforms and self.test_transforms are already set automatically and can be used in the setup function. The self.data_path is a joined path consisting of the configs data.module.data_root_dir and data.module.name. Custom transforms can be added in ./augmentation/policies/<your-data>.py. They need to inherit from the BaseTransform class. See the existing transforms for examples!

Add a <your-data>.yaml file to the data config group, defining some data-specific variables.

# @package _global_
data:
  module:
    _target_: datasets.RECvsT_1mm_cropped_160.RECvsT_1mm_cropped_160_DataModule
    name: RECvsT_1mm_cropped_160
    data_root_dir: ${data_dir}
    batch_size: 1
    train_transforms:
    _target_: augmentation.policies.batchgenerators.get_training_transforms
    patch_size: ${data.patch_size}
    rotation_for_DA: 0.523599
    mirror_axes: [0,1,2]
    do_dummy_2d_data_aug: False
    test_transforms: null
  cv:
    k:3

  num_classes: 2
  patch_size: [160, 160, 160]

model:
  task: 'Classification'
  cifar_size: False
  input_channels: 2
  input_dim: 3
  input_shape: ${data.patch_size}
  optimizer: AdamW
  lr: 0.0001
  warmstart: 20
  weight_decay: 1e-2
  label_smoothing: 0.2

trainer:
logger:
  project: RECvsT_1mm_cropped_160
accumulate_grad_batches: 48
max_epochs: 400
sync_batchnorm: True

metrics:
- 'f1'
- 'balanced_acc'
- 'ap'
- 'auroc'

The data.module._target_ defines the path to your DataModule. Note that the first line of the file needs to be # @package _global_ in order for Hydra to read the config properly.

Training

Primus-M

Fine-tuning:

python main.py env=cluster model=primus data=Datasetname trainer.devices=1 model.pretrained=True model.chpt_path=<path/to/checkpoint>

Training from scratch:

python main.py env=cluster model=primus data=Datasetname trainer.devices=1 model.pretrained=False

ResEnc-L

Fine-tuning:

python main.py env=cluster model=resenc data=Datasetname trainer.devices=1 model.pretrained=True model.chpt_path=<path/to/checkpoint>

Training from scratch:

python main.py env=cluster model=resenc data=Datasetname trainer.devices=1 model.pretrained=False

If you use this codebase, please cite:

   @misc{Openmind,
   title={An OpenMind for 3D medical vision self-supervised learning},
   author={Tassilo Wald and Constantin Ulrich and Jonathan Suprijadi and Sebastian Ziegler and Michal Nohel and Robin Peretzke and Gregor Köhler and Klaus H. Maier-Hein},
   year={2025},
   eprint={2412.17041},
   archivePrefix={arXiv},
   primaryClass={cs.CV},
   url={https://arxiv.org/abs/2412.17041},
   }

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
augmentation		augmentation
cli_configs		cli_configs
datasets		datasets
metrics		metrics
models		models
regularization		regularization
.gitignore		.gitignore
LICENCE		LICENCE
README.md		README.md
base_model.py		base_model.py
inference.py		inference.py
main.py		main.py
parsing_utils.py		parsing_utils.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

3D medical image classification repository

Installation

Requirements

Dataset preprocessing

Including other datasets

Training

Primus-M

ResEnc-L

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

constantinulrich/SSL3D_classification

Folders and files

Latest commit

History

Repository files navigation

3D medical image classification repository

Installation

Requirements

Dataset preprocessing

Including other datasets

Training

Primus-M

ResEnc-L

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages