-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Description
Describe the bug
When using a CheckpointSaver among the validation handlers, the argument save_final=True results in every epoch being stored under net_final_iteration=1.pt when final_filename=None. I interpret the docs (default to checkpoint_final_iteration=N.pt) that the file will be renamed after each completed epoch.
To Reproduce
add save_final=True to the CheckpointHandler in the Lung Segmentation Tutorial
Expected behavior
net_final_iteration=1.pt -> net_final_iteration=2.pt -> ... -> net_final_iteration=MAX_EPOCHS.pt
Screenshots
The logging function also prints incorrect iteration numbers:
INFO:ignite.engine.engine.SupervisedEvaluator:Train completed, saved final checkpoint: net_final_iteration=1.pt
...
INFO:ignite.engine.engine.SupervisedEvaluator:Train completed, saved final checkpoint: net_final_iteration=1.pt
...
INFO:ignite.engine.engine.SupervisedEvaluator:Train completed, saved final checkpoint: net_final_iteration=1.pt
Environment
Ensuring you use the relevant python executable, please paste the output of:
MONAI version: 0.3.0+95.g535561e
Python version: 3.8.6 | packaged by conda-forge | (default, Oct 7 2020, 18:42:56) [Clang 10.0.1 ]
OS version: Darwin (19.6.0)
Numpy version: 1.19.2
Pytorch version: 1.7.0
MONAI flags: HAS_EXT = False, USE_COMPILED = False
Optional dependencies:
Pytorch Ignite version: 0.4.2
Nibabel version: 3.2.0
scikit-image version: NOT INSTALLED or UNKNOWN VERSION.
Pillow version: 8.0.1
Tensorboard version: 2.4.0
gdown version: NOT INSTALLED or UNKNOWN VERSION.
TorchVision version: 0.8.1
ITK version: NOT INSTALLED or UNKNOWN VERSION.
tqdm version: 4.53.0
lmdb version: NOT INSTALLED or UNKNOWN VERSION.
For details about installing the optional dependencies, please visit:
https://docs.monai.io/en/latest/installation.html#installing-the-recommended-dependencies
Additional context
Using SupervisedEvaluator and SupervisedTrainer.