Great work! When I trained on the MSD task007 dataset, the validation mean_dice reached 0.6 during training, but the test mean_dice after training was only 0.49. The Dice scores for the pancreas and tumor regions also did not match the values reported in the paper. I set the number of training iterations to 15,000. How many training iterations would you recommend?