The loss often becomes NaN when I run run_model_train, and I haven't found a solution yet. Have you ever run into this?