I think it's better to show epoch loss, otherwise hard to know if it's converged or not.
I think it's better to show epoch loss, otherwise hard to know if it's converged or not.