Tensor parallelism is not suitable for cases, e.g. small model training. We can set it to be optional.
Tensor parallelism is not suitable for cases, e.g. small model training. We can set it to be optional.