We should provide support to use fused normalization kernels in shardformer.
We should provide support to use fused normalization kernels in shardformer.