### Describe the feature

In `Shardformer`, all layers accept a process group as an argument for distributed operations, so we should remove the dependency on `gpc`.
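
To make the request concrete, here is a minimal sketch of the pattern: the layer receives its process group from the caller and never reads it from a global context. Everything in it is hypothetical and only for illustration. `FakeProcessGroup` stands in for a real process group handle, and `ColumnShardedLinear` is not an existing `Shardformer` layer.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class FakeProcessGroup:
    """Stand-in for a real process group handle (hypothetical, for illustration)."""

    ranks: Tuple[int, ...]

    def size(self) -> int:
        return len(self.ranks)


class ColumnShardedLinear:
    """Hypothetical layer that splits its output features across one group."""

    def __init__(self, in_features: int, out_features: int,
                 process_group: FakeProcessGroup):
        if out_features % process_group.size() != 0:
            raise ValueError("out_features must be divisible by the group size")
        # The group is injected by the caller; nothing is read from a global context.
        self.process_group = process_group
        self.in_features = in_features
        self.out_features_per_rank = out_features // process_group.size()


# Two layers can use different groups in the same process,
# which is awkward when the group comes from a single global object.
tp_group = FakeProcessGroup(ranks=(0, 1, 2, 3))
small_group = FakeProcessGroup(ranks=(0, 1))

layer_a = ColumnShardedLinear(16, 32, process_group=tp_group)
layer_b = ColumnShardedLinear(16, 32, process_group=small_group)

print(layer_a.out_features_per_rank)
print(layer_b.out_features_per_rank)
```

With the group passed in explicitly, a layer can be constructed and tested without initializing any global state, and different layers can be sharded over different groups.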