Replace the dtype to fp8 in communication is useful For bandwidth restricted scenario. Please support the fp8 communication in Shardformer(SP and TP)
Replace the dtype to fp8 in communication is useful For bandwidth restricted scenario. Please support the fp8 communication in Shardformer(SP and TP)