🐛 Describe the bug
I'm working on the DreamBooth example.
Calling gpc.set_seed(args.seed) raises the following error:
AssertionError: The seed for ParallelMode.DATA has been added
If I comment out that call, everything works fine.
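For illustration, here is a minimal sketch of how an assertion like this can arise. It assumes the seed for ParallelMode.DATA was already registered earlier (for example during distributed initialization), which is a guess and not something confirmed in this report. SeedRegistry, set_seed_once, and second_call_fails are hypothetical names, and the ParallelMode enum below is a stand-in, not the library's own class.

```python
from enum import Enum


class ParallelMode(Enum):
    # Hypothetical stand-in for the library's enum; only DATA is modelled.
    DATA = "data"


class SeedRegistry:
    """Hypothetical registry that lets each parallel mode be seeded only once."""

    def __init__(self):
        self._seeds = {}

    def add_seed(self, mode, seed):
        # Reject a second registration, using the wording of the reported error.
        if mode in self._seeds:
            raise AssertionError(f"The seed for {mode} has been added")
        self._seeds[mode] = seed

    def set_seed_once(self, mode, seed):
        # Register the seed only if the mode has none yet; return True if added.
        if mode in self._seeds:
            return False
        self.add_seed(mode, seed)
        return True


def second_call_fails(seed):
    # Register the same mode twice and return the resulting error message.
    registry = SeedRegistry()
    registry.add_seed(ParallelMode.DATA, seed)
    try:
        registry.add_seed(ParallelMode.DATA, seed)
    except AssertionError as err:
        return str(err)
    return None
```

If this reading is right, the seed only needs to be set in one place: either at initialization or through the explicit call, but not both. The set_seed_once helper shows one guarded way to express that.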
To Reproduce
I'm using the command below so that multi-machine training can be supported later. For this test, however, I only ran it on a single machine with 8 V100 GPUs.
python -m torch.distributed.run --nproc_per_node=$GPU_NUM --nnodes=$WORLD_SIZE \
--node_rank=$RANK --master_addr=$MASTER_ADDR --master_port=$MASTER_PORT ./train_dreambooth_colossalai.py
Expected behavior
The seed should be set correctly, without raising an error.
Environment
No response