You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
------------ Environment ------------
Colossal-AI version: 0.3.4
PyTorch version: 2.0.1
System CUDA version: 11.7
CUDA version required by PyTorch: 11.7
🐛 Describe the bug
环境
BUG 细节
微调代码修改自:https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/llama2/finetune.py
加载模型:https://github.com/hpcaitech/ColossalAI/blob/main/examples/language/llama2/finetune.py#L237
除了plugin的类型其余变量都保持一致,发现zero2时,loss的表现正常,而使用gemini时,更像是从一个随机初始化的weight进行优化
zero2,loss 正常从比较低的水平开始下降:

gemini,loss 从特别高的水平下降:

Environment
No response