Overview
This work should start after #3148. Once that is done, we will be able to create a model with lazy initialization and sharding. We then have to verify correctness for the distributed training cases (tensor parallel or ZeRO-3).
Wanna track the development progress? Take a look at
proposal: #3124
kanban: Lazy initialization
Goal
Verify the correctness of lazy init for distributed cases.
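As a rough illustration of what such a check could look like, here is a toy, framework-free sketch. It is not the project's API: `eager_init`, `LazyParam`, `materialize_shard`, and `check_lazy_matches_eager` are hypothetical names, and the parameter is modeled as a flat list of floats split into contiguous chunks per rank. The idea is only that the shards produced by lazy initialization, gathered across all ranks, should reproduce the eagerly initialized parameter exactly.

```python
import random


def eager_init(seed, numel):
    """Eagerly build the full parameter as a flat list of floats."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(numel)]


class LazyParam:
    """Hypothetical stand-in for a lazily initialized parameter.

    It only records how to initialize; nothing is allocated until
    a shard is materialized.
    """

    def __init__(self, seed, numel):
        self.seed = seed
        self.numel = numel

    def materialize_shard(self, rank, world_size):
        # Replay the recorded init, then keep only this rank's contiguous chunk.
        # (Toy assumption: replaying the full init is acceptable here; a real
        # implementation would avoid allocating the full tensor on every rank.)
        full = eager_init(self.seed, self.numel)
        chunk = (self.numel + world_size - 1) // world_size  # ceil division
        return full[rank * chunk:(rank + 1) * chunk]


def check_lazy_matches_eager(seed, numel, world_size):
    """Gather the shards from every rank and compare with the eager result."""
    reference = eager_init(seed, numel)
    lazy = LazyParam(seed, numel)
    gathered = []
    for rank in range(world_size):
        gathered.extend(lazy.materialize_shard(rank, world_size))
    return gathered == reference


if __name__ == "__main__":
    # Includes a size that does not divide evenly across ranks.
    for numel, world_size in [(8, 2), (10, 4), (2, 4)]:
        assert check_lazy_matches_eager(seed=0, numel=numel, world_size=world_size)
```

A real test would run the same comparison on actual model parameters under tensor parallel and ZeRO-3, with the sharding layout taken from the strategy under test rather than the contiguous split assumed here.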