Overview
Currently, the way Low Level Zero in ColossalAI shards the optimizer states (OS) at initialization may lead to an unbalanced load across ranks. For instance, if a model has 5 parameter tensors (each containing many elements) and we run ZeRO-DP on 8 GPUs, then with the current implementation 3 GPUs hold no optimizer states at all. Thus, a sharding method that splits all parameters evenly is needed.
Goal
With the new sharding method, each rank should hold a similar total number of parameter elements.
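One way to achieve this balance is a greedy longest-processing-time (LPT) assignment: sort parameters by element count and repeatedly give the next one to the least-loaded rank. The sketch below is illustrative only; the function name `shard_params_evenly` and its inputs are assumptions, not ColossalAI's actual API, and it assigns whole tensors rather than splitting them.

```python
import heapq

def shard_params_evenly(numel_list, world_size):
    """Map each parameter index to a rank, balancing total element
    counts across ranks (greedy LPT heuristic).

    numel_list: number of elements in each parameter tensor.
    world_size: number of ranks in the ZeRO-DP group.
    """
    # Min-heap of (total elements assigned to rank, rank id).
    heap = [(0, rank) for rank in range(world_size)]
    heapq.heapify(heap)
    assignment = [None] * len(numel_list)
    # Place larger parameters first for a tighter balance.
    for idx in sorted(range(len(numel_list)), key=lambda i: -numel_list[i]):
        load, rank = heapq.heappop(heap)
        assignment[idx] = rank
        heapq.heappush(heap, (load + numel_list[idx], rank))
    return assignment
```

With 5 parameters and 8 ranks this still leaves 3 ranks empty (whole-tensor granularity cannot fix that case); splitting each tensor into `world_size` even chunks would be the stronger variant, at the cost of extra bookkeeping when gathering updated parameters.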