Skip to content

[shardformer] support layer state_dict and load_state_dict #4061

@FrankLeeeee

Description

@FrankLeeeee

We need to ensure that the weights of a distributed layer and that of a normal pytorch layer can be interchangeably loaded and saved in shardformer.

Metadata

Metadata

Assignees

Type

No type

Projects

Status

✅ Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions