[zero] support no_sync #4128

@Gy-Lu

Currently, ZeRO-1 with gradient accumulation is not available through the plugin, which means users cannot use the booster together with ZeRO-1 and gradient accumulation (zero1+ga).

We can just use self.require_grad_sync = True instead of an interval. Moreover, support for plugin.no_sync() is needed as well.
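
To make the proposal concrete, here is a minimal pure-Python sketch of how a require_grad_sync flag and a no_sync() context manager could interact during gradient accumulation. The names ToyZeroOptimizer, ToyPlugin, and run_accumulation are hypothetical and only illustrate the idea; this is not the actual plugin or optimizer code, and the "sync" step is a placeholder for a real cross-rank gradient reduction.

```python
from contextlib import contextmanager


class ToyZeroOptimizer:
    """Hypothetical stand-in for a ZeRO-1 optimizer wrapper (illustration only)."""

    def __init__(self):
        # When True, every backward pass triggers a gradient sync.
        self.require_grad_sync = True
        self.local_grad = 0.0    # gradient accumulated locally
        self.synced_grad = None  # result of the last (simulated) sync
        self.sync_count = 0      # how many syncs were performed

    def backward(self, grad):
        # Accumulate the micro-batch gradient, then sync only if allowed.
        self.local_grad += grad
        if self.require_grad_sync:
            self._sync_grads()

    def _sync_grads(self):
        # Placeholder for reducing gradients across data-parallel ranks.
        self.synced_grad = self.local_grad
        self.sync_count += 1


class ToyPlugin:
    """Hypothetical plugin exposing no_sync() on top of the flag."""

    def __init__(self, optimizer):
        self.optimizer = optimizer

    @contextmanager
    def no_sync(self):
        # Temporarily disable gradient sync, restoring the old value on exit.
        previous = self.optimizer.require_grad_sync
        self.optimizer.require_grad_sync = False
        try:
            yield
        finally:
            self.optimizer.require_grad_sync = previous


def run_accumulation(micro_grads):
    """Accumulate all but the last micro-batch without syncing."""
    optimizer = ToyZeroOptimizer()
    plugin = ToyPlugin(optimizer)
    for grad in micro_grads[:-1]:
        with plugin.no_sync():
            optimizer.backward(grad)
    # The final micro-batch runs outside no_sync(), so it triggers one sync.
    optimizer.backward(micro_grads[-1])
    return optimizer.sync_count, optimizer.synced_grad
```

In this sketch the user decides when to sync by choosing which backward passes run inside no_sync(), so no accumulation interval needs to be stored on the optimizer.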

Status

✅ Done