Skip to content

[FEATURE]: gemini能否支持梯度累计 #4590

@zryowen123

Description

@zryowen123

Describe the feature

训练大模型,增大batchsize有助于训练的稳定性,对于特别大的模型batchsize大小受限,只能通过梯度累积的方式实现增大batchsize的目的,gemini能否实现支持梯度累积呢?

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions