
Conversation

@jonahsamost

Add bf16 mixed precision training. A config option lets you train with or without it.

How it works: the optimizer keeps fp32 master weights, while the forward/backward passes run on bf16 weights. The bf16 weights are synced from the fp32 master weights after each optimizer step.

Some tensors are kept in fp32 for training stability (the advantages and advantage statistics).
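
Below is a minimal PyTorch sketch of the master-weight scheme described above. The helper names, the `use_bf16` flag, and `compute_loss` are assumptions for illustration, not the actual PufferLib API.

```python
import torch

def init_mixed_precision(model, use_bf16=True):
    """Create fp32 master copies of the parameters for the optimizer and,
    if enabled, cast the live (forward/backward) weights to bf16."""
    master_params = [p.detach().clone().float() for p in model.parameters()]
    if use_bf16:
        model.to(torch.bfloat16)
    return master_params

def copy_grads_to_master(model, master_params):
    """Accumulate the bf16 gradients into the fp32 master copies before
    the optimizer step."""
    for p, master in zip(model.parameters(), master_params):
        if p.grad is not None:
            master.grad = p.grad.detach().float()

def sync_bf16_from_master(model, master_params):
    """After the optimizer updates the fp32 master weights, copy them back
    into the bf16 training weights."""
    with torch.no_grad():
        for p, master in zip(model.parameters(), master_params):
            p.copy_(master.to(p.dtype))


# Hypothetical update step: the optimizer only ever sees the fp32 master
# weights, and the advantages stay in fp32 for stability.
# `model`, `batch`, `advantages`, and `compute_loss` are placeholders.
master_params = init_mixed_precision(model, use_bf16=True)
optimizer = torch.optim.Adam(master_params, lr=3e-4)

advantages = advantages.float()                 # kept in fp32
loss = compute_loss(model, batch, advantages)   # bf16 forward
loss.backward()                                 # bf16 backward
copy_grads_to_master(model, master_params)
optimizer.step()                                # fp32 update
optimizer.zero_grad(set_to_none=True)
model.zero_grad(set_to_none=True)
sync_bf16_from_master(model, master_params)     # refresh bf16 weights
```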

@jsuarez5341 merged commit 9e1f1b4 into PufferAI:4.0 on Jan 31, 2026
0 of 12 checks passed