Conversation
README.md
Outdated
Not only causal, but also chunked and local.
It's an upper bound - it doesn't mean it's actually achievable, right?
Yes, should we add something like this: "While achieving 100% is not practical due to many factors, the MFU score effectively shows how much room is left for optimization."
Done!
Note we've gotten 70% MFU before on v5p, and I've heard of 80%+ MFU (even in bf16, probably also on v5p), so it's theoretically possible to get pretty close.
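For context on the numbers being discussed, MFU is typically computed as the achieved model FLOPs per second divided by the hardware's peak FLOPs per second. A minimal sketch (the function name and arguments are illustrative, not MaxText's actual API):

```python
def mfu(model_flops_per_step: float, step_time_s: float,
        peak_hw_flops_per_s: float) -> float:
    """Model FLOPs utilization: achieved model FLOPs/s over peak hardware FLOPs/s."""
    achieved_flops_per_s = model_flops_per_step / step_time_s
    return achieved_flops_per_s / peak_hw_flops_per_s

# E.g. a step doing 7e14 model FLOPs in 1 s on hardware with a 1e15 FLOP/s
# peak gives an MFU of 0.7, i.e. the 70% figure mentioned above.
utilization = mfu(model_flops_per_step=7e14, step_time_s=1.0,
                  peak_hw_flops_per_s=1e15)
```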
Do we want to say anything more about local or chunked attention?
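One way to frame local (sliding-window) attention in the same FLOP-counting terms: each query attends to at most `window` keys instead of all `seq_len` keys, so the quadratic term shrinks proportionally. A rough sketch under that assumption (names are hypothetical, not from the repo):

```python
def local_attention_flop_fraction(seq_len: int, window: int) -> float:
    """Rough fraction of full-attention FLOPs used by sliding-window attention.

    Assumes each query attends to at most `window` keys, so the S*S term
    in the attention FLOP count becomes roughly S*min(window, S).
    """
    attended_keys = min(window, seq_len)
    return attended_keys / seq_len
```

So a 1024-token window over a 4096-token sequence would use roughly a quarter of the full-attention FLOPs; chunked attention would admit a similar per-chunk accounting.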
README.md
Outdated
dividing the attention the flops -> dividing the attention flops (no the)
Description
Add a README section discussing model FLOPs utilization (MFU): its definition and how we report it.
We may want to add further sections in the future (e.g. hardware utilization or memory usage).
This is meant to help clarify the recent change to our attention FLOP calculation (accounting for causality) in #1988.
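The causality accounting referenced above can be sketched as follows: with a causal mask, each query attends only to itself and earlier positions, so roughly half of the score matrix is masked out and the attention FLOPs are about half those of full attention. This is an illustrative sketch, not the repo's actual implementation:

```python
def attention_flops(batch: int, seq_len: int, num_heads: int,
                    head_dim: int, causal: bool = True) -> float:
    """Approximate attention FLOPs; halved under a causal mask.

    QK^T and the attention-weighted V matmul each cost about
    2 * B * H * S * S * D FLOPs (multiply + add), giving the factor of 4.
    """
    full = 4 * batch * num_heads * seq_len * seq_len * head_dim
    # Causal masking zeroes roughly half the S x S score matrix.
    return full / 2 if causal else full
```

The halving is itself an approximation (the diagonal makes it slightly more than half), which is part of why the README frames MFU relative to an upper bound rather than an exact figure.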
Note for reviewers: click on "Display rich diff" to see the resulting markdown: https://screenshot.googleplex.com/9WxhjW8EV6PWJ9B
Tests
N/A (README-only change)
Checklist
Before submitting this PR, please make sure (put X in square brackets):