Skip to content

fix TFLOPs calculation#350

Closed
polisettyvarma wants to merge 1 commit intodeepspeedai:mainfrom
polisettyvarma:main
Closed

fix TFLOPs calculation#350
polisettyvarma wants to merge 1 commit intodeepspeedai:mainfrom
polisettyvarma:main

Conversation

@polisettyvarma
Copy link
Copy Markdown

when GQA used, we observe right TFLOPs after this fix.
huge difference in TFLOPs is solved for selective recompute when GQA is not used.
some other minor difference will also be observed as logits macs also added.

when GQA used, we observe right TFLOPs after this fix.
huge difference in TFLOPs is solved for selective recompute when GQA is not used.
some other minor difference will also be observed as logits macs also added.
@polisettyvarma
Copy link
Copy Markdown
Author

same PR in another PR - #371

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant