[feature]: support FP8 communication in pipeline parallelism by BurkeHulk · Pull Request #5885 · hpcaitech/ColossalAI

BurkeHulk · 2024-07-04T12:46:23Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs
I have installed pre-commit: pip install pre-commit && pre-commit install

🚨 Issue number

📝 What does this PR do?

Implement per-channel scaling (in PyTorch) for FP8 quantization.
Support PyTorch native FP8 formats.
Refer to:
https://pytorch.org/docs/stable/tensors.html#id7
https://arxiv.org/pdf/2209.05433
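
As an illustration of what per-channel scaling means here, the following is a minimal pure-Python sketch, not the PR's `cast_to_fp8` implementation (which is not shown on this page). It assumes each row of a 2-D list is one channel, uses the largest finite values of the two PyTorch native FP8 formats (448 for `torch.float8_e4m3fn`, 57344 for `torch.float8_e5m2`), and simulates E4M3 rounding with a simplified helper. All function names below are hypothetical.

```python
import math

# Largest finite magnitudes of torch.float8_e4m3fn and torch.float8_e5m2.
FP8_MAX = {"e4m3": 448.0, "e5m2": 57344.0}


def per_channel_scales(rows, fmt="e4m3"):
    """Return one scale per row so that the row's max |x| maps onto the FP8 max."""
    fp8_max = FP8_MAX[fmt]
    scales = []
    for row in rows:
        amax = max(abs(v) for v in row)
        # An all-zero channel needs no scaling; 1.0 avoids division by zero.
        scales.append(fp8_max / amax if amax > 0 else 1.0)
    return scales


def round_to_e4m3(x):
    """Simplified E4M3 rounding: 3 mantissa bits, min normal exponent -6, saturating."""
    if x == 0.0:
        return 0.0
    _, e = math.frexp(abs(x))      # abs(x) == m * 2**e with 0.5 <= m < 1
    exp = max(e - 1, -6)           # floor(log2|x|), clamped for subnormals
    quantum = 2.0 ** (exp - 3)     # spacing between representable values
    q = round(abs(x) / quantum) * quantum
    return math.copysign(min(q, FP8_MAX["e4m3"]), x)


def quantize(rows):
    """Scale each channel, then round every value to the simulated E4M3 grid."""
    scales = per_channel_scales(rows, "e4m3")
    q = [[round_to_e4m3(v * s) for v in row] for row, s in zip(rows, scales)]
    return q, scales


def dequantize(q, scales):
    """Undo the per-channel scaling."""
    return [[v / s for v in row] for row, s in zip(q, scales)]


# Two channels with very different magnitudes each get their own scale.
activations = [[0.001, -0.002, 0.0005], [100.0, -250.0, 30.0]]
quantized, channel_scales = quantize(activations)
restored = dequantize(quantized, channel_scales)
```

With a single per-tensor scale, the first channel above would be dominated by the second and lose most of its precision; a scale per channel keeps both within the representable range. The scales have to travel alongside the FP8 payload so the receiver can dequantize.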

Fine-tuning task accuracy:

| Pipeline parallelism | BERT (4 GPUs) | GPT2 (4 GPUs) |
| --- | --- | --- |
| FP16 | 0.8418 | 0.7039 |
| FP8-auto | 0.8435 | 0.7016 |

BurkeHulk added 2 commits July 1, 2024 13:44

fp8 operators for compressed communication

f5a52e1

cast_to_fp8, cast_from_fp8, all_reduce_fp8

Merge branch 'hpcaitech:main' into feature/fp8_comm

6991819

BurkeHulk requested a review from a team as a code owner July 4, 2024 12:46

[pre-commit.ci] auto fixes from pre-commit.com hooks

e17f835

for more information, see https://pre-commit.ci

GuangyaoZhang reviewed Jul 8, 2024

Comment thread colossalai/quantization/fp8.py Outdated

fix typo

dbfa7d3

ver217 reviewed Jul 10, 2024

Comment thread colossalai/quantization/fp8.py Outdated

Comment thread colossalai/quantization/fp8.py Outdated

Comment thread colossalai/quantization/fp8.py Outdated

BurkeHulk and others added 5 commits July 12, 2024 15:23

fix scaling algorithm in FP8 casting

1e19594

support fp8 communication in pipeline parallelism

e881901

add fp8_communication flag in the script

6601874

Merge remote-tracking branch 'origin/feature/fp8_comm' into feature/f…

1f1b856

…p8_comm # Conflicts: # colossalai/quantization/fp8.py

[pre-commit.ci] auto fixes from pre-commit.com hooks

51f916b

for more information, see https://pre-commit.ci

BurkeHulk enabled auto-merge July 16, 2024 03:21

BurkeHulk changed the title from "Feature/fp8 comm" to "[feature]: support FP8 communication in pipeline parallelism" Jul 16, 2024

This was linked to issues Jul 16, 2024

[FEATURE]: [PyTorch] per-channel FP8 quantization #5873

Closed

[Feature]: [PyTorch] FP8 all-reduce using all-to-all and all-gather #5886

Closed

ver217 approved these changes Jul 16, 2024

BurkeHulk merged commit 9470701 into hpcaitech:feature/fp8_comm Jul 16, 2024

BurkeHulk merged 9 commits into hpcaitech:feature/fp8_comm from BurkeHulk:feature/fp8_comm

3 participants
