🥓 [docs] add CP docs #3994
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
> **Option 1: Using SFTConfig**
> **With Wrapped Strategy:**
I recommend not mentioning "wrapped" in the documentation. I feel like this is a very advanced use, and it somewhat distracts from the message of this section.
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
I'll undo my change.
lewtun left a comment
Nice doc! Overall LGTM, with @qgallouedec's comments about avoiding the wrapped example.
> `from trl import SFTConfig`
>
> `training_args = SFTConfig(`
> `    max_seq_length=2048,  # Long sequence length`
Maybe we can make this truly long, like 16384 tokens?
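For intuition on why a truly long sequence motivates context parallelism: each CP rank holds only a slice of the sequence, so per-GPU activation memory scales with `seq_len / cp_size`. A minimal sketch in plain Python (the function name is illustrative, not part of the TRL API):

```python
def cp_shard_length(seq_len: int, cp_size: int) -> int:
    """Per-GPU sequence length under context parallelism.

    Each of the cp_size ranks holds a contiguous slice of the
    sequence, so activation memory per GPU grows with
    seq_len / cp_size rather than with the full seq_len.
    """
    if seq_len % cp_size != 0:
        raise ValueError("seq_len must be divisible by cp_size")
    return seq_len // cp_size

# With the 16384-token sequence suggested above on 8 GPUs,
# each rank materializes activations for only 2048 tokens.
print(cp_shard_length(16384, 8))  # -> 2048
```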
> #### Accelerate Configuration
> Create an accelerate config file (e.g. `context_parallel_config.yaml` for 2 GPUs):
I wonder if it makes sense to have a copy of this in accelerate_configs/fsdp2_cp.yaml so there's a standard reference people can work from? If you agree, I'd make it run on 8 GPUs, which is the default for our other configs.
OK, yes, good idea with 8 GPUs.
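For reference, a rough sketch of what such an 8-GPU FSDP2 accelerate config might look like. Only the standard accelerate keys are shown here; the CP-specific key names vary by accelerate version, so the `accelerate_configs/fsdp2_cp.yaml` file added in this PR should be treated as the authoritative copy.

```yaml
compute_environment: LOCAL_MACHINE
distributed_type: FSDP
fsdp_config:
  fsdp_version: 2  # context parallelism requires FSDP2
# context-parallel settings go here; see accelerate_configs/fsdp2_cp.yaml
# in the repo for the exact key names on your accelerate version
mixed_precision: bf16
num_machines: 1
num_processes: 8  # one process per GPU
```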
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
still needs huggingface/transformers#40619
Co-authored-by: Sergio Paniego Blanco <sergiopaniegoblanco@gmail.com>
@qgallouedec forgot to check in the yaml... try now
> #### Training Configuration
> You can configure context parallelism training either programmatically or via command line:
Just realized that this is not accurate, as you always have to pass the accelerate config. I'll refactor a bit.
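In other words, the accelerate config file is not optional: however the `SFTConfig` options are set, the launch always goes through it. A sketch of the invocation (the training script name is a placeholder; `context_parallel_config.yaml` is the file created above):

```shell
# --config_file is always required for CP runs; sft.py is illustrative
accelerate launch --config_file context_parallel_config.yaml sft.py
```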
What does this PR do?
Add example and docs for CP SFT training.