Skip to content

[docs] optimizers, hyperparam search, training features#44290

Merged
stevhliu merged 7 commits intohuggingface:mainfrom
stevhliu:training-part-3
Apr 9, 2026
Merged

[docs] optimizers, hyperparam search, training features#44290
stevhliu merged 7 commits intohuggingface:mainfrom
stevhliu:training-part-3

Conversation

@stevhliu
Copy link
Copy Markdown
Member

  • adds trainer_recipes.md to show how to use other practical Trainer features outside of the basic training loop
  • updates hyperparam search docs
  • updates optimizers.md with how to customize it (prebuilt instances, passing a class + kwargs, subclassing)

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@stevhliu stevhliu requested a review from SunMarc February 26, 2026 01:40
Copy link
Copy Markdown
Member

@SunMarc SunMarc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for these, this is already much better, left a couple of comments

Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/optimizers.md
Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/optimizers.md Outdated
Comment thread docs/source/en/trainer_recipes.md Outdated
Comment thread docs/source/en/trainer_recipes.md Outdated
@@ -0,0 +1,255 @@
<!--Copyright 2026 The HuggingFace Team. All rights reserved.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we can keep these here for now, but i feel like they can be moved in other sections. Trainer receipe name don't fully represent what we have here. Maybe we can think of another name ? I feel like we could create another kind of recipes here.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm yeah a recipe is more of a contained end-to-end pattern. i'll rename this to "Trainer features" which i think is more accurate of what is described here. this would free up Trainer recipes for actual recipes later

@stevhliu stevhliu requested a review from SunMarc March 16, 2026 20:24
@stevhliu
Copy link
Copy Markdown
Member Author

i updated the scheduler docs to include GreedyLR from #44271 (sorry i missed your comment!)

@stevhliu
Copy link
Copy Markdown
Member Author

stevhliu commented Apr 2, 2026

updated with flashoptim from #44438!

Copy link
Copy Markdown
Member

@SunMarc SunMarc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot !

@stevhliu stevhliu added this pull request to the merge queue Apr 9, 2026
Merged via the queue into huggingface:main with commit f965b10 Apr 9, 2026
16 checks passed
@stevhliu stevhliu deleted the training-part-3 branch April 9, 2026 20:00
sirzechs66 pushed a commit to sirzechs66/transformers that referenced this pull request Apr 18, 2026
…44290)

* optimizer + scheduler

* hyperparam

* training features

* feedback

* fix

* greedylr

* flashoptim
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants