
Drop noisy generate warnings when do_sample=False (or num_beams=1) #45559

Closed
ArthurZucker wants to merge 1 commit into huggingface:main from ArthurZucker:drop-sampling-flag-warnings

Conversation

Collaborator

@ArthurZucker ArthurZucker commented Apr 22, 2026

Summary

GenerationConfig.validate() was warning about sampling-only flags (temperature, top_p, top_k, min_p, top_h, typical_p, epsilon_cutoff, eta_cutoff) whenever do_sample was not True, and about beam-only flags (early_stopping, length_penalty) whenever num_beams == 1 -- even when those values were inherited from the model's generation_config.json. In practice, nearly every popular Hub model ships with a non-default temperature/top_p, so users got a warning for every generate(do_sample=False) call.

This PR threads a user_set_attributes set through GenerationConfig.__init__ and update(), so validate() can distinguish attributes the caller explicitly provided from values inherited from the model's default config. The sampling-only and beam-only warnings now only fire for user-set attributes.

Behavior

  • generate(do_sample=False) on a model whose hub config has temperature=0.6, top_p=0.9: silent (values inherited, not user intent).
  • generate(do_sample=False, top_p=0.8): warns about top_p (user explicitly set both -- almost certainly a mistake).
  • GenerationConfig(do_sample=False, temperature=0.5): warns (both explicit).
  • generation_config.validate(strict=True) (e.g. from save_pretrained) with no user_set_attributes preserves the original "refuse to save bad configs" behavior.
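The provenance logic above can be sketched with a tiny stand-in class. `TinyGenerationConfig` and its default values are illustrative inventions (not the real transformers implementation); only the `user_set_attributes` / `_is_user_set` names follow the PR description.

```python
# Minimal sketch of the user-set provenance tracking described in this PR.
# NOT the actual transformers code; defaults mimic a hub generation_config.json.
import warnings


class TinyGenerationConfig:
    SAMPLING_ONLY_FLAGS = ("temperature", "top_p", "top_k")

    def __init__(self, **kwargs):
        # Pretend these were loaded from the model's generation_config.json.
        self.do_sample = False
        self.temperature = 0.6
        self.top_p = 0.9
        self.top_k = 50
        # Provenance: only attributes the caller passed explicitly.
        self.user_set_attributes = set(kwargs)
        for name, value in kwargs.items():
            setattr(self, name, value)
        self.validate()

    def update(self, **kwargs):
        # kwargs forwarded from a generate() call also count as user-set.
        self.user_set_attributes |= set(kwargs)
        for name, value in kwargs.items():
            setattr(self, name, value)
        self.validate()

    def validate(self):
        def _is_user_set(name):
            return name in self.user_set_attributes

        # Warn only when `do_sample` AND the sampling flag are both user-set.
        if self.do_sample is not True and _is_user_set("do_sample"):
            for flag in self.SAMPLING_ONLY_FLAGS:
                if _is_user_set(flag):
                    warnings.warn(
                        f"`do_sample` is set to `{self.do_sample}`. However, `{flag}` is "
                        "set -- this flag is only used in sample-based generation modes."
                    )
```

Under this sketch, `TinyGenerationConfig(do_sample=False)` is silent (sampling values inherited), `TinyGenerationConfig(do_sample=False, top_p=0.8)` warns, and `TinyGenerationConfig().update(top_p=0.8)` is silent because `do_sample=False` was never user-set.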

cc @Cyrilvallez

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@zucchini-nlp zucchini-nlp left a comment


In general I agree, users don't want to see warnings if they just turn off sampling with one flag. Though I am not sure about the reverse case, when they pass sampling params and the default config has do_sample=False. Kinda misleading if we silently fall back to greedy generation, no?

Comment on lines -639 to +660
-        if self.do_sample is not True:
+        #
+        # The warning is suppressed for flags that weren't explicitly set by the caller (see `_is_user_set`): values
+        # inherited from a model's `generation_config.json` are harmless when the user opts into greedy decoding.
+        # We also require `do_sample` itself to be user-set -- otherwise the non-sampling mode was inherited and the
+        # user never expressed intent to skip sampling, so flagging their sampling kwargs would be misleading.
+        if self.do_sample is not True and _is_user_set("do_sample"):
             greedy_wrong_parameter_msg = (
-                "`do_sample` is set not to set `True`. However, `{flag_name}` is set to `{flag_value}` -- this flag is only "
-                "used in sample-based generation modes. You should set `do_sample=True` or unset `{flag_name}`."
+                "`do_sample` is set to `{do_sample}`. However, `{flag_name}` is set to `{flag_value}` -- this flag is "
+                "only used in sample-based generation modes. You should set `do_sample=True` or unset `{flag_name}`."
Member


great opportunity to move to hub-dataclass validation 😄

Comment on lines +182 to +188
+        # Inverse provenance case: `do_sample` inherited from a model's config (so not user-set this call), user only
+        # sets a sampling flag. The conflict shouldn't produce noise because the user never asked for greedy.
+        logger.warning_once.cache_clear()
+        greedy_hub_config = GenerationConfig(do_sample=False)  # mimics a model's default config forcing greedy
+        with CaptureLogger(logger) as captured_logs:
+            greedy_hub_config.update(top_p=0.8)
+        self.assertEqual(len(captured_logs.out), 0)
Member


I am not sure about this one. Beginner users might expect this to just work and sample with top-p, while we silently fall back to greedy.

Comment on lines 711 to 723
        # 2.4. check `num_return_sequences`
        if self.num_return_sequences is not None and self.num_return_sequences > 1:
            if self.num_beams is None or self.num_beams == 1:
                if not self.do_sample:
                    raise ValueError(
                        "Greedy methods (do_sample != True) without beam search do not support "
                        f"`num_return_sequences` different than 1 (got {self.num_return_sequences})."
                    )
            elif (
                self.num_beams is not None
                and self.num_return_sequences is not None
                and self.num_return_sequences > self.num_beams
            ):
Member


So the is-user-set check only applies to greedy decoding, with no changes for the other params?

@Cyrilvallez
Member

Completely messed up this branch by mistake 😅 Opened #45619 with the correct diffs



4 participants