Generate: TF uses GenerationConfig as the basis for .generate() parametrization #20994
gante merged 8 commits into huggingface:main
```python
    if return_dict_in_generate is not None
    else self.generation_config.return_dict_in_generate
)
use_cache = model_kwargs.pop("use_cache", self.generation_config.use_cache)
```
This pattern, pulling `use_cache` out of `model_kwargs`, is the only new modification compared to the equivalent PR on the PT side.

It also fixes a previously undetected bug: when `use_cache` is set in `model_kwargs`, the XLA-compiled loop converts it to a `tf.bool` tensor, because it becomes part of the `tf.while_loop` input/output variables and is therefore assumed to be dynamic at runtime. Once converted to a `tf.bool`, XLA compilation fails on lines like `present_key_values = () if use_cache else None`, which require `use_cache` to be a static value.

Workaround: pull `use_cache` out of `model_kwargs` and pass the corresponding static variable into the `tf.while_loop`.
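A minimal plain-Python sketch of the workaround (function and variable names here are hypothetical, not the actual `TFGenerationMixin` internals): popping `use_cache` out of `model_kwargs` keeps it as a static Python bool, so it never joins the `tf.while_loop` carried state and conditionals like `() if use_cache else None` stay resolvable at trace time.

```python
# Stand-in for self.generation_config.use_cache
GENERATION_CONFIG_USE_CACHE = True

def prepare_generation_kwargs(model_kwargs):
    # Popping removes use_cache from the kwargs that flow through the
    # XLA-compiled loop, so it remains a static Python bool rather than
    # being traced into a dynamic tf.bool tensor.
    use_cache = model_kwargs.pop("use_cache", GENERATION_CONFIG_USE_CACHE)
    return use_cache, model_kwargs

use_cache, kwargs = prepare_generation_kwargs(
    {"use_cache": False, "attention_mask": None}
)
```

After this, `use_cache` can be closed over by the loop body and passed explicitly to `prepare_inputs_for_generation`, instead of riding along inside `model_kwargs`.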
```diff
 else:
     input_ids = tf.expand_dims(generated[:, cur_len - 1], -1)
-model_inputs = self.prepare_inputs_for_generation(input_ids, **model_kwargs)
+model_inputs = self.prepare_inputs_for_generation(input_ids, use_cache=use_cache, **model_kwargs)
```
sgugger left a comment
Thanks a lot for expanding generation configs to TF!
What does this PR do?

Changes the TF side of `.generate()` such that it relies on the `GenerationConfig`. This is the TF equivalent of #20388.
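The precedence rule the PR applies can be sketched in plain Python (names are illustrative, not the real `TFGenerationMixin` code): an argument passed directly to `.generate()` wins, and the `GenerationConfig` supplies the default otherwise.

```python
class GenerationConfig:
    # Simplified stand-in for transformers.GenerationConfig, with only
    # the two flags used in this sketch.
    def __init__(self, return_dict_in_generate=False, use_cache=True):
        self.return_dict_in_generate = return_dict_in_generate
        self.use_cache = use_cache

def resolve_generate_flags(generation_config, return_dict_in_generate=None, **model_kwargs):
    # Ad-hoc .generate() arguments take precedence; the GenerationConfig
    # fills in anything left unspecified.
    return_dict_in_generate = (
        return_dict_in_generate
        if return_dict_in_generate is not None
        else generation_config.return_dict_in_generate
    )
    use_cache = model_kwargs.pop("use_cache", generation_config.use_cache)
    return return_dict_in_generate, use_cache

# Caller overrides one flag; the other falls back to the config default.
flags = resolve_generate_flags(GenerationConfig(), return_dict_in_generate=True)
```

This mirrors the PT-side change: all generation defaults live in one `GenerationConfig` object instead of being scattered across the model config and hard-coded fallbacks.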