Fix attn_implementation documentation #29295
Conversation
@fxmarty I don't think this is quite right - if someone can pass it in here, they should also be able to pass it in to the config constructor and then pass that config to the model
@amyeroberts This is not what was suggested in #26572 (comment) by @patrickvonplaten. Happy to change that, but I think it should be in another PR. This PR simply reflects in the documentation the usage currently expected of users:

```python
model = AutoModel.from_config(cfg, attn_implementation="eager")
model = LlamaModel.from_pretrained("xxx", attn_implementation="eager")
```

while the following is illegal/not supported:

```python
from transformers import AutoModelForCausalLM, AutoConfig, LlamaForCausalLM

cfg = AutoConfig.from_pretrained("fxmarty/tiny-llama-fast-tokenizer")
cfg.attn_implementation = "eager"
model = LlamaForCausalLM(cfg)
```
Ultimately I agree with you,
@fxmarty OK, thanks for explaining and linking to the relevant comment! If it's something that causes a lot of confusion, we can circle back on enabling it through config creation.
amyeroberts left a comment
Thanks for following up on this and making the docstrings consistent!
As reported in #26572 (comment), `attn_implementation` is wrongly documented under `PretrainedConfig`, and is not documented under `AutoModel.from_config` & `PreTrainedModel.from_pretrained` as it should be.
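For concreteness, here is a minimal sketch of the usage pattern this PR documents: `attn_implementation` is passed at the model entry points (`AutoModel.from_config` / `PreTrainedModel.from_pretrained`) rather than set on the config. The tiny checkpoint is the one used in the thread; the final print relies on the private `_attn_implementation` config attribute, an assumption about `transformers` internals at the time of this PR.

```python
from transformers import AutoConfig, AutoModel, AutoModelForCausalLM

# Supported: pass attn_implementation directly to from_pretrained
model = AutoModelForCausalLM.from_pretrained(
    "fxmarty/tiny-llama-fast-tokenizer",
    attn_implementation="eager",  # "eager", "sdpa" or "flash_attention_2"
)

# Supported: pass attn_implementation to from_config
cfg = AutoConfig.from_pretrained("fxmarty/tiny-llama-fast-tokenizer")
model = AutoModel.from_config(cfg, attn_implementation="eager")

# The resolved choice ends up on the config under a private attribute
# (internal detail, not a public API).
print(model.config._attn_implementation)
```

Contrast this with the pattern shown as unsupported above, where the attribute is assigned on the config and the config is then passed to the model constructor directly.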