Fix GPT2 with cross attention#39754
Conversation
|
run-slow: gpt2, vision_encoder_decoder |
|
This comment contains run-slow, running the specified jobs: models: ['models/gpt2', 'models/vision_encoder_decoder'] |
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
|
|
||
| inputs_dict["output_attentions"] = True | ||
| config.output_hidden_states = False | ||
| config._attn_implementation = "eager" |
There was a problem hiding this comment.
we should use config.set_....
|
run-slow: decision_transformer, gpt2, vision_encoder_decoder |
|
This comment contains run-slow, running the specified jobs: models: ['models/decision_transformer', 'models/gpt2', 'models/vision_encoder_decoder'] |
|
[For maintainers] Suggested jobs to run (before merge) run-slow: decision_transformer, gpt2, vision_encoder_decoder |
|
run-slow: decision_transformer, gpt2, vision_encoder_decoder |
|
This comment contains run-slow, running the specified jobs: models: ['models/decision_transformer', 'models/gpt2', 'models/vision_encoder_decoder'] |
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
* fix * use new mask API * style * fix copies and attention tests * fix head pruning tests
Original PR #39754 by zucchini-nlp Original: huggingface/transformers#39754
Merged from original PR #39754 Original: huggingface/transformers#39754
Original PR #39754 by zucchini-nlp Original: huggingface/transformers#39754
Merged from original PR #39754 Original: huggingface/transformers#39754
What does this PR do?
Fixes #39746