Add head_mask and decoder_head_mask to PyTorch LED #9856
stas00 merged 6 commits into huggingface:master
Conversation
* Add head_mask to longformer to fix dependencies of LED on Longformer.
* Not working yet
patrickvonplaten
left a comment
Great! Very clean implementation, thanks for taking care of this :-)
LysandreJik
left a comment
Fantastic! Thanks for working on that @stancld!
sgugger
left a comment
Very clean, thanks for your PR! Just one styling nit, but feel free to ignore.
```python
assert layer_head_mask.size() == (
    self.num_heads,
), f"Head mask for a single layer should be of size {(self.num_heads,)}, but is {layer_head_mask.size()}"
```
Complete nit, but those asserts are not super well formatted. Can we replace them with an `if` and raise a proper error?
```diff
- assert layer_head_mask.size() == (
-     self.num_heads,
- ), f"Head mask for a single layer should be of size {(self.num_heads,)}, but is {layer_head_mask.size()}"
+ if layer_head_mask.size() != (self.num_heads,):
+     raise ValueError(
+         f"Head mask for a single layer should be of size {(self.num_heads,)}, but is {layer_head_mask.size()}"
+     )
```
(If you do one do all of them.)
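The suggested pattern can be sketched as a small stdlib-only helper; this is an illustrative sketch, not the actual transformers code (the function name and the plain shape tuple standing in for `layer_head_mask.size()` are assumptions):

```python
def validate_head_mask_shape(mask_shape, num_heads):
    """Raise a proper error instead of asserting.

    `mask_shape` stands in for `layer_head_mask.size()` in the real code;
    a per-layer head mask must be a 1-D tensor of shape (num_heads,).
    """
    if tuple(mask_shape) != (num_heads,):
        raise ValueError(
            f"Head mask for a single layer should be of size {(num_heads,)}, "
            f"but is {tuple(mask_shape)}"
        )
```

Unlike `assert`, this check survives running Python with `-O` (which strips assertions) and raises an exception type callers can catch deliberately.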
Much better - I tried to find a way to make the autoformatter format this nicely, but couldn't figure it out. Yours is great and it reads even better.
@sgugger Thank you very much for your suggestion, I definitely agree with this. Do you think I should create a new PR to replace this part in all the models where desired?
Since it was a nit and not a requirement, I'd say whatever works the best for you, @stancld - thank you!
@stas00 Thanks for the quick reply. For now I would leave it as it is, and I will replace these assertions in a new PR once `head_mask` and `decoder_head_mask` are implemented for all encoder-decoder models. That way this change isn't mixed into this PR, even though it's only a minor one; if that's okay :)
That works well, @stancld!
Thank you for your great contribution!
This PR implements `head_mask` and `decoder_head_mask` for PyTorch LED (and Longformer, as there's a copy dependency) and is a follow-up to the open issue #9814.

Motivation: This PR is part of an endeavour to enable the usage of `head_mask` and `decoder_head_mask` for all encoder-decoder transformers, following the recent work on BART-like models (#9569).

Fixes: #9814

Reviewers: @patrickvonplaten @LysandreJik @stas00
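Conceptually, a per-layer head mask is multiplied head-wise into the attention probabilities, so a 0 entry silences that head. A hypothetical stdlib-only sketch of that idea (function name and plain-list representation are illustrative, not the transformers implementation):

```python
def apply_head_mask(attn_probs, layer_head_mask):
    """Multiply a per-head mask into attention probabilities.

    attn_probs: list of per-head probability rows, length == num_heads.
    layer_head_mask: sequence of num_heads scalars (e.g. 1.0 keeps a head,
    0.0 prunes it), or None to leave the probabilities untouched.
    """
    if layer_head_mask is None:
        return attn_probs
    if len(layer_head_mask) != len(attn_probs):
        raise ValueError(
            f"Head mask for a single layer should be of size {(len(attn_probs),)}, "
            f"but is {(len(layer_head_mask),)}"
        )
    return [[m * p for p in head] for m, head in zip(layer_head_mask, attn_probs)]
```

In the real models the same multiplication happens on tensors inside each attention layer, with the per-layer slice of `head_mask` (or `decoder_head_mask`) broadcast over the attention score dimensions.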