Add wmt translation example #3428
LysandreJik left a comment:
Cool, that looks clean. I guess we don't need all the usual arguments (data parallel, fp16, and others) since this is an evaluation script and not a training script?
It doesn't seem to run on GPU; does it take a long time to compute? Shouldn't we try to cast the model to a GPU if one is available?
Yeah, we should definitely try to run it on a GPU - will take a look at that :-)

Not sure whether we need fp16 and multi-GPU support. I think a single GPU is enough, and T5 + WMT does not take much memory. But happy to take a look into it if you guys think it's worth it :-) @thomwolf @LysandreJik @julien-c
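The GPU fallback discussed above can be sketched like this (a minimal sketch; a stand-in `nn.Linear` module is used in place of the actual T5 model so the pattern is visible without downloading weights, and the same `.to(device)` call applies to the real model and inputs):

```python
import torch

# Pick a GPU when one is available, otherwise fall back to CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Any nn.Module behaves the same way as the T5 model here:
# .to(device) moves the module's weights onto the chosen device.
model = torch.nn.Linear(4, 4).to(device)

# Inputs must live on the same device as the model before the forward pass.
batch = torch.randn(2, 4).to(device)
output = model(batch)
print(output.device.type)  # "cuda" on a GPU machine, "cpu" otherwise
```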
Code quality test fails because of the unpinned isort library (see #3449).
@@ -0,0 +1,51 @@
***This script evaluates the [T5 Model](https://arxiv.org/pdf/1910.10683.pdf) ``t5-base`` on the English to German WMT dataset. Please note that the results in the paper were attained using a ``t5-base`` model fine-tuned on translation, so results will be slightly worse here.***
Codecov Report

@@            Coverage Diff            @@
##           master    #3428   +/-   ##
=======================================
  Coverage   52.51%   52.51%
=======================================
  Files         100      100
  Lines       17051    17051
=======================================
  Hits         8954     8954
  Misses       8097     8097

Continue to review the full report at Codecov.
This PR adds a translation example for T5. It uses the sacrebleu BLEU scorer. I adapted the README.md a bit so that users are aware that the results in the official paper were attained with a fine-tuned T5. @craffel