Conversation

@lintangsutawika

Added a template_config_name arg so that the dataset and the prompt-template source can be different, for example: prompts from XNLI En but data to eval from XNLI Fr.

@lintangsutawika (Author)

Example to run eval on XNLI

CHECKPOINT_PATH="bigscience/bloom-350m"
OUTPUT_DIR="bloom-xnli"
dataset_name="xnli"
template_config_name="en"
dataset_config_name="fr"

python t-zero/evaluation/run_eval.py \
        --dataset_name $dataset_name \
        --dataset_config_name $dataset_config_name \
        --template_config_name $template_config_name \
        --model_name_or_path $CHECKPOINT_PATH \
        --output_dir $OUTPUT_DIR \
        --template_name 'GPT-3 style'
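
Roughly, the flag is meant to work along these lines. This is only a minimal sketch of the idea, not the actual run_eval.py code; it assumes promptsource's DatasetTemplates and the datasets library's load_dataset, with the template name taken from the command above:

# Minimal sketch: prompts are looked up under one XNLI config ("en")
# while the examples come from another ("fr").
from datasets import load_dataset
from promptsource.templates import DatasetTemplates

dataset_name = "xnli"
dataset_config_name = "fr"      # data to evaluate on
template_config_name = "en"     # where the prompt templates come from

# Fall back to the data config when no separate template config is given.
template_source = template_config_name or dataset_config_name

prompts = DatasetTemplates(dataset_name, template_source)
template = prompts["GPT-3 style"]

eval_set = load_dataset(dataset_name, dataset_config_name, split="validation")

# template.apply renders one example into its prompted input/target strings.
rendered = template.apply(eval_set[0])
print(rendered)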

@VictorSanh left a comment

thank you @lintangsutawika! lgtm! feel free to merge!

"--template_config_name",
type=str,
default=None,
help="The name of the dataset_config_name of the template we want to use, example: use XNLI En prompts for XNLI Fr",

@thomasw21 (Member)

Is using English prompts on the French XNLI something good? Like, I understand it's some sort of measure of multilinguality, but I would have expected a bunch of French prompts for the XNLI fr ...

@lintangsutawika (Author)

I thought using English prompts for XNLI is indeed what we wanted to accomplish?

Anyway, evaluating XNLI with prompts and dataset in the same language only requires the Promptsource part to be updated. So --template_config_name adds more flexibility while not requiring much code change to eval in multiple multilingual settings.
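
For example, sweeping the evaluation language while keeping the English prompts fixed could look like the sketch below; the language list is illustrative and only the flags from the example command above are used:

# Rough sketch of the "multiple multilingual settings" use case: the English
# XNLI prompts are reused while only the data config changes per run.
import subprocess

checkpoint = "bigscience/bloom-350m"
for lang in ["fr", "es", "de"]:  # illustrative subset of XNLI configs
    subprocess.run(
        [
            "python", "t-zero/evaluation/run_eval.py",
            "--dataset_name", "xnli",
            "--dataset_config_name", lang,
            "--template_config_name", "en",
            "--template_name", "GPT-3 style",
            "--model_name_or_path", checkpoint,
            "--output_dir", f"bloom-xnli-{lang}",
        ],
        check=True,
    )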

@thomasw21 (Member)

Looks good to me, besides the fact that the prompts are English prompts (I expected to have them in their own languages).

@VictorSanh (Member)

> Looks good to me, besides the fact that the prompts are English prompts (I expected to have them in their own languages).

I think this is fine. A bunch of the prompts coming from the eval hackathon are code-switching.

@thomasw21 (Member)

Thanks @lintangsutawika

@thomasw21 merged commit 50c27b5 into bigscience-workshop:thomas/support_new_accelerate_api on Jul 15, 2022