Short Answer Feedback Generation Baseline

!!! Version 3.0 of this dataset can now be found Huggingface!!!

This repository contains the dataset and code corresponding to the paper "Your Answer is Incorrect… Would you like to know why? Introducing a Bilingual Short Answer Feedback Dataset" (Filighera et al., ACL 2022).

The experiments were run with a conda python environment using python version 3.7.10. The environment to reproduce our results can be installed in the following way (for windows, uncomment pywin32=300 in requirements.txt):

conda create --name baseline python=3.7
conda activate baseline
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=10.1 -c pytorch
pip install -r requirements.txt
pip install bert-score

Download and copy your data set into the data folder or ensure that SAF is there.

Run the preprocessing script to preprocess the data set, optionally edit the file path and hyperparameters in the script

python preprocessing.py

Run the finetuning script to train the models. Adjust variable mode to select which experiment setting you want to use, you can also edit hyperparameters here

python finetuning.py

A model can be tested by running the testing script. Adjust model path, language and data paths in the script.

python testing.py

Testing saves the model predictions to a file, which can then be utilized to calculate and print out the BERT score. As file convention, the file should start with the experiment mode.

python bert_scoring.py

We also provide an example script for inference with a finetuned model (using our code) and a json file that contains a question, a reference answer, and a list of student answers.

python inference.py

You may also adjust the seeding in the litT5.py script.

If you found this code or dataset helpful in your research, please consider citing:

@inproceedings{filighera-etal-2022-answer,
    title = "Your Answer is Incorrect... Would you like to know why? Introducing a Bilingual Short Answer Feedback Dataset",
    author = "Filighera, Anna  and
      Parihar, Siddharth  and
      Steuer, Tim  and
      Meuser, Tobias  and
      Ochs, Sebastian",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.acl-long.587",
    pages = "8577--8591",
   }

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
data		data
source_code		source_code
LICENSE		LICENSE
README.md		README.md
SAF2_0.zip		SAF2_0.zip
finetuning.py		finetuning.py
inference.py		inference.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
testing.py		testing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Short Answer Feedback Generation Baseline

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

SebOchs/SAF

Folders and files

Latest commit

History

Repository files navigation

Short Answer Feedback Generation Baseline

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages