- bert_score
- blobfile
- nltk
- numpy
- packaging
- psutil
- PyYAML
- setuptools
- spacy
- torch==1.9.0+cu111
- torchmetrics
- tqdm
- transformers==4.22.2
- wandb
- datasets
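These can be installed with pip; a minimal sketch, assuming the list above is saved as `requirements.txt` (the pinned `torch==1.9.0+cu111` wheel is served from the PyTorch wheel index rather than PyPI):

```bash
# The CUDA 11.1 build of torch is not hosted on PyPI, so point pip
# at the PyTorch wheel index for that package first.
pip install torch==1.9.0+cu111 -f https://download.pytorch.org/whl/torch_stable.html
# Then install the remaining dependencies.
pip install -r requirements.txt
```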
Prepare the datasets and put them under the `datasets` folder. The two datasets we used, `datasets/bugfix` and `datasets/bugfixlen`, are already placed there; the corresponding vocabulary files, named `vocab.txt` and `vocablen.txt`, sit in their respective folders.
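For reference, a hypothetical layout matching the description above (the `src`/`trg` JSON keys are an assumption for illustration; check the data loader for the exact field names):

```bash
# Expected layout; file names follow the training arguments below.
# Each line of the *.jsonl files is assumed to be one JSON object
# holding a source/target pair, e.g. {"src": "...", "trg": "..."}.
ls datasets/bugfix
# train.jsonl  valid.jsonl  test.jsonl  vocab.txt
ls datasets/bugfixlen
# train.jsonl  valid.jsonl  test.jsonl  vocablen.txt
```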
```bash
cd scripts
bash train.sh
```

Arguments explanation:
- `--dataset`: the name of the dataset, just for notation
- `--data_dir`: the path to the saved datasets folder, containing `train.jsonl`, `test.jsonl`, `valid.jsonl`
- `--seq_len`: the max length of sequence $z$ ($x \oplus y$)
- `--resume_checkpoint`: if not none, restore this checkpoint and continue training
- `--vocab`: initialize the tokenizer from BERT, or load your own preprocessed vocab dictionary (e.g. built with BPE, or our provided vocab)
Additional arguments:

- `--learned_mean_embed`: whether to use the learned soft absorbing state
- `--denoise`: whether to add discrete noise
- `--use_fp16`: whether to use mixed-precision training
- `--denoise_rate`: the denoise rate, with 0.5 as the default
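Putting it together, a hypothetical invocation combining the flags above. The values here are placeholders, and `scripts/train.sh` may hard-code these settings internally rather than forwarding arguments; if so, edit the variables inside the script instead.

```bash
cd scripts
# Hypothetical example values; adjust seq_len, paths, and rates for your setup.
bash train.sh \
  --dataset bugfix \
  --data_dir ../datasets/bugfix \
  --seq_len 128 \
  --vocab bert \
  --learned_mean_embed True \
  --denoise True \
  --denoise_rate 0.5 \
  --use_fp16 True
```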
```bash
cd scripts
bash run_decode.sh
```

Alternatively, decode with the solver-based script:

```bash
cd scripts
bash run_decode_solver.sh
```