This repository contains the official implementation of the paper "Erasing Without Remembering: Implicit Knowledge Forgetting in Large Language Models".
We provide an unbiased assessment of how well existing unlearning methods forget in-scope, implicit samples. The evaluation spans three data domains: two widely used machine unlearning datasets, TOFU and the Harry Potter books, as well as a popular model editing dataset, ZsRE. We evaluate 14 existing methods on two language models of different scales, Phi-1.3B and LLaMA2-7B.
Despite significant progress in LLM unlearning, we identify an embarrassingly simple yet critical dilemma: existing machine unlearning methods consistently fail to generalise beyond the exact samples they were asked to forget.
To illustrate this, we introduce the notion of an unlearning scope: the full set of knowledge an unlearned model is expected to forget, including paraphrased versions of the forget samples, reversed relations, one-hop questions, and questions with substituted subjects. For example, a model unlearned on "J. K. Rowling is the author of Harry Potter" should also be unable to answer "Who wrote Harry Potter?".
Requirements
conda create -n unlearn python=3.8.19
conda activate unlearn
pip install -r requirements.txt
The code supports 14 unlearning methods: "grad_ascent", "grad_ascent+kl", "grad_ascent+gd", "dpo", "dpo+kl", "dpo+gd", "npo", "npo+kl", "npo+gd", "task_vector", "ULD", "WHP", "icl", and "PerMU".
Specify the method you want to test in the corresponding forget_xxx.sh file, e.g.:
export Forget_Loss=("PerMU");
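Since Forget_Loss is a bash array, several methods can be queued in a single run, assuming the forget_xxx.sh scripts iterate over the array (a guess based on the syntax above; check the script you are using). A minimal sketch:

# Sketch: sweep several methods in one run. Assumes the forget_xxx.sh
# scripts loop over the Forget_Loss array; verify against the actual script.
export Forget_Loss=("grad_ascent" "npo+kl" "PerMU");
for loss in "${Forget_Loss[@]}"; do
    echo "Unlearning with method: ${loss}"
done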
When testing on TOFU, we use the pre-trained target model checkpoints from the TOFU Leaderboard. For the Harry Potter and ZsRE datasets, you can download our pre-trained models from Hugging Face, as specified in the model_config.yaml file.
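If you prefer to fetch a checkpoint manually, here is a minimal sketch using the Hugging Face CLI; the repo ID below is a placeholder, so substitute the one listed in model_config.yaml:

# Sketch: pre-download a target model. Replace the placeholder repo ID
# with the one specified in model_config.yaml.
pip install -U "huggingface_hub[cli]"
huggingface-cli download <org>/<model-repo> --local-dir ./checkpoints/target_model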
Running the Code
Unlearning on the TOFU dataset:
bash scripts/forget_tofu.sh
Unlearning on the Harry Potter dataset:
bash scripts/forget_harry.sh
Unlearning on the ZsRE dataset:
bash scripts/forget_zsre.sh
Integrating a New Model
To unlearn a new model, add its configuration to model_config.yaml, then fine-tune it with finetune.sh.
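A rough sketch of the workflow follows; the YAML key is a hypothetical placeholder, not the repo's actual schema, so mirror the existing entries in model_config.yaml:

# Sketch: register a new model, then fine-tune it on the target dataset.
# The field name "hf_key" is hypothetical; copy the structure of existing entries.
cat >> model_config.yaml <<'EOF'
my_new_model:
  hf_key: "org/my-model-7b"
EOF
bash finetune.sh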
If you find our implementation and paper helpful, please consider citing our work:
@article{wang2025erasing,
  title={Erasing Without Remembering: Safeguarding Knowledge Forgetting in Large Language Models},
  author={Wang, Huazheng and Jing, Yongcheng and Sun, Haifeng and Wang, Yingjie and Wang, Jingyu and Liao, Jianxin and Tao, Dacheng},
  journal={arXiv preprint arXiv:2502.19982},
  year={2025}
}
