Skip to content

linhhtran/CoRe-Reranking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Attention-based Reranker with Contrastive Retrieval Heads

Repository for attention-based reranker with contrastive retrieval head detection. Our head detector identifies top retrieval heads by contrasting the attention score of positive and hard negative documents. The top detected retrieval heads (roughly 1% of all heads) significantly improves reranking task compared to noisy aggregation over all heads.

The attention-based reranker implementation is adapted from In-Context-Reranking.

Datasets

We use granite-embedding-30m-english to retrieve top-40 documents for BEIR benchmark, and granite-embedding-107m-multilingual for MLDR datasets.

We upload the retriever outputs here which can be downloaded and stored in the ./retriever_output folder.

The head detection data can be downloaded from here which should be stored in the ./head_data folder.

Experiment examples

We already include the CoRe head scores for each model in the ./head_data folder. The head scores (example for Mistral 7B) can be reproduced with the following command (run from ./experiments/):

python head_detection.py --llm mistral --detector core --temp 0.001

The following example command runs the reranking process with 8 retrieval heads on hotpotqa dataset:

python reranking.py --llm mistral --data hotpotqa --reranker core --num_head 8

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages