Official PyTorch implementation of Similarity-Aware Token Pruning: Your VLM but Faster.
Authors: Ahmadreza Jeddi, Negin Baghbanzadeh, Elham Dolatabadi, Babak Taati
SAINT lets researchers and practitioners prune the visual tokens of existing ViTs and VLMs in a training-free setup. It models tokens as a graph, which enables aggressive, similarity-based (redundancy-based) token dropping in the early layers of the ViT/LLM, substantially improving inference efficiency with minimal performance loss (a minimal sketch follows the list below). SAINT is:
- The first method that deeply analyzes token dynamics and establishes patterns of token evolution common to both vision encoders and language models; both follow an implicit three-stage evolution, called aligner-explorer-aggregator (see the paper for details).
- More robust: token similarity provides a stronger pruning signal than attention, and dropping outperforms merging in the early stages.
- State-of-the-art (SOTA) in performance for both ViTs and VLMs.
- Versatile: the first VLM pruning method to analyze pruning before the LLM, within the LLM, and via a hybrid approach, showcasing the performance-efficiency trade-offs.
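
For intuition, here is a minimal sketch of similarity-based token dropping: build a cosine-similarity graph over the tokens and drop those whose nearest neighbour is most similar (i.e., the most redundant ones). This is only an illustration; the function name `prune_tokens_by_similarity`, the `keep_ratio` parameter, and the nearest-neighbour redundancy score are assumptions for the sketch, not the exact scoring rule used by SAINT (see the paper and the `ViT`/`VLM` folders for the actual implementation).

```python
import torch
import torch.nn.functional as F


def prune_tokens_by_similarity(tokens: torch.Tensor, keep_ratio: float = 0.5) -> torch.Tensor:
    """Drop the most redundant visual tokens based on pairwise cosine similarity.

    tokens:     (B, N, D) visual tokens from an early ViT/LLM layer (CLS excluded).
    keep_ratio: fraction of tokens to keep; the rest are dropped, not merged.
    Illustrative sketch only -- not the exact SAINT scoring rule.
    """
    B, N, D = tokens.shape
    # Build a cosine-similarity graph over all token pairs.
    normed = F.normalize(tokens, dim=-1)
    sim = normed @ normed.transpose(1, 2)          # (B, N, N)
    sim.diagonal(dim1=1, dim2=2).fill_(-1.0)       # ignore self-similarity
    # A token whose nearest neighbour is highly similar carries redundant information.
    redundancy = sim.max(dim=-1).values            # (B, N)
    # Keep the least redundant tokens (hard dropping).
    n_keep = max(1, int(N * keep_ratio))
    keep_idx = redundancy.topk(n_keep, largest=False).indices
    keep_idx, _ = keep_idx.sort(dim=-1)            # preserve the original token order
    return tokens.gather(1, keep_idx.unsqueeze(-1).expand(-1, -1, D))


# Example: prune half of 576 visual tokens (e.g., a LLaVA-style projector output).
visual_tokens = torch.randn(2, 576, 1024)
pruned = prune_tokens_by_similarity(visual_tokens, keep_ratio=0.5)
print(pruned.shape)  # torch.Size([2, 288, 1024])
```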
- ViT: Navigate to the `ViT` folder for the code and a README with instructions on how to work with Vision Transformers.
- VLM: Navigate to the `VLM` folder for the code and a README with instructions on how to work with Vision-Language Models.
If you use our work or this repository in your research, please cite our paper:
@misc{jeddi2025similarityawaretokenpruningvlm,
title={Similarity-Aware Token Pruning: Your VLM but Faster},
author={Ahmadreza Jeddi and Negin Baghbanzadeh and Elham Dolatabadi and Babak Taati},
year={2025},
eprint={2503.11549},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.11549},
}

This repository was inspired by and builds upon the codebases from the ToMe and FastV repositories.
