Jiwei Jiang 1, Xiu Nie 2✉, He Tang 1✉, Xiang Bai 1
2 Department of Pathology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
This repo is the official implementation of "PathVG: A New Benchmark and Dataset for Pathology Visual Grounding" (MICCAI 2025 Spotlight).
We introduce RefPath, a large-scale pathology visual grounding dataset containing 27,610 pathology images with 33,500 expert-verified language-grounded bounding boxes.
Contact: clzhong@hust.edu.cn; hetang@hust.edu.cn
Track key project milestones and resource releases below. Click the links to access corresponding resources once available:
- 2025-06-25: Repository initialization (basic structure & README released)
- 2025-09-30: PathVG Dataset public release → Download Dataset
- 2025-12: Full training/test code public release
- 2025-12: Pre-trained model weights public release → Weights Release
🔧 Environment Setup

Prerequisites: our PathVG model is built on TransCP. Please first follow TransCP's official environment configuration guide:
```shell
git clone https://github.com/WayneTomas/TransCP.git
conda create -n pytorch1.7 python=3.6.13
conda activate pytorch1.7
pip install -r requirements.txt
```

The RefPath dataset is publicly available on Hugging Face.
The dataset consists of:
- `train.jsonl`, `testA.jsonl`, and `testB.jsonl` annotation files
- the corresponding pathology images (provided as parquet files or individual images, depending on the split)
After downloading, we recommend organizing the data as follows:
```
RefPath/
├── train.jsonl
├── testA.jsonl
├── testB.jsonl
└── images/          # extracted from the parquet files or downloaded directly
    ├── xxx.jpg
    ├── yyy.jpg
    └── ...
```
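Each line of a `.jsonl` annotation file is a standalone JSON object. As a rough sketch of how to read them (the field names `image`, `bbox`, and `expression` below are placeholders for illustration only; check the released files for the actual schema):

```python
import json

def load_annotations(jsonl_path):
    """Read a .jsonl file where each non-empty line is one JSON record.

    NOTE: the field names in RefPath may differ; 'image', 'bbox', and
    'expression' are hypothetical names used here for illustration.
    """
    records = []
    with open(jsonl_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records

if __name__ == "__main__":
    # Tiny self-contained demo on a synthetic annotation file.
    import os
    import tempfile
    demo = {"image": "xxx.jpg", "bbox": [10, 20, 110, 220],
            "expression": "the cluster of atypical cells"}
    with tempfile.NamedTemporaryFile("w", suffix=".jsonl", delete=False) as tmp:
        tmp.write(json.dumps(demo) + "\n")
        path = tmp.name
    anns = load_annotations(path)
    print(len(anns), anns[0]["image"])
    os.remove(path)
```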
The training and evaluation code requires pre-processed data files (.pth format) for efficient loading. We provide a conversion script, jsonl2pth.py, in the repository:
```shell
python jsonl2pth.py \
    --input_jsonl ./data/RefPath/testB.jsonl \
    --output_pth ./PathVG/split/data/pathology2/testB.pth \
    --image_base ./data/RefPath/refpath_images
```

The following is an example of training the model on the RefPath dataset.
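Conceptually, the conversion gathers the per-line JSON records and resolves each image path against `--image_base` before serializing the list. The snippet below is a hedged sketch of that step, not the actual script: the real jsonl2pth.py may use different field names and saves the result with `torch.save`.

```python
import json
import os

def gather_records(input_jsonl, image_base):
    """Collect annotation records with image paths resolved against image_base.

    Hypothetical sketch: 'image' is an assumed field name, and the actual
    jsonl2pth.py saves the result via torch.save(records, output_pth).
    """
    records = []
    with open(input_jsonl, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue
            item = json.loads(line)
            # Resolve the relative image file name to a full path.
            item["image"] = os.path.join(image_base, item["image"])
            records.append(item)
    return records
```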
```shell
python -m torch.distributed.launch --nproc_per_node=2 --master_port=29516 train.py \
    --config configs/TransCP_R50_pathology2.py
```

You can download our trained weights for testing here: Weights Release
```shell
python -m torch.distributed.launch --nproc_per_node=2 --master_port=29516 eval.py \
    --config configs/TransCP_R50_pathology2.py \
    --test_split testB \
    --resume /path/to/your/checkpoint.pth
```

- `--test_split`: the dataset split to evaluate. `testA` corresponds to 40X-magnification pathology images, `testB` corresponds to 20X-magnification images, and `val` includes all test images for comprehensive evaluation.
- `--resume`: path to the pre-trained model checkpoint (`.pth` file). This parameter is mandatory: it loads the trained weights for inference.
- `--nproc_per_node`: number of GPUs used for distributed evaluation (set to `2` in the example; adjust based on available GPUs).
- `--master_port`: port for distributed communication (change to an unused port such as `29517` to avoid conflicts).
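Visual-grounding benchmarks typically report Acc@0.5: a prediction counts as correct when its IoU with the ground-truth box reaches 0.5. A minimal sketch of that metric, assuming boxes in `[x1, y1, x2, y2]` format (the released eval.py may implement it differently):

```python
def box_iou(a, b):
    """IoU of two boxes given as [x1, y1, x2, y2]."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def acc_at_05(preds, gts):
    """Fraction of predicted boxes with IoU >= 0.5 against the ground truth."""
    hits = sum(box_iou(p, g) >= 0.5 for p, g in zip(preds, gts))
    return hits / len(gts)
```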
If you use the PathVG dataset, code, or results in your research, please cite our MICCAI 2025 paper:
```bibtex
@InProceedings{ZhoChu_PathVG_MICCAI2025,
  author    = {Zhong, Chunlin and Hao, Shuang and Wu, Junhua and Chang, Xiaona and Jiang, Jiwei and Nie, Xiu and Tang, He and Bai, Xiang},
  title     = {{PathVG: A New Benchmark and Dataset for Pathology Visual Grounding}},
  booktitle = {Medical Image Computing and Computer Assisted Intervention -- MICCAI 2025},
  year      = {2025},
  publisher = {Springer Nature Switzerland},
  volume    = {LNCS 15972},
  month     = {October},
  pages     = {454--463},
}
```
For questions or issues, please open an issue or contact the corresponding author at hetang@hust.edu.cn.
Part of our code is based on the previous works DETR, TransVG, and TransCP; thanks to their authors.
