Code for our GRSL 2025 paper "Multimodal-Aware Fusion Network for Referring Remote Sensing Image Segmentation".
Contributed by Leideng Shi, Juan Zhang*.
Install the dependencies.
The code was tested on Ubuntu 20.04.6, with Python 3.7 and PyTorch v1.12.1.
Clone this repository.
```
git clone https://github.com/Roaxy/MAFN.git
```
Create a new Conda environment with Python 3.7 then activate it:
```
conda create -n MAFN python==3.7
conda activate MAFN
```
Install PyTorch v1.12.1 (CUDA 10.2 is used in this example).
```
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=10.2 -c pytorch
```
Install the requirements.
```
pip install -r requirements.txt
```
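Before moving on, you can confirm the interpreter meets the tested Python version with a quick check (a hypothetical helper, not part of this repo):

```python
import sys

# Hypothetical sanity check: the code was tested with Python 3.7.
def meets_min_version(actual, required):
    """True if the version tuple `actual` is at least `required`."""
    return tuple(actual) >= tuple(required)

meets_min_version(sys.version_info[:3], (3, 7, 0))
```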
The RRSIS-D dataset can be downloaded from Google Drive or Baidu Netdisk; then follow the RMSIN dataset instructions. Organize the files into the following directory structure:
```
$DATA_PATH
├── rrsisd
│   ├── refs(unc).p
│   └── instances.json
└── images
    └── rrsisd
        ├── JPEGImages
        └── ann_split
```
After downloading, unpack the archives into the data directory following the structure above.
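As a quick sanity check, a small helper (hypothetical, not part of the repo) can verify that the layout above is in place before training:

```python
import os

# Relative paths required by the RRSIS-D layout shown above.
REQUIRED = [
    "rrsisd/refs(unc).p",
    "rrsisd/instances.json",
    "images/rrsisd/JPEGImages",
    "images/rrsisd/ann_split",
]

def missing_entries(data_path):
    """Return the required entries that do not exist under data_path."""
    return [rel for rel in REQUIRED
            if not os.path.exists(os.path.join(data_path, rel))]
```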
Download the pre-trained classification weights of the Swin Transformer to initialize the model, and put the .pth file in ./pretrained_weights.
Download the pre-trained bert-base-uncased weights of BERT to initialize the model, and put the files in ./MAFN/bert-base-uncased.
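A minimal sketch to check the BERT directory, assuming the standard Hugging Face file layout for bert-base-uncased (the exact file set can vary by download source):

```python
import os

# Assumed standard Hugging Face files for bert-base-uncased.
BERT_FILES = ("config.json", "vocab.txt", "pytorch_model.bin")

def bert_dir_ready(bert_dir):
    """True if all expected BERT files are present in bert_dir."""
    return all(os.path.isfile(os.path.join(bert_dir, name))
               for name in BERT_FILES)
```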
After the preparation, you can start training with the following command. We use PyTorch's DistributedDataParallel for training. To run on 2 GPUs (with IDs 0 and 1) on a single node:
```
sh ./train.sh
```
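For reference, each worker process in a DistributedDataParallel run can read its identity from the standard environment variables set by PyTorch's launcher (torchrun / torch.distributed.launch); the helper name below is illustrative, not from the repo:

```python
import os

# RANK and WORLD_SIZE are the standard PyTorch distributed env vars.
def dist_info(env=None):
    """Return (rank, world_size) as set by the launcher, with single-process defaults."""
    env = os.environ if env is None else env
    return int(env.get("RANK", 0)), int(env.get("WORLD_SIZE", 1))
```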
For testing, run:
```
# default setting
sh ./test.sh
```
You may modify the code at test.sh:11 to use the val split instead of test. By default, the split is set to test.
| Method | P@0.5 | P@0.6 | P@0.7 | P@0.8 | P@0.9 | oIoU | mIoU | Download |
|---|---|---|---|---|---|---|---|---|
| MAFN | 76.32 | 69.31 | 58.33 | 44.54 | 24.71 | 78.33 | 66.03 | model |
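The table's metrics can be read as follows: P@X is the fraction of test samples whose predicted mask reaches IoU ≥ X with the ground truth, mIoU averages per-sample IoU, and oIoU divides the total intersection by the total union over the whole set. Illustrative definitions (not the repo's evaluation code):

```python
def precision_at(ious, thr):
    """P@thr: fraction of samples with IoU at or above the threshold."""
    return sum(iou >= thr for iou in ious) / len(ious)

def miou(ious):
    """Mean IoU over all samples."""
    return sum(ious) / len(ious)

def oiou(intersections, unions):
    """Overall IoU: total intersection area over total union area."""
    return sum(intersections) / sum(unions)
```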
Code in this repository is built on RMSIN. We'd like to thank the authors for open-sourcing their project.
