To reproduce all our results as reported below, you can use our pretrained model and our source code.

Method	All	Thing	Stuff	Single	Plural
MCN	54.2	48.6	61.4	56.6	38.8
PNG	55.4	56.2	54.3	56.2	48.8
EPNG	49.7	45.6	55.5	50.2	45.1
PPMN	59.4	57.2	62.5	60.0	54.0
XPNG	63.3	61.1	66.2	64.0	56.4

Environments

You need PyTorch >= 1.10.1. Set up the environment with the following commands:

conda create -n xpng python=3.8
conda activate xpng
conda install pytorch==1.10.1 torchvision==0.11.2 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install -r requirement.txt
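
As a quick sanity check after installation, the PyTorch version requirement can be verified from Python. This is a minimal sketch: `version_tuple` and `meets_minimum` are hypothetical helpers, not part of the XPNG code, and they assume version strings of the form `1.10.1` or `1.10.1+cu111`.

```python
import re


def version_tuple(version):
    """Turn '1.10.1+cu111' into (1, 10, 1), ignoring any local '+...' suffix."""
    core = version.split("+", 1)[0]
    parts = []
    for piece in core.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum="1.10.1"):
    """Return True if the installed version is at least the minimum."""
    return version_tuple(installed) >= version_tuple(minimum)
```

With PyTorch installed, `meets_minimum(torch.__version__)` is expected to return `True` inside the `xpng` environment. Comparing integer tuples rather than raw strings matters here, because `"1.9.0" < "1.10.1"` is false as a string comparison.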

After that, follow the detectron2 installation instructions to install detectron2 into the environment:

# Assuming the absolute path to the project's root directory is /xpng
cd /xpng
python -m pip install -e detectron2

Dataset

Download the 2017 MSCOCO Dataset from its official webpage. You will need the images and panoptic segmentation annotations for the train and validation splits. Download the Panoptic Narrative Grounding Benchmark from the PNG project webpage.

Organize the files as follows:

XPNG
|_ panoptic_narrative_grounding
   |_ images
   |  |_ train2017
   |  |_ val2017
   |_ annotations
   |  |_ png_coco_train2017.json
   |  |_ png_coco_val2017.json
   |  |_ panoptic_segmentation
   |  |  |_ train2017
   |  |  |_ val2017
   |  |_ panoptic_train2017.json
   |  |_ panoptic_val2017.json
|_ data
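
Before running the pre-processing step, it can help to confirm that the layout above is in place. The following is a minimal sketch, not part of the XPNG code: `missing_entries` is a hypothetical helper, and the dataset root passed to it is assumed to be the `panoptic_narrative_grounding` folder.

```python
from pathlib import Path

# Relative paths copied from the directory layout above.
EXPECTED = [
    "images/train2017",
    "images/val2017",
    "annotations/png_coco_train2017.json",
    "annotations/png_coco_val2017.json",
    "annotations/panoptic_segmentation/train2017",
    "annotations/panoptic_segmentation/val2017",
    "annotations/panoptic_train2017.json",
    "annotations/panoptic_val2017.json",
]


def missing_entries(dataset_root):
    """Return the expected files and folders that do not exist under dataset_root."""
    root = Path(dataset_root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]
```

For example, `missing_entries("/xpng/XPNG/panoptic_narrative_grounding")` returns an empty list when every entry is present; the path in this call is an assumption about where the folder lives on your machine.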

Pre-process the Panoptic Narrative Grounding ground-truth annotations for the dataloader using utils/pre_process.py. At the end of this step, you should have two new files in your annotations folder, png_coco_train2017_dataloader.json and png_coco_val2017_dataloader.json:

panoptic_narrative_grounding
|_ annotations
   |_ png_coco_train2017.json
   |_ png_coco_val2017.json
   |_ png_coco_train2017_dataloader.json
   |_ png_coco_val2017_dataloader.json
   |_ panoptic_segmentation
   |  |_ train2017
   |  |_ val2017
   |_ panoptic_train2017.json
   |_ panoptic_val2017.json
|_ images
   |_ train2017
   |_ val2017

Pretrained BERT Model and PFPN

The pre-trained checkpoints can be downloaded from here. The folder should be organized as follows:

pretrained_models
|_fpn
|  |_model_final_cafdb1.pkl
|_bert
|  |_bert-base-uncased
|  |  |_pytorch_model.bin
|  |  |_bert_config.json
|  |_bert-base-uncased.txt

Evaluate

Set args.test_only=True in main.py and set --ckpt_path to the path of the model's .pth parameter file, then run:
cd /xpng
CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --nproc_per_node=1 --nnodes=1 --master_port 29614 main.py

Train

Set args.test_only=False in main.py, then run:
cd /xpng
CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --nproc_per_node=2 --nnodes=1 --master_port 29615 main.py
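
In both the Evaluate and Train commands, the number of processes per node matches the number of GPUs listed in CUDA_VISIBLE_DEVICES (one GPU for evaluation, two for training). The sketch below makes that relationship explicit for other GPU counts. `build_launch` is a hypothetical helper, not part of the XPNG code, and it only assembles the command without running it.

```python
def build_launch(gpu_ids, master_port, script="main.py"):
    """Build the environment override and argument list for a single-node launch."""
    # One process is started per visible GPU.
    env = {"CUDA_VISIBLE_DEVICES": ",".join(str(g) for g in gpu_ids)}
    cmd = [
        "python", "-m", "torch.distributed.launch",
        f"--nproc_per_node={len(gpu_ids)}",
        "--nnodes=1",
        "--master_port", str(master_port),
        script,
    ]
    return env, cmd


env, cmd = build_launch([0, 1], 29615)
print(env["CUDA_VISIBLE_DEVICES"], " ".join(cmd))
```

The returned pair can be passed to `subprocess.run(cmd, env={**os.environ, **env}, cwd="/xpng")`, assuming the project root is /xpng as above. Using a different master port for each concurrent job, as the two commands above do, avoids port clashes on a shared machine.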

Acknowledgements

Parts of the code are built upon K-Net and PNG. We thank the authors for their great work!


TianyuGoGO/XPNG
