GAPS

Official implementation for CVPRW2023 Paper: GAPS: Few-Shot Incremental Semantic Segmentation via Guided Copy-Paste Synthesis

Ri-Zhao Qiu, Peiyi Chen, Wangzhe Sun, Yu-Xiong Wang, and Kris Hauser

[Paper] [Poster]

Preparation

Setup dependencies

conda create --name GAPS python=3.10
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.6 cudatoolkit-dev=11.6 -c pytorch -c conda-forge -c nvidia
pip install -r requirements.txt

Prepare ImageNet-Pretrained Models

Like many other few-shot/incremental/general segmentation works, GAPS is trained from ImageNet pretrained weights.

mkdir pretrained_model
cd pretrained_model
wget https://download.pytorch.org/models/resnet101-5d3b4d8f.pth

Prepare Dataset

By default, the dataset root is ./data. Alternatively, you can specify your own dataset root by either setting an environmental variable $DATASET_ROOT or linking the data folder. For more details you can refer to https://github.com/RogerQi/dl_codebase/blob/roger/submission/modules/utils/misc.py#L10.

Let's take ./data as an example. To start with, create the data folder.

mkdir data

Pascal-5ⁱ

Pascal segmentation datasets usually contain two sets of datasets - the original segmentation mask accompanying Pascal VOC 2012 semantic segmentation challenge, and a set of additional annotations supplemented by Berkeley SBD project.

Fortunately, torchvision has routines for conveniently downloading both of these two sets. The codebase contains code for automatically downloading these two datasets. You can run,

python3 main/train.py --cfg configs/fs_incremental/pascal5i_split0_5shot.yaml

and the Pascal-5i dataset will automatically be ready. You can expect to see the training process begin. Hit Ctrl+C to interrupt it.

COCO-20iⁱ

To use the COCO dataset, you need to manually obtain it.

cd data
mkdir COCO2017
# COCO2017 training images
wget http://images.cocodataset.org/zips/train2017.zip
unzip train2017.zip
# val images
wget http://images.cocodataset.org/zips/val2017.zip
unzip val2017.zip
# Stuff-Things semantic annotations map
wget http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/stuffthingmaps_trainval2017.zip
unzip stuffthingmaps_trainval2017.zip

To test if the COCO dataset is working, you can run

python3 main/train.py --cfg configs/fs_incremental/coco20i_split0_5shot.yaml

Running GAPS

As described in our paper, learning in GAPS are divided into two stages: base learning stage and incremental learning stage.

Base learning stage

# run from project root
cd GAPS
# Base training on Pascal-5i (note that 5-shot and 1-shot share same base weights)
python3 main/train.py --cfg configs/fs_incremental/pascal5i_split0_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/pascal5i_split1_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/pascal5i_split2_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/pascal5i_split3_5shot.yaml
# Base training on COCO-20i
python3 main/train.py --cfg configs/fs_incremental/coco20i_split0_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/coco20i_split1_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/coco20i_split2_5shot.yaml
python3 main/train.py --cfg configs/fs_incremental/coco20i_split3_5shot.yaml

Empirically, the entire base learning stage takes approximately 5 days on a machine with a single RTX 3090 GPU.

If you want to skip base learning, you can find weights trained from the base stage in the table below.

Dataset	Base weights
Pascal-5-0	box
Pascal-5-1	box
Pascal-5-2	box
Pascal-5-3	box
COCO-20-0	box
COCO-20-1	box
COCO-20-2	box
COCO-20-3	box

Incremental learning stage

Take the split-3 of the Pascal-5i dataset as an example. After the base learning stage, the codebase will generate a weight named GIFS_pascal_voc_split3_final.pt at the project root. To perform incremental learning and testing, the command line to be invoked is

python3 main/test.py --cfg configs/fs_incremental/pascal5i_split3_5shot.yaml --load GIFS_pascal_voc_split3_final.pt

and you should see the results. Note that the diversity-guided exemplar selection requires computation of prototype of every image in the base training stage, which requires roughly 15 minutes on the first time one runs incremental learning on a split.

Following PIFS, the reported base and novel IoU are averaged across results after each single incremental learning task, masking unseen classes. To obtain the results reported in the paper, you need to follow the same procedure.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
configs		configs
main		main
modules		modules
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GAPS

Preparation

Setup dependencies

Prepare ImageNet-Pretrained Models

Prepare Dataset

Pascal-5ⁱ

COCO-20iⁱ

Running GAPS

Base learning stage

Incremental learning stage

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GAPS

Preparation

Setup dependencies

Prepare ImageNet-Pretrained Models

Prepare Dataset

Pascal-5i

COCO-20ii

Running GAPS

Base learning stage

Incremental learning stage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Pascal-5ⁱ

COCO-20iⁱ

Packages