Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Qin, Guanyi; Wang, Ziyue; Shen, Daiyun; Liu, Haofeng; Zhou, Hantao; Wu, Junde; Hu, Runze; Jin, Yueming

Computer Science > Computer Vision and Pattern Recognition

arXiv:2507.18944 (cs)

[Submitted on 25 Jul 2025]

Title:Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Authors:Guanyi Qin, Ziyue Wang, Daiyun Shen, Haofeng Liu, Hantao Zhou, Junde Wu, Runze Hu, Yueming Jin

View PDF HTML (experimental)

Abstract:Given an object mask, Semi-supervised Video Object Segmentation (SVOS) technique aims to track and segment the object across video frames, serving as a fundamental task in computer vision. Although recent memory-based methods demonstrate potential, they often struggle with scenes involving occlusion, particularly in handling object interactions and high feature similarity. To address these issues and meet the real-time processing requirements of downstream applications, in this paper, we propose a novel bOundary Amendment video object Segmentation method with Inherent Structure refinement, hereby named OASIS. Specifically, a lightweight structure refinement module is proposed to enhance segmentation accuracy. With the fusion of rough edge priors captured by the Canny filter and stored object features, the module can generate an object-level structure map and refine the representations by highlighting boundary features. Evidential learning for uncertainty estimation is introduced to further address challenges in occluded regions. The proposed method, OASIS, maintains an efficient design, yet extensive experiments on challenging benchmarks demonstrate its superior performance and competitive inference speed compared to other state-of-the-art methods, i.e., achieving the F values of 91.6 (vs. 89.7 on DAVIS-17 validation set) and G values of 86.6 (vs. 86.2 on YouTubeVOS 2019 validation set) while maintaining a competitive speed of 48 FPS on DAVIS.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2507.18944 [cs.CV]
	(or arXiv:2507.18944v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2507.18944

Submission history

From: Guanyi Qin [view email]
[v1] Fri, 25 Jul 2025 04:30:23 UTC (3,491 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Submission history

Access Paper:

Current browse context:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Submission history

Access Paper:

Current browse context:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators