3D Implicit Transporter for Temporally Consistent Keypoint Discovery

Zhong, Chengliang; Zheng, Yuhang; Zheng, Yupeng; Zhao, Hao; Yi, Li; Mu, Xiaodong; Wang, Ling; Li, Pengfei; Zhou, Guyue; Yang, Chao; Zhang, Xinliang; Zhao, Jian

arXiv:2309.05098 (cs)

[Submitted on 10 Sep 2023]

Abstract: Keypoint-based representations have proven advantageous in various visual and robotic tasks. However, existing 2D and 3D keypoint detection methods mainly rely on geometric consistency to achieve spatial alignment, neglecting temporal consistency. To address this issue, the Transporter method was introduced for 2D data; it reconstructs the target frame from the source frame to incorporate both spatial and temporal information. However, directly applying the Transporter to 3D point clouds is infeasible because of their structural differences from 2D images. Thus, we propose the first 3D version of the Transporter, which leverages a hybrid 3D representation, cross attention, and implicit reconstruction. We apply this new learning system to 3D articulated objects and nonrigid animals (humans and rodents) and show that the learned keypoints are spatio-temporally consistent. Additionally, we propose a closed-loop control strategy that uses the learned keypoints for 3D object manipulation and demonstrate its superior performance. Code is available at this https URL.
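
For readers unfamiliar with the 2D Transporter mentioned in the abstract, the following is a minimal sketch of its feature-transport step as it is commonly described for the original 2D method: source features are suppressed around both the source and target keypoints, and target features are pasted in around the target keypoints, so the target frame can only be reconstructed well if the keypoints track what moved. This is not the 3D method proposed in this paper. The function names, the single-channel toy feature maps, and the Gaussian heatmap rendering are illustrative assumptions.

```python
import math


def gaussian_heatmap(height, width, keypoint, sigma=1.0):
    """Render one keypoint (row, col) as an isotropic Gaussian with peak value 1.

    Hypothetical helper: a learned keypoint network would normally produce this map.
    """
    kr, kc = keypoint
    return [
        [
            math.exp(-((r - kr) ** 2 + (c - kc) ** 2) / (2 * sigma ** 2))
            for c in range(width)
        ]
        for r in range(height)
    ]


def transport(src_feat, tgt_feat, src_heat, tgt_heat):
    """2D Transporter-style feature transport on single-channel maps.

    out = (1 - H_src) * (1 - H_tgt) * F_src + H_tgt * F_tgt
    """
    height, width = len(src_feat), len(src_feat[0])
    return [
        [
            (1 - src_heat[r][c]) * (1 - tgt_heat[r][c]) * src_feat[r][c]
            + tgt_heat[r][c] * tgt_feat[r][c]
            for c in range(width)
        ]
        for r in range(height)
    ]


# Toy data (assumed): constant feature maps so the effect is easy to inspect.
H, W = 5, 5
src_feat = [[1.0] * W for _ in range(H)]
tgt_feat = [[2.0] * W for _ in range(H)]
src_heat = gaussian_heatmap(H, W, (1, 1), sigma=0.5)  # keypoint in the source frame
tgt_heat = gaussian_heatmap(H, W, (3, 3), sigma=0.5)  # same keypoint in the target frame

transported = transport(src_feat, tgt_feat, src_heat, tgt_heat)
```

In this toy setup the transported map takes the target feature value at the target keypoint, is close to zero at the old source keypoint location, and keeps the source feature value far from both. A decoder would then reconstruct the target frame from such a map, and the reconstruction loss is what drives the keypoints to be temporally consistent.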

Comments:	ICCV 2023 oral paper
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.05098 [cs.CV]
	(or arXiv:2309.05098v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.05098

Submission history

From: Chengliang Zhong
[v1] Sun, 10 Sep 2023 17:59:48 UTC (20,486 KB)
