|
|
Object Pose Transformer: Unifying Unseen Object Pose Estimation
Weihang Li,
Lorenzo Garattoni,
Fabien Despinoy,
Nassir Navab,
Benjamin Busam
Computer Vision and Pattern Recognition Conference(CVPR), 2026
arXiv
/
Project Page
/
Code
A unified feed-forward transformer for unseen object pose estimation that jointly predicts depth, point maps, camera parameters, and NOCS to recover both absolute and relative poses.
|
|
|
HouseCat-TRICKY: HouseCat6D Object Pose Estimation Challenge with Specular and Transparent Objects
Weihang Li et al.
International Conference on Computer Vision (ICCVW), 2025
Paper
/
Challenge Page
/
Code
Our paper HouseCat-TRICKY has been accepted to ICCVW 2025. The benchmark and evaluation focus on challenging transparent and reflective object categories.
|
|
|
Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images
Wenzhao Tang*,
Weihang Li*,
Xiucheng Liang,
Olaf Wysocki,
Filip Biljecki,
Christoph Holst,
Boris Jutzi
Computer Vision and Pattern Recognition Conference Workshop on Urban Scene Modeling (CVPRW), 2025
arXiv
/
Project Page
/
Code
Texture2LoD3 proposes leveraging ubiquitous street-level images and low-level building models for accurate ortho-texturing (left):
Enabling accurate semantic segmentation (center) and facade-rich LoD3 reconstruction (right).
|
|
|
GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation
Weihang Li*,
Hongli Xu*,
Junwen Huang*,
HyunJun Jung*,
Peter KT Yu,
Nassir Navab,
Benjamin Busam
Computer Vision and Pattern Recognition Conference (CVPR), 2025
arXiv
/
Project Page
A semantic shape reconstruction module that recovers complete object geometry from partial observations with a
global context-enhanced feature fusion mechanism that leverages category-level semantic and shape priors for robust pose prediction
|
|
|
DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair
Weihang Li*,
Weirong Chen*,
Shenhan Qian,
Benjamin Busam,
Daniel Cremers,
Haoang Li
arXiv , 2024
arXiv
/
Project Page
/
Code
A novel method to achieve Gaussian splatting from an un-posed image pair in dynamic environments.
|
|
|
SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
HyunJun Jung,
Weihang Li,
Shun-Cheng Wu,
William Bittner,
Nikolas Brasch,
Jifei Song,
Eduardo Pérez-Pellitero,
Zhensong Zhang,
Arthur Moreau,
Nassir Navab,
Benjamin Busam
In Proceedings of the Neural Information Processing Systems (NeurIPS), 2024
arXiv
/
Project Page
/
Code
A framework to annotate accurate and dense 3d indoor scenes with a benchmark on novel view synthesis and SLAM
|
|
|
Knowledge-based Programming by Demonstration using semantic action models for industrial assembly
Junsheng Ding,
Haifan, Zhang,
Weihang Li,
Liangwei Zhou,
Alexander Perzylo
International Conference on Intelligent Robots and Systems (IROS), 2024
Paper
/
Project Page
/
Code
/
Video
A knowledge-based Programming by Demonstration (kb-PbD) paradigm to facilitate robot programming in small and medium-sized enterprises (SMEs).
|
- [06-2024] Our team received an Honorable Mention Award in the S23DR Challenge at CVPR 2024.
- [10-2025] Mentored student Haoliang to win 1st place in the WLCOP Challenge at ICCV 2025.
Teaching Assistant: Introduction to Machine Learning, TUM, 2024.
Teaching Assistant: Advanced Topics in 3D Computer Vision, TUM, 2025.
Teaching Assistant: Modern Computer Vision Methods, TUM, 2025.
|