Official implementation of our CVPR 2026 paper: HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
🔗 Project Page | 📄 Paper Link
- Release model and code (before April 10, 2026)
- Release training data (before May 1, 2026)
@inproceedings{liang2026hispatial,
title={HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models},
author={Liang, Huizhi and Shen, Yichao and Deng, Yu and Xu, Sicheng and Feng, Zhiyuan and Zhang, Tong and Liang, Yaobo and Yang, Jiaolong},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2026}
}