
Wei Li

weilivision@gmail.com
Google Scholar / GitHub

About me

I am a research fellow at MMLab@NTU, working with Prof. Chen Change Loy on computer vision, machine learning, and AI. Previously, I completed my Ph.D. in Computer Science at QMUL under the supervision of Prof. Shaogang Gong, where I also worked closely with Prof. Xiatian Zhu.


Quick links: Publications


News


Recent (All)

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Tianqi Liu, Zihao Huang, Zhaoxi Chen, Guangcong Wang, Shoukang Hu, Liao Shen, Huiqiang Sun, Zhiguo Cao, Wei Li, Ziwei Liu
ICCV 2025.
[Paper][Website][Code]
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li, Chen Change Loy
ICCV 2025.
[Paper][Website][Code]
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement
Liwen Xiao, Zhiyu Pan, Zhicheng Wang, Zhiguo Cao, Wei Li*
ICCV 2025 (Highlight).
[Paper][Website][Code]
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang*, Nan Song*, Wei Li*, Xiatian Zhu, Li Zhang, Philip HS Torr
TPAMI 2025.
[Paper][Website][Code]
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
Liao Shen, Tianqi Liu, Huiqiang Sun, Jiaqi Li, Zhiguo Cao, Wei Li, Chen Change Loy
CVPR 2025.
[Paper][Website][Code]
WildAvatar: Learning In-the-wild 3D Avatars from the Web
Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu
CVPR 2025.
[Paper][Website][Code]
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Wei Li, Chen Change Loy
CVPR 2025.
[Paper][Website][Code]
Generalizable Implicit Motion Modeling for Video Frame Interpolation
Zujin Guo, Wei Li, Chen Change Loy
NeurIPS 2024.
[Paper][Website][Code]
Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu
ECCV 2024.
[Paper][Website][Code]
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
IJCV 2024.
[Paper][Website][Code]
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
IJCV 2024.
[Paper][Website][Code]