Zhen Li 「李 祯」

I am a final-year master's student at Beijing Institute of Technology, supervised by Prof. Yuwei Wu and Prof. Yunde Jia. I am currently interning at the Shanda AI Research, working closely with Dr. Kaipeng Zhang and Dr. Bo Zheng. Prior to that, I obtained my Bachelor's degree from the Xuteli School at Beijing Institute of Technology in 2023.

My research interests lie at (1) the intersection of vision-and-language and compositional generalization, (2) video generation incorporating 3D priors.

Email  /  Google Scholar  /  Github

profile photo

Selected Publications

* indicates equal contribution. 📧 indicates corresponding author.

Image Yume1.5: A Text-Controlled Interactive World Generation Model
Xiaofeng Mao, Zhen Li, Chuanhao Li, Xiaojie Xu, Kaining Ying, Tong He, Jiangmiao Pang, Yu Qiao, Kaipeng Zhang📧,
Arxiv, 2025.12
[Project Page] /  [Paper] /  [Code]
Image Composition-Incremental Learning for Compositional Generalization
Zhen Li, Yuwei Wu, Chenchen Jing, Che Sun📧, Chuanhao Li📧, Yunde Jia
AAAI, 2026
[Paper] /  [Code]
Image Sekai: A Video Dataset towards World Exploration
Zhen Li, Chuanhao Li📧, Xiaofeng Mao, Shaoheng Lin, Ming Li, Shitian Zhao, Zhaopan Xu, Xinyue Li, Yukang Feng, Jianwen Sun, Zizhen Li, Fanrui Zhang, Jiaxin Ai, Zhixiang Wang, Yuwei Wu📧, Tong He, Jiangmiao Pang, Yu Qiao, Yunde Jia, Kaipeng Zhang📧
NeurIPS, 2025
[Project Page] /  [Paper] /  [Code]
Image Consistency of Compositional Generalization Across Multiple Levels
Chuanhao Li*, Zhen Li*, Chenchen Jing📧, Xiaomeng Fan, Wenbo Ye, Yuwei Wu📧, Yunde Jia
AAAI, 2025
[Paper] /  [Code]
Image Multi-Sourced Compositional Generalization in Visual Question Answering
Chuanhao Li*, Wenbo Ye*, Zhen Li, Yuwei Wu📧, Yunde Jia
IJCAI, 2025
[Paper] /  [Code]
Image Compositional Substitutivity of Visual Reasoning for Visual Question Answering
Chuanhao Li*, Zhen Li*, Chenchen Jing📧, Yuwei Wu📧, Mingliang Zhai, Yunde Jia
ECCV, 2024
[Paper] /  [Code]
Image SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge
Chuanhao Li, Zhen Li, Chenchen Jing, Shuo Liu, Wenqi Shao, Yuwei Wu📧, Ping Luo, Yu Qiao, Kaipeng Zhang📧
NeurIPS, 2024
[Project Page] /  [Paper] /  [Code]
Image In-Context Compositional Generalization for Large Vision-Language Models
Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu📧, Yunde Jia
EMNLP, 2024
[Paper]
Image Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language
Chuanhao Li, Zhen Li, Chenchen Jing📧, Yuwei Wu📧, Yunde Jia
CVPR, 2023
[Paper] /  [Code]

Experiences

shanghai_ai

Jan. 2026 - Present,

Research Intern, Shanda AI Research.

Mentor: Kaipeng Zhang

shanghai_ai

Feb. 2025 - Dec. 2025,

Research Intern, Shanghai Artificial Intellience Laboratory.

Mentor: Kaipeng Zhang

Copyright © Zhen Li 2020 | Last updated: Jan. 04, 2026 | Template from Jon Barron