Portrait of Yilun Chen

Yilun Chen

Research Scientist, Embodied AI Center at Shanghai AI Laboratory

I work on robotic foundation models, 3D vision, and embodied AI. My recent research aims to advance the frontiers of robotic foundation models.

Before joining Shanghai AI Laboratory, I completed my Ph.D. in the Department of Computer Science and Engineering at The Chinese University of Hong Kong, advised by Prof. Jiaya Jia.

  • Robotic Foundation Models
  • 3D Vision
  • Autonomous Driving

Open Positions

We are actively seeking motivated research fellows and interns with experience or strong interest in robotics and 3D vision. If you are interested in robotic foundation models, especially topics such as world models, 3D perception, embodied manipulation, and generalist robot learning, send me an email with your background, interests, and representative work.

Recent Highlights

Selected Publications

InternVLA-M1 preview

Technical Report, 2025

InternVLA-M1: A Spatially Grounded Foundation Framework for Generalist Robot Policy

InternVLA-M1 Team, Yilun Chen†

  • Dominated Hugging Face Robotics Trending with 6 of the top 8 models in September 2025.
Re3Sim preview

ICRA 2026

Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation

Xiaoshen Han, Minghuan Liu, Yilun Chen†, Junqiu Yu, Xiaoyang Lyu, Yang Tian, Bolun Wang, Weinan Zhang, Jiangmiao Pang

RoboInter preview

ICLR 2026

RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation

Hao Li*, Ziqin Wang*, Zi-han Ding, Shuai Yang, Yilun Chen†, Yang Tian, Xiaolin Hu, Tai Wang, Dahua Lin, Feng Zhao, Si Liu, Jiangmiao Pang

ST4VLA preview

ICLR 2026

ST4VLA: Spatially Guided Training for Vision-Language-Action Models

Jinhui Ye*, Fangjing Wang*, Ning Gao*, Junqiu Yu*, Yangkun Zhu, Bin Wang, Jinyu Zhang, Weiyang Jin, Yanwei Fu, Feng Zheng, Yilun Chen†, Jiangmiao Pang

InstructVLA preview

ICLR 2026

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Shuai Yang*, Hao Li*, Bin Wang, Yilun Chen†, Yang Tian, Tai Wang, Hanqing Wang, Feng Zhao, Yiyi Liao, Jiangmiao Pang†

X-VLA preview

ICLR 2026

X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model

Jinliang Zheng*, Jianxiong Li*, Zhihao Wang, Dongxiu Liu, Xirui Kang, Yuchun Feng, Yinan Zheng, Jiayin Zou, Yilun Chen, Jia Zeng, Ya-Qin Zhang, Jiangmiao Pang, Jingjing Liu, Tai Wang, Xianyuan Zhan

CronusVLA preview

AAAI 2026 Oral

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Hao Li*, Shuai Yang*, Yilun Chen†, Xinyi Chen, Xiaoda Yang, Yang Tian, Hanqing Wang, Tai Wang, Dahua Lin, Feng Zhao, Jiangmiao Pang†

GenManip preview

CVPR 2025

GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao*, Yilun Chen*, Shuai Yang*, Xinyi Chen*, Yang Tian, Hao Li, Haifeng Huang, Hanqing Wang, Tai Wang, Jiangmiao Pang

Chat-Scene preview

NeurIPS 2024

Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers

Haifeng Huang*, Yilun Chen*, Zehan Wang*, Rongjie Huang, Runsen Xu, Tai Wang, Yang Zhao, Jiangmiao Pang, Zhou Zhao

PointLLM preview

ECCV 2024 Oral

PointLLM: Empowering Large Language Models to Understand Point Clouds

Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang, Dahua Lin

  • ECCV 2024 Best Paper Candidate.

Experience

  • Shanghai AI Laboratory Research Scientist, Mar. 2023 - Present
  • NVIDIA Research Research Intern, Jun. 2022 - Feb. 2023

    Mentors: Zhiding Yu, Jose M. Alvarez

  • SmartMore Inc. Research Intern, Mar. 2020 - Jun. 2022

    Mentor: Shu Liu

  • Tencent Youtu Lab Research Intern, Mar. 2018 - Jan. 2020

    Mentor: Shu Liu

  • Megvii Face++ Research Intern, Nov. 2016 - Nov. 2017

    Mentor: Gang Yu

Education

  • The Chinese University of Hong Kong Ph.D., Computer Science and Engineering, 2018 - 2022
  • Beihang University Bachelor, Computer Science and Engineering, 2013 - 2017

Service

  • Conference Reviewer

    CVPR, ECCV, ICCV, ICLR, NeurIPS, ICML, CoRL, IROS, ICRA

  • Journal Reviewer

    T-PAMI, IJCV, RA-L

  • Teaching

    CSCI3310, CSCI3180, CSCI1120, ENGG1100