|
Chen Gao (高晨)
I'm currently a Research Fellow at LV-Lab, National University of Singapore, under the supervision of Prof. Shuichen Yan.
Prior to this, I spent a wonderful year as a Research Fellow at Show Lab, National University of Singapore, working with Prof. Mike Z. Shou.
I received the PhD from Beihang University, China, under the supervision of Prof. Si Liu. Also, I was a visting scholar at Peking University working with Prof. He Wang, focusing on the Embodied AI research. In the spare time, I also enjoy playing basketball.
gaochen.ai@gmail.com  / 
Google Scholar  / 
GitHub  / 
I am assisting Prof. Yan in recruiting Research Assistants and bachelor‘s,master’s and phd students. The primary research areas include Embodied AI, VLA/World Model-based Foundation Model, Visuo-Tactile for Contact-Rich Manipulation, etc.
While candidates with relevant research experience will be prioritized, we also welcome applications from undergraduate interns. Interested individuals are encouraged to contact me via email for further information.
我在协助颜老师招RA以及本科生、硕士/博士生,在Embodied AI, VLA/World Model-based Foundation Model, Visuo-Tactile方向,有相关科研经历者优先,同时欢迎实习生/访问生,感兴趣的同学请邮件联系我。
|
|
Recent News
|
|
Research Internship
[Sep. 2019 - Jan. 2020]: Research Intern at YITU, led by Shuicheng Yan
[Aug. 2020 - Mar. 2021]: Research Intern at SenseTime, led by Chen Qian
[Aug. 2022 - Jun. 2024]: Research Intern at Meituan autonomous vehicles, led by Beipeng Mu
|
|
Selected Publications (* Equal, # Corresponding)
|
|
OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation
Yuhang Zheng*,
Songen Gu*,
Weize Li,
Yupeng Zheng,
Yujie Zang,
Shuai Tian,
Xiang Li,
Ce Hao,
Chen Gao,
Si Liu,
Haoran Li,
Yilun Chen,
Shuicheng Yan#,
Wenchao Ding#,
[Paper]
[Code]
[Website]
[Dataset]
|
|
OctoNav: Towards Generalist Embodied Navigation
Chen Gao*,
Liankai Jin*,
Xingyu Peng*,
Jiazhao Zhang,
Yue Deng,
Annan Li,
He Wang,
Si Liu#
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2026.
[Paper]
[Code]
|
|
Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology
Luting Wang,
Yinghao Xiang,
Hongliang Huang,
Dongjun Li,
Chen Gao#,
Si Liu#
Advances in Neural Information Processing Systems. NeurIPS 2025.
[Paper]
[Code]
|
|
PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
Zhiwei Yang,
Chen Gao#,
Mike Zheng Shou#
Advances in Neural Information Processing Systems. NeurIPS 2025.
[Paper]
[Code]
|
|
RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation
Songhao Han*,
Boxiang Qiu*,
Yue Liao*,
Siyuan Huang,
Chen Gao,
Shuicheng Yan,
Si Liu#
Advances in Neural Information Processing Systems. NeurIPS 2025.
[Paper]
[Code]
|
|
Reinforcement Learning for Multimodal Foundation Models: A Survey
Weijia Wu,
Chen Gao,
Joya Chen,
Kevin Qinghong Lin,
Qingwei Meng,
Yiming Zhang,
Yuke Qiu,
Hong Zhou,
Mike Zheng Shou#
[Paper]
[Code]
|
|
Diffusion Models in Robotics: A Survey
Xiaokang Liu,
Yuchen Ma,
Chen Gao,
Mike Zheng Shou#
[Paper]
[Code]
|
|
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng,
Yan Bai,
Chen Gao,
Lirong Yang,
Fei Xia,
Beipeng Mu,
Xiaofei Wang,
Si Liu#
European Conference on Computer Vision. ECCV 2024.
[Paper]
[Code]
|
|
Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection
Jiahui Fu,
Chen Gao#,
Zitian Wang,
Lirong Yang,
Xiaofei Wang,
Beipeng Mu,
Si Liu
IEEE International Conference on Robotics and Automation. ICRA 2024.
[Paper]
[Code]
|
|
Room-Object Entity Prompting and Reasoning for Embodied Referring Expression
Chen Gao,
Si Liu#,
Jinyu Chen,
Luting Wang,
Qi Wu,
Bo Li,
Qi Tian
IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2024.
[Paper]
|
|
Towards Vehicle-to-everything Autonomous Driving: A Survey on Collaborative Perception
Si Liu#,
Chen Gao,
Yuan Chen,
Xingyu Peng,
Xianghao Kong,
Kun Wang,
Runsheng Xu,
Wentao Jiang,
Hao Xiang,
Jiaqi Ma,
Miao Wang
[Paper]
[Code]
|
|
Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation
Chen Gao,
Xingyu Peng,
Mi Yan,
He Wang,
Lirong Yang,
Haibing Ren,
Hongsheng Li,
Si Liu#,
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2023.
[Paper]
[Code]
|
|
Target-Driven Structured Transformer Planner for Vision-Language Navigation
Yusheng Zhao*,
Jinyu Chen*,
Chen Gao,
Wenguan Wang,
Lirong Yang,
Haibin Ren,
Huaxia Xia,
Si Liu#,
ACM International Conference on Multimedia. ACM MM 2022.
(Oral Presentation)
[Paper]
[Code]
|
|
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
Junyu Luo*,
Jiahui Fu*,
Xianghao Kong,
Chen Gao#,
Haibing Ren,
Hao Shen,
Huaxia Xia,
Si Liu,
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2022.
(Oral Presentation)
[Paper]
[Code]
|
|
Reinforced Structured State-Evolution for Vision-Language Navigation
Jinyu Chen,
Chen Gao,
Erli Meng,
Qiong Zhang,
Si Liu#
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2022.
[Paper]
[Code]
|
|
PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal
Si Liu,
Wentao Jiang,
Chen Gao,
Ran He,
Jiashi Feng,
Bo Li,
Shuicheng Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2021.
[Paper]
[Code & Dataset]
|
|
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression
Chen Gao,
Jinyu Chen,
Si Liu#,
Luting Wang,
Qiong Zhang,
Qi Wu
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021.
(Oral Presentation)
[Paper]
[Code]
|
|
AdversarialNAS: Adversarial Neural Architecture Search for GANs
Chen Gao,
Yunpeng Chen,
Si Liu#,
Zhenxiong Tan,
Shuicheng Yan
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2020.
[Paper]
[Code]
|
|
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
Wentao Jiang,
Si Liu#,
Chen Gao,
Jie Cao,
Ran He,
Jiashi Feng,
Shuicheng Yan
IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2020.
(Oral Presentation)
[Paper]
[Code & Dataset]
|
|
InteractGAN: Learning to Generate Human-Object Interaction
Chen Gao,
Si Liu#,
Defa Zhu,
Quan Liu,
Jie Cao,
Haoqian He,
Ran He,
Shuicheng Yan
ACM International Conference on Multimedia. ACM MM 2020.
(Oral Presentation)
[Paper]
[Project]
|
|
Attentive Transfer and Layout Graph Reasoning for Free-wheeling Portrait Recapturing
Chen Gao,
Si Liu,
Ran He,
Shuicheng Yan
arXiv preprint arXiv:2006.01435.
[Paper]
|
|
Academic Services
Conference Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICLR, AAAI, ACM MM, ICRA, etc.
Journal Reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), IEEE Transactions on Image Processing (TIP), IEEE Transactions on Multimedia (TMM), IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), IEEE Transaction on Cybernetics, IEEE Transactions on Neural Networks and Learning Systems (TNNLS), IEEE Transactions on Signal and Information Processing over Networks, Multimedia Tools and Applications, Neurocomputing, Transactions on Machine Learning Research (TMLR).
|
|