Biography
I am a Member of Technical Staff at xAI , working on the Omni Model Pretraining and Visual Generation. Before that, I was a Research Scientist at NVIDIA Research . I obtained Bachelor's degree at Zhejiang University in 2021, and Ph.D. degree at The Chinese University of Hong Kong (CUHK), Multi-Media Laboratory (MMLab) in 2025.
My research interest lies in Multi-Modal AI and Foundation Generative Models, with focus on GenAI pre/post-training, visual tokenizers, and their applications. I was the core contributor to NVIDIA Cosmos series
, an ensemble of open-source world foundation models including visual tokenizers, image / video foundation models, VLMs, and their post-trained variants.
News
[12/2025] Join xAI as Member of Technical Staff, working on Omni and Visual Generation models. Stay tuned for our release!
[11/2025] We release Cosmos 2.5 , an improved version of world foundation model. Check our Github and technical report .
[09/2025] One paper is accepted to NeurIPS 2025.
[07/2025] One paper is accepted to ICCV 2025 with oral presentation.
[06/2025] Pass my Ph.D. defense and become Dr. Liu officially!
[06/2025] We release Cosmos-Predict2 , a world foundation model with improved quality. Models open-sourced on Github and HF .
[03/2025] We release Cosmos-Transfer1 , a world model with multi-modal controllability. Models open-sourced on Github and HF .
[02/2025] Two papers are accepted to CVPR 2025.
[01/2025] Cosmos won the Best of CES, Best of AI, and Best Overall Awards in CNET 2025!
[01/2025] We release Cosmos , a world foundation model platform for Physical AI. Models open-sourced on Github and HF .
[01/2025] Four papers are accepted to ICLR 2025.
[12/2024] One paper is accepted to AAAI 2025.
[11/2024] We release Cosmos-Tokenizer , a suite of SOTA image/video tokenizers with models available on Github and HF .
[09/2024] Honored to receive ECCV 2024 Outstanding Reviewer Award . Great thanks for the recognition!
[07/2024] Two papers are accepted to ECCV 2024.
[06/2024] Join NVIDIA Research as full-time research scientist, building large-scale foundation models. Stay tuned for our release!
[05/2024] One paper is accepted to ICML 2024.
[03/2024] Start my internship at NVIDIA Research. See you in Santa Clara!
[03/2024] Two papers are accepted to CVPR 2024, with HumanGaussian accepted as Highlight (Top 2.8%). See you in Seattle!
[01/2024] One paper is accepted to ICLR 2024, with HyperHuman receiving review score of 6, 6, 8, 10 (Top 1.6%, Rank ).
[01/2024] I will intern at GenAI Team @ Meta AI Research in 2024 Fall. See you in Menlo Park!
[11/2023] I will intern at Deep Imagination Research @ NVIDIA Research in 2024 Spring with Ming-Yu Liu . See you in Santa Clara!
[11/2023] A high-quality 3D human generation framework HumanGaussian is released, with all the code and models available!
[10/2023] A hyper-realistic human generation foundation model HyperHuman collaborated with Snap Research is on arXiv!
[07/2023] One paper is accepted to ICCV 2023.
[05/2023] Start my internship at Snap Research. See you in Los Angeles!
[03/2023] Two papers are accepted to CVPR 2023.
[03/2023] One paper is accepted to TMLR 2023.
[09/2022] One paper is accepted to NeurIPS 2022, with ANGIE accepted as Spotlight (Top 5%)!
[07/2022] Three papers are accepted to ECCV 2022, with SSP-NeRF accepted as Oral (Top 2.7%)!
[03/2022] One paper is accepted to CVPR 2022.
[12/2021] One paper is accepted to AAAI 2022.
[Show more]
Industrial Research
Cosmos: World Foundation Model Platform for Physical AI
Contributions: Auto-Regressive Foundation Model Pre-Training & Post-Training. (CES'25 Best of AI, Best Overall)
Cosmos 2.5: Improved World Simulation with Video Foundation Models for Physical AI
Contributions: Data Processing Pipelines, Captioning, Long Video Generation, Evaluation, Transfer Post-training.
Cosmos Tokenizer: A Suite of Image and Video Neural Tokenizers
Contributions: Continuous/Discrete Image/Video Tokenizers.
Cosmos-Transfer: World Generation with Adaptive Multimodal Control
Contributions: Adaptive Multi-Modal Control, Data Processing Pipelines, Open-Source Repo.
Selected Publications [ Full List ] (* indicates equal contribution)
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR ), 2024. (Highlight, Top 2.8%)
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
International Conference on Learning Representations (
ICLR ), 2024.
(Review Score 6, 6, 8, 10, Top 1.6%, Rank )
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
European Conference on Computer Vision (ECCV ), 2022. (Oral, Top 2.7%)
Audio-Driven Co-Speech Gesture Video Generation
Advances in Neural Information Processing Systems (NeurIPS ), 2022. (Spotlight, Top 5%)
HMAR: Efficient Hierarchical Masked AutoRegressive Image Generation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR ), 2025.
TC4D: Trajectory-Conditioned Text-to-4D Generation
Sherwin Bahmani* ,
Xian Liu* ,
Yifan Wang* ,
Ivan Skorokhodov ,
Victor Rong ,
Ziwei Liu ,
Xihui Liu ,
Jeong Joon Park ,
Sergey Tulyakov ,
Gordon Wetzstein ,
Andrea Tagliasacchi ,
David B. Lindell .
European Conference on Computer Vision (ECCV ), 2024.
Working Experiences
Member of Technical Staff.
Dec. 2025 - Now
xAI, Omni Pretrain Team.
Research Scientist.
Jun. 2024 - Dec. 2025
NVIDIA Research, Cosmos Team.
Internship Experiences
Generative AI Research Intern.
Mar. 2024 - Jun. 2024
NVIDIA Research, Deep Imagination Group.
Research Visiting Student.
Dec. 2023 - Mar. 2024
UofT, Toronto Computational Imaging Group.
Research Intern.
Sept. 2023 - Dec. 2023
Tencent AI Laboratory.
Research Intern.
May. 2023 - Sept. 2023
Snap Research, Creative Vision Group.
Research Intern.
Jul. 2021 - Feb. 2022
Shanghai AI Lab, Digital Content Group.
Research Intern.
Aug. 2020 - Jun. 2021
SenseTime Research, Intelligent Video Group.
Invited Talks
Professional Services
Area Chair / Senior Program Committee: AAAI.
Conference Reviewer: CVPR, ECCV, ICCV, WACV, SIGGRAPH, SIGGRAPH Asia, NeurIPS, ICML, ICLR, AISTATS, AAAI, ACM MM.
Journal Reviewer: TPAMI, IJCV, TVCG, TIP, TMM, EG, CGF, PG.
Selected Honors & Awards
CNET 2025 Best of CES, Best of AI, and Best Overall. 2025
ECCV Outstanding Reviewer Award. 2024
CVPR Travel Award. 2024
ICLR Travel Award. 2024
National Scholarship. 2019, 2020
Hong Kong Ph.D. Fellowship Scheme (HKPFS). 2021- 2025
Outstanding Graduate of Zhejiang Province. 2021
Outstanding Bachelor Thesis Award of Zhejiang University, Top 1%. 2021
UCLA CSST Scholarship Program. 2020
SenseTime Scholarship. 2020
Tang Lixin Scholarship. 2019
First Class Scholarship for Academic Excellence. 2019, 2020
Teaching Experience
ENGG 1120, Linear Algebra for Engineers. Spring 2022.
ENGG 2440, Discrete Mathematics for Engineers. Fall 2021.