profile photo

Hang Gao

I am a member of technical staff at xAI, working on multimodal generative models.

I did my PhD at UC Berkeley advised by Angjoo Kanazawa. Before that, I studied at Jiao Tong University and Columbia.

Past projects selected → all
Image
Grok Imagine 0.9
Imagine team at xAI, worked on post-training, data and eval
Product Release, 2025

We made and rolled out Grok's first generation of video models in 3 months from scratch.

Image
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Jensen (Jinghao) Zhou*, Hang Gao*, Vikram Voleti, Aaryaman Vasishta, Chun-Han Yao, Mark Boss, Philip Torr, Christrian Rupprecht, Varun Jampani
ICCV, 2025
project page / arXiv / code

We build a multi-view diffusion model for view synthesis from as few as a single image.

Image
SOAR: Self-Occluded Avatar Recovery from a Single Video
Zhuoyang Pan*, Angjoo Kanazawa, Hang Gao*
arXiv, 2024
project page / arXiv / code

We recover complete human avatars from self-occluded videos in the wild.

Image
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang*, Vickie Ye*, Hang Gao*, Jake Austin, Zhengqi Li, Angjoo Kanazawa
ICCV, 2025   (Spotlight)
project page / arXiv / code

We propose a method for joint 4D reconstruction and 3D tracking on internet footages.

Image
NerfAcc: Efficient Sampling Accelerates NeRFs
Ruilong Li, Hang Gao, Matthew Tancik, Angjoo Kanazawa
ICCV, 2023
project page / arXiv / code

We build and release a toolbox for accelerating all kinds of NeRFs by efficient sampling.

Image
Monocular Dynamic View Synthesis: A Reality Check
Hang Gao, Ruilong Li, Shubham Tulsiani, Bryan Russell, Angjoo Kanazawa
NeurIPS, 2022
project page / arXiv / code

We systematically show why and how existing 4D reconstruction methods don't work well in the wild.

Image
Image
Long-term Human Motion Prediction with Scene Context
Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik
ECCV, 2020   (Oral Presentation)
project page / arXiv / code

We predict long-term, diverse human motion in 3D by understanding scene context from an image.

Image
Image
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation
Hang Gao*, Xizhou Zhu*, Steve Lin, Jifeng Dai
ICLR, 2020
project page / arXiv / code

We propose a new convolutional operator that does attention in the kernel space.

Image
Spatio-Temporal Action Graph Networks
Roei Herzig*, Elad Levi*, Huijuan Xu*, Hang Gao, Eli Brosh, Xiaolong Wang, Amir Globerson, Trevor Darrell
ICCV Workshop, 2019
arXiv

We find relational graph useful for action recognition.

Image
Disentangling Propagation and Generation for Video Prediction
Hang Gao*, Huazhe Xu, Qi-Zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell
ICCV, 2019
arXiv

We make a system that propagates seen parts and generates unseen ones.

Image
Image
Low-shot Learning via Covariance-Preserving Adversarial Augmentation Networks
Hang Gao, Zheng Shou, Alireza Zareian, Hanwang Zhang, Shih-Fu Chang
NeurIPS, 2018
arXiv

We learn feature augmentation for low-shot classifiers.

Image
AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos
Zheng Shou, Hang Gao, Lei Zhang, Kazuyuki Miyazawa, Shih-Fu Chang
ECCV, 2018
arXiv / code

We propose a weakly-supervised method for temporal action localization.

Image
ER: Early Recognition of Inattentive Driving Events Leveraging Audio Devices on Smartphones
Xiangyu Xu, Hang Gao, Jiadi Yu, Yingying Chen, Yanmin Zhu, Guangtao Xue, Minglu Li
INFOCOM, 2017
IEEE

We build an audio-based mobile app for inattentive driving detection.


Thanks Jon!
Last updated October 2025.