Image

 Ruizhi Shao (邵睿智)

  Senior Research Scientist

  Rhoda AI

  Palo Alto, CA, United States

  Email:jia1saurus@gmail.com

  CVGoogle ScholarGitHubTwitter


I am a Senior Research Scientist working at Rhoda AI. I received my Ph.D. from Tsinghua University in April 2025, where I was advised by Prof. Yebin Liu. During my doctoral studies, I was also fortunate to visit Stanford University as a visiting scholar, working with Prof. Gordon Wetzstein. My research spans computer vision, machine learning, and robotics, with a focus on video diffusion model and 3D generation. I am particularly interested in building Video World Models that can understand and interact with the physical world, with the long-term goal of bringing AI into real-world robotic systems.


News

Projects

threestudio

ThreeStudio: A Unified Framework For 3D Content Creation
Yuan-Chen Guo, Ying-Tian Liu, Ruizhi Shao, Christian Laforte, Vikram Voleti, Guan Luo, Chia-Hao Chen, Zi-Xin Zou, Chen Wang, Yan-Pei Cao and Song-Hai Zhang.
Project

Publications and Manuscripts

DevilSight

DevilSight: Augmenting Monocular Human Avatar Reconstruction through a Virtual Perspective
Yushuo Chen, Ruizhi Shao, Youxin Pang, Hongwen Zhang, Xinyi Wu, Rihui Wu, Yebin Liu.
International Conference on 3D Vision (3DV), 2026.
Paper |

DreamCraft3D++

DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model
Jingxiang Sun, Cheng Peng, Ruizhi Shao, Yuan-Chen Guo, Xiaochen Zhao, Yangguang Li, Yanpei Cao, Bo Zhang, Yebin Liu.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025.
Paper | Project |

Ins-HOI

Ins-HOI: Instance Aware Human-Object Interactions Recovery
Jiajun Zhang, Yuxiang Zhang, Hongwen Zhang, Xiao Zhou, Boyao Zhou, Ruizhi Shao, Zonghai Hu, Yebin Liu.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025.
Paper | Project | Code |

SViMo

SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-Object Interaction Scenarios
Lingwei Dang, Ruizhi Shao, Hongwen Zhang, Wei Min, Yebin Liu, Qingyao Wu.
Conference on Neural Information Processing Systems (NeurIPS), 2025. Spotlight.
Paper | Project | Code |

ISA4D

ISA4D: Interspatial Attention for Efficient 4D Human Video Generation
Ruizhi Shao, Yinghao Xu, Yujun Shen, Ceyuan Yang, Yang Zheng, Changan Chen, Yebin Liu, Gordon Wetzstein.
ACM Transactions on Graphics (Proc. SIGGRAPH), 2025.
Paper | arXiv | Project |

ManiVideo

ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping
Youxin Pang, Ruizhi Shao, Jiajun Zhang, Hanzhang Tu, Yun Liu, Boyao Zhou, Hongwen Zhang, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Paper | arXiv | Code |

Language of Motion

The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Changan Chen*, Juze Zhang*, Shrinidhi Kowshika Lakshmikanth*, Yusu Fang, Ruizhi Shao, Gordon Wetzstein, Li Fei-Fei, Ehsan Adeli. (* equal contribution)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Paper | Project | Code |

GPS-Gaussian+

GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views
Boyao Zhou, Shunyuan Zheng, Hanzhang Tu, Ruizhi Shao, Boning Liu, Shengping Zhang, Liqiang Nie, Yebin Liu.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025.
Paper | Project |

Human4DiT

Human4DiT: 360-Degree Human Video Generation with 4D Diffusion Transformer
Ruizhi Shao*, Youxin Pang*, Zerong Zheng, Jingxiang Sun, Yebin Liu. (* equal contribution)
ACM Transactions on Graphics (Proc. SIGGRAPH Asia), 2024.
Paper | Project | Code |

HumanCoSer

HumanCoSer: Layered 3D Human Generation via Semantic-Aware Diffusion Model
Yi Wang, Jian Ma, Ruizhi Shao, Qiao Feng, Yu-Kun Lai, Yebin Liu, Kun Li.
IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2024.
Paper |

Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras
Hanzhang Tu, Ruizhi Shao, Xue Dong, Shunyuan Zheng, Hao Zhang, Lili Chen, Meili Wang, Wenyu Li, Siyan Ma, Shengping Zhang, Boyao Zhou, Yebin Liu
SIGGRAPH , 2024.
Paper | Project |

Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.
Paper | Project |

humannorm

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation
Xin Huang*, Ruizhi Shao*, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang. (* equal contribution)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.
Paper | Project |

holihand

HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models
Mengcheng Li, Hongwen Zhang, Yuxiang Zhang, Ruizhi Shao, Tao Yu, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR Highlight) , 2024.
Paper |

gpsgaussian

Gps-gaussian: Generalizable pixelwise 3d gaussian splatting for real-time human novel view synthesis
Shunyuan Zheng, Boyao Zhou, Ruizhi Shao, Boning Liu, Shengping Zhang, Liqiang Nie, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR Highlight) , 2024.
Paper | Project | Code |

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Jingxiang Sun, Bo Zhang, Ruizhi Shao, Lizhen Wang, Wen Liu, Zhenda Xie, Yebin Liu
International Conference on Learning Representations (ICLR), 2024.
Paper | Project |

tensor4d

Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering
Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR Highlight), 2023.
Paper | Project | Code | Data |

closet

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition
Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Paper | Project |

HDHuman

HDHuman: High-quality Human Novel-view Rendering from Sparse Views
Tiansong Zhou, Jing Huang, Tao Yu, Ruizhi Shao, Kun Li.
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023.
Paper |

floren

FloRen: Real-time High-quality Human Performance Rendering via Appearance Flow Using Sparse RGB Cameras
Ruizhi Shao, Liliang Chen, Zerong Zheng, Hongwen Zhang, Yuxiang Zhang, Han Huang, Yebin Liu.
SIGGRAPH Asia, 2022.
Paper |

diffustereo

DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras
Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu.
European Conference on Computer Vision (ECCV ORAL), 2022.
Paper | Project | Code | Data |

FITE

Learning Implicit Templates for Point-Based Clothed Human Modeling
Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu.
European Conference on Computer Vision (ECCV), 2022.
Paper | Project | Code |

DoubleField

DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering
Ruizhi Shao, Hongwen Zhang, He Zhang, Mingjia Chen, Yanpei Cao, Tao Yu, Yebin Liu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
Paper | Project | Code |

LocalTrans

LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation
Ruizhi Shao*, Gaochang Wu*, Yuemei Zhou, Ying Fu, Lu Fang, Yebin Liu (* equal contribution)
International Conference on Computer Vision (ICCV), 2021.
Paper | Project | Code |

DeepMultiCap

DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras
Yang Zheng*, Ruizhi Shao*, Yuxiang Zhang, Tao Yu, Zerong Zheng, Qionghai Dai, Yebin Liu (* equal contribution)
International Conference on Computer Vision (ICCV), 2021.
Paper | Project | Code | Dataset


Honors and Awards