Hey there! I am a PhD student at Stanford University working in robotics, advised by Prof. Shuran Song.
π Last summer, I interned at Boston Dynamics, working on dexterous manipulation for the electric Atlas!
Previously,
I got my MS in Robotics from CMU Robotics Institute, where I was fortunate to work with David Held and Martial Hebert on object-centric 3D representations for precise manipulation.
I develop robotic systems that learn high-precision, contact-rich manipulation from human demonstrations and generalize to novel visual and behavioral settings with minimal adaptation.
In-the-Wild Compliant Manipulation with UMI-FT
Hojung Choi*, Yifan Hou*, Chuer Pan, Seongheon Hong, Austin Patel, Xiaomeng Xu, Mark Cutkosky, Shuran Song
International Conference on Robotics and Automation (ICRA), 2026 paper /
website /
UMI-FT is a handheld data-collection platform that integrates compact, finger-level force/torque sensors with multimodal vision data to facilitate force-aware policy learning. This system enables the training of adaptive compliance policies that effectively regulate contact forces during complex manipulation tasks, such as wiping or skewering.
UMI-on-Air: Embodiment-Aware Guidance for Embodiment-Agnostic Visuomotor Policies
Harsh Gupta, Xiaofeng Guo, Huy Ha, Chuer Pan, Muqing Cao, Dongjae Lee, Sebastian Scherer, Shuran Song, Guanya Shi
International Conference on Robotics and Automation (ICRA), 2026 paper /
website /
UMI-on-Air, a framework for embodiment-aware deployment of embodiment-agnostic manipulation policies. We propose Embodiment-Aware Diffusion Policy, which integrates gradient feedback from an embodiment-specific low-level controller into an embodiment-agnostic high-level diffusion policyβs sampling process, steering trajectory generation toward dynamically feasible solutions for the deployment robot at test time.
One Demo is Worth a Thousand Trajectories: Action-View Augmentation for Visuomotor Policies
Chuer Pan, Litian Liang, Dominik Bauer, Eric Cousineau, Benjamin Burchfiel, Siyuan Feng, Shuran Song
Conference on Robot Learning (CoRL), 2025 paper /
website /
1001-Demos is an effective data augmentation framework for robot manipulation, that scales single fisheye, eye-in-hand, obstacle-free, human demonstrations into action-view consistent demos with visually realistic observation & physically feasible action trajectories. By leveraging a novel fisheye-adapted Gaussian Splatting formulation and trajectory optimization, it generates deployable smooth, collision-free demos from novel viewpoints with added obstacles β making robot manipulation policies more robust to unseen configurations and learn to plan around obstacles.
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robot
Cheng Chi*, Zhenjia Xu*, Chuer Pan, Eric Cousineau, Ben Burchfiel, Siyuan Feng, Russ Tedrake, Shuran Song
Robotics: Science and Systems (RSS), 2024 β Best Systems Paper Award Finalist, RSS 2024 β paper /
website /
We propose UMI, a portable data collection and policy learning framework that allows direct skill transfer from in-the-wild human demonstrations to deployable robot policies, allowing zero-shot generalizable dynamic, bimanual, precise, and long-horizon robot policies.
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Open X-Embodiment Collaboration
International Conference on Robotics and Automation (ICRA), 2024 β Best Conference Paper Award, ICRA 2024 β paper /
website /
We introduce the largest robot learning dataset to date with 1M+ real robot trajectories, spanning 22 robot embodiments. We train large, transformer-based policies on the dataset and show that co-training with our diverse dataset substantially improves performance.
TAX-Pose: Task-Specific Cross-Pose Estimation for Robot Manipulation
Chuer Pan*, Brian Okorn*, Harry Zhang*, Ben Eisner*, David Held
Conference on Robot Learning (CoRL), 2022 paper /
website /
We propose a vision-based system that learns to estimate the task specific pose relationship (cross-pose) between pairs of interacting object using learned cross-object correspondences. We demonstrate that our method is able to learn from just 10 real point cloud demonstration with no pose annotations needed and generalize to novel instances within the trained object category.
Deep Projective Rotation Estimation through Relative Supervision
Brian Okorn*, Chuer Pan*, Martial Herbert, David Held
Conference on Robot Learning (CoRL), 2022 paper /
website /
We propose a new algorithm for self-supervised orientation estimation which utilizes Modified Rodrigues Parameters to stereographically project the closed manifold of SO(3) to the open manifold of 3D Euclidean space, which avoids the local optima common when naively applying relative self-supervision for object orientation estimation, allowing for faster convergence and lower rotational error on relative rotation estimation.
Examining the rhetorical capacities of neural language models
Zining Zhu, Chuer Pan, Mohamed Abdalla, Frank Rudzicz
Conference on Empirical Methods in Natural Language Processing (EMNLP) BlackboxNLP Workshop, 2020 paper /
video /
We propose a method that quantitatively evaluates the rhetorical capacities of neural language models (LMs) by evaluating their abilities to encode a set of linguistic features derived from Rhetorical Structure Theory (RST). Our experiments show that BERT-based LMs outperform other Transformer LMs such as GPT-2 and XLNet, revealing richer discourse knowledge in their intermediate layer representations.
An exploration of the reflow technique for the fabrication of an in vitro microvascular system to study occlusive clots
Yang Li, Chuer Pan, Yunfeng Li, Eugenia Kumacheva, Arun Ramachandran
Biomedical Microdevices, 19(4), 1-16, 2017 paper /
We introduce a reflow technique for fabrication of multi-level microchannel network with circular cross-section to systematically study the dissolution effects of thrombolytic drug on occlusive embolic clots in microvascular system.