Haiyang Xu

USTC
B.S.
2020 - 2024
UC San Diego
Ph.D. student
2024 - Present

I am a second-year Ph.D. student at UCSD, advised by Zhuowen Tu. My current research focus is on generative models, especially on topics such as controllable image, video, vector, and 3D generation.

Previously, I was a Research Intern at Adobe in 2025 Summer, working with Dr. Zhaowen Wang, Dr. Li-Yi Wei, Dr. Nanxuan Zhao and Dr. Cuong Nguyen; at NYU Courant in 2024 Fall, working with Prof. Saining Xie; and at Baidu in 2022 Summer, working with Dr. Dongliang He and Dr. Jingdong Wang. During my undergraduate, I was fortunate to work with Prof. Xiangnan He and Dr. Shuo Wang.

I am currently looking for internship positions in 2026 Spring. Feel free to contact me if you have any opportunities!

alternative

Experience

USTC
Research Intern
Jan 2022 - Jun 2023
Advised by: Prof. Xiangnan He and Dr. Shuo Wang
Baidu
Research Intern
Jul 2022 - Nov 2022
Advised by: Dr. Dongliang He and Dr. Jingdong Wang
UC San Diego
Research Intern
Jul 2023 - Nov 2023
Advised by: Prof. Zhuowen Tu
NYU Courant
Research Intern
Jan 2024 - Nov 2024
Advised by: Prof. Saining Xie
Adobe
Research Intern
Jun 2025 - Nov 2025
Advised by: Dr. Zhaowen Wang, Dr. Li-Yi Wei, Dr. Nanxuan Zhao and Dr. Cuong Nguyen
Adobe
Research Intern
2026 (Incoming)
Advised by: Dr. Mingze Xu and Dr. Yuanjun Xiong

Latest News

New! Feb 2026 My Adobe internship work is accepted to CVPR 2026. Thanks to all my mentors and collaborators!
Jan 2026 One paper accepted to ICLR 2026. Congrats to Enxin!

Publications (* equal contribution, † project leader)

Image

SemLayer: Semantic Generative Segmentation and Layer Reconstruction for Abstract Icons

CVPR 2026

Haiyang Xu, Ronghuan Wu, Li-Yi Wei, Nanxuan Zhao, Chenxi Liu, Cuong Nguyen, Zhuowen Tu, Zhaowen Wang

  Website   PDF   Code
Image

VideoNSA: Native Sparse Attention Scales Video Understanding

ICLR 2026

Enxin Song, Wenhao Chai, Shusheng Yang, Ethan Armand, Xiaojun Shan, Haiyang Xu, Jianwen Xie, Zhuowen Tu

  Website   PDF   Code
Image

CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning

WACV 2026

Zeyuan Chen, Xiang Zhang, Haiyang Xu, Jianwen Xie, Zhuowen Tu

  Website   PDF   Code

Reviewer Services