I am a PhD student in the Department of Computer Science at the University of Rochester, advised by Prof. Chenliang Xu. I received my B.Eng. from CS Department, Huazhong University of Science and Technology in 2022. In my undergrad, I worked with Prof. Kun He and Mr. Xiaosen Wang at HUST on adversarial machine learning. I also work closely with Prof. Yijie Peng at PKU on gradient estimation (Zeroth-Order optimization), Prof. Xiao-Yang Liu at RPI/Columbia on high-performance quantum and tensor computation, and Dr. Xiaodong Liu at Microsoft Research on efficient LLM. I am good at playing Erhu and familiar with Violin. Welcome to reach out to chat:)
Currently, I mainly work on efficient and reliable AI, ranging from classical deep learning models to LLM, for my Ph.D. degree.
DRIFT transfers reasoning from DeepSeek-R1 into QwenVL via gradient-space guidance, improving multimodal reasoning without destabilizing alignment or expensive RL.
To alleviate the problem of hallucinations, we propose the Differentiated Beam Decoding (DBD), along with a reliable new set of evaluation metrics: CLIP-Precision, CLIP-Recall, and CLIP-F1.
We propose a method of grouping and pruning similar experts to improve the model's parameter efficienc
Video Understanding with Large Language Models: A Survey
Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu.
Technical Report, 2023
This survey provides a detailed overview of the recent advancements in video understanding harnessing the power of LLMs.
We propose a novel input transformation based attack, called Structure Invariant Transformation (SIA), which applies a random image transformation onto each image block to craft a set of diverse images for gradient calculation.
We propose a novel Triangle Attack (TA) to optimize the perturbation by utilizing the geometric information that the longer side is always opposite the larger angle in any triangle.
We propose a new technique to compute the pathwise stochastic gradient estimate with respect to the standard deviation of the Gaussian noise added to each neuron of the ANN.
Education
University of Rochester , NY, USA
Ph.D. in Computer Science
Sep. 2022 - Present
Advisor: Chenliang Xu
Huazhong University of Science and Technology, Wuhan, China
B.Eng in Computer Science and Technology
Sept. 2018 - Jun. 2022
Experience
Microsoft Research , Redmond, US
Research intern, then part-time researcher
May 2025 - Dec 2025
Advisor: Xiaodong Liu and Hao Cheng
Work on efficient training and inference of reasoning language models.
Microsoft Research , Redmond, US
Research intern, then part-time researcher
May 2024 - Nov. 2024
Advisor: Xiaodong Liu and Hao Cheng
Work on efficient training and inference of language models.
Microsoft Research Asia , Beijing, China
Research intern
Oct. 2021 - Jun. 2022
Advisor: Xinran Wei
Work on high-performance computation of DFT, which is important/bottleneck in material design using AI.