I am a year-2 master student at Tsinghua University, under the supervision of Prof. Xiu Li.
I am fortunate to be collaborating closely with Dr. Lin Song on Vision Language Model.
Before that, I obtained my BSc in Mathematics and Applied Mathematics at Xidian University in 2023.
My research interest includes Multi-Modal Learning and Computer Vision.
News
[2025.09]    The paper MindOmni is accepted by NeurIPS 2025 (CCF-A)
[2025.09]    The Paper SOC++ is accepted by TPAMI 2025 (CCF-A)
[2025.06]    We are excited to release the project, MindOmni
[2025.05]    Two papers, LoRA-Gen and HaploVLM are accepted by ICML 2025 (CCF-A)
[2024.10]    Obtain National Scholarship, Tsinghua University
[2024.06]    Two papers, MambaTree (Spotlight) and COVE are accepted by NeurIPS 2025 (CCF-A)
[2024.03]    The paper UVCOM is accepted by CVPR 2024 (CCF-A)
[2023.09]    The paper SOC is accepted by NeurIPS 2023 (CCF-A)
[2023.09]    The first prize of The 5th Large-scale Video Object Segmentation Challenge Track3: Referring Video Object                                         Segmentation
[2023.03]    The paper SemanticAC is accepted by ICASSP 2023 (CCF-B)
[2021.12]    Obtain National Scholarship, Xidian University
Academic experience
2023-Present
Studying as a Master Student at Tsinghua University
2019-2023
Studying as an Undergraduate Student at Xidian University
Industrial experience
2024.06-Present
I am a multimodal algorithm research intern supervised by Dr. Lin Song at Tencent ARC Lab
2024.01-2024.06
I am a multimodal algorithm research intern supervised by Dr. Lin Song at Tencent AI Lab
2022.12-2023.3
I am a multimodal algorithm research intern at OPPO Research Institute
Publications
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO