I obtained the Ph.D degree at East China Normal University (ECNU) (from Sep. 2019 to Jun. 2024) in Shanghai supervised by Prof. Ming Gao, Prof. Xiang Li, and Prof. Yunshi Lan majoring Deep Learning and Natural Language Processing (NLP). I also visited at UCSD and supervised by Prof. Julian McAuley from Oct. 2023 to Feb. 2024. Now, I am a full-time staff engineering in Meituan
My research points consist of Large Language Models (LLMs), Post-training for LLMs (SFT and RLVR), Formal Reasoning, and Agentic Learning. If you want to join us, please send an email to me.
I have published some research papers (30+) at ICLR, ACL, AAAI, EMNLP, CIKM, etc., the topic includes Thinking Pattern Research for LLMs, Formal Theorem Proving, Long-horizon with RLVR, Few-shot Leanrning and Prompt Learning. More details can be found in Publications and Projects.
I had some internship experiences, including Ant Group (Digital Finance, Jun. 2021) and Alibaba Group (Platform of AI, AIR intern, Sep. 2022).
I like writing Chinese blogs at CSDN, sharing some papers and research knowledge of NLP, you can visit my Chinese blogs in here.
If you have any comments, please click here to let me know!
News
[2026-01-27] Our R-Horizon has been accepted to ICLR 2026. We release R-Horizon, which is a novel benchmark and training recipe for RLVR on long-horizon reasoning. 🔥🔥🔥
[2026-01-23] We release EvoCUA, which focus on GUI agent and achieves SOTA performance (56.7%) through all open-weights models on OS-World leaderboard. 🔥🔥🔥
[2026-01-15] We release LongCat-Flash-Thinking-2601, which aims at advancing our prior LongCat-Flash-Thinking on agentic tool use, searching and heavy thinking. 🔥🔥🔥
[2025-09-22] We release LongCat-Flash-Thinking, which is an efficient and powerful large reasoning model on solving general reasoning, formal proving, and agentic tasks.