My research focuses on scaling Multimodal Intelligence through mid/post-training and data scaling. I focus on building multimodal agents equipped with complex tool interaction and long-term memory, ultimately striving towards autonomous self-evovling systems.
Email: richard.peng.xia AT gmail DOT com; pxia AT cs DOT unc DOT edu
Jan.2026: Three papers were accepted by ICLR 2026.
Sept.2025: Tongyi DeepResearch was released , one paper was accepted by NeurIPS 2025 and selected as a spotlight presentation, and one paper was accepted by TMLR.
May.2025: One paper was accepted by ICML 2025.
Jan.2025: Three papers were accepted by ICLR 2025 and MMIE was selected as an oral presentation.
Dec.2024: Invited talk at Cohere For AI, one paper was accepted by COLING 2025, two papers were accepted by AAAI 2025.
Sept.2024: One paper was accepted by NeurIPS 2024 and one paper was accepted by EMNLP 2024.
Jul.2024: One paper was accepted by ECCV 2024.
Jun.2024: Two papers were accepted by MICCAI 2024 and one was early accepted.
Sept.2023: One paper was accepted by NeurIPS 2023.
Journal Reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), International Journal of Computer Vision (IJCV), IEEE Transactions on Medical Imaging (TMI), Cell Patterns, Knowledge-Based Systems (KBS), Expert Systems with Applications (ESWA), Pattern Recognition (PR)