Xu Tan (谭旭) is a Research VP of Multimodality at Moonshot AI (a.k.a Kimi). He was previously a Principal Research Manager at Machine Learning Group, Microsoft Research Asia (MSRA). His work area covers LLMs, multimodality, and generative AI for video and audio.
His has published influential research papers with 15000+ citations, with two best papers and several top cited papers at AI conferences.
He has many technologies deployed in products: 1) Kimi-Video/Kimi-TTS in Kimi; 2) neural machine translation, pre-training models (MASS, MPNet), TTS (FastSpeech 1/2), ASR (FastCorrect 1/2), AI Music (https://github.com/microsoft/muzic), and AI avatar deployed in Microsoft (e.g., Bing Search/Ads, Microsoft Translator, Azure TTS, Azure ASR, Microsoft Xiaoice, etc).