I'm a second-year CS Ph.D. student at UNC Chapel Hill, advised by Prof. Mohit Bansal. I previously completed a Master of Machine Learning and Computer Vision at the Australian National University, advised by Prof. Stephen Gould.
Before that, I received my bachelor's degree in applied mathematics from the University of Science and Technology of China.
(2026-02)   New preprint AnchorWeave on world-consistent video generation with spatial memories.
(2025-11)   DreamRunner is accepted to AAAI 2026! 🎉
(2025-10)   New preprint SID for self-generating large-scale navigation demonstrations.
(2025-07)   One paper accepted to ICCV 2025.
(2025-05)   Summer Intern at 🛒 Amazon!
(2025-01)   Self-refining Data Flywheel for high-quality VLN data generation is accepted to ICLR 2025! 🤖 It surpasses human performance on R2R-VLN for the first time!
(2024-11)   New preprint DreamRunner✨ for storytelling video generation! My first PhD project at UNC MURGe-Lab🥳!
(2024-11)   Our VLN survey paper is accepted to TMLR!
(2024-08)   Started my Ph.D. in the MURGe Lab at UNC Chapel Hill. Hello UNC😆!
(2024-07)   Two papers accepted to ECCV 2024! Congrats Gengze and InternVideo Team!
(2024-04)   One paper accepted to TPAMI! Congrats Dong!
(2024-02)   One paper accepted to CVPR 2024 as a Highlight! Congrats Kunchang!
(2023-10)   Attending ICCV 2023 in Paris in person😆! Great pleasure to learn from so many researchers/scholars🥹!
(2023-07)   One paper accepted to ICCV 2023 as an Oral presentation!
MVBench: A Comprehensive Multi-Modal Video Understanding Benchmark
Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao
CVPR, 2024, Highlight (3%) paper / code
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
Yi Wang*, Kunchang Li*, Xinhao Li*, Jiashuo Yu*, Yinan He*, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang
ECCV, 2024 paper / code
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu
ECCV, 2024 paper / code
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
Dong An, Hanqing Wang, Wenguan Wang, Zun Wang, Yan Huang, Keji He, Liang Wang
TPAMI, 2024 paper / code
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang*, Kunchang Li*, Yizhuo Li*, Yinan He*, Bingkun Huang*, Zhiyu Zhao*, Hongjie Zhang*, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Yali Wang, Limin Wang, Yu Qiao
Technical Report, 2022 paper / code
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Yicong Hong*, Zun Wang*, Qi Wu, Stephen Gould
CVPR, 2022 paper / code
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Dong An*, Zun Wang*, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao
Technical Report, 2022 paper