Delong Chen


Visiting Researcher at Meta FAIR Paris

Ph.D. Candidate at HKUST


Delong Chen (陈德龙) is a third-year Ph.D. student at The Hong Kong University of Science and Technology (HKUST), advised by Prof. Pascale Fung. He is currently a visiting researcher at Meta FAIR Paris, where he works on vision-language models and world modeling.

Awards
  • Best Paper at AAAI 2023 Inaugural Summer Symposium Series - AI x Metaverse
  • Best Dataset Paper at Long-Tailed Distribution Learning Workshop, IJCAI 2021
  • Best Demo at IEEE ICME 2021
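  • First Prize, Jiangsu Province Outstanding Undergraduate Thesis
  • "Outstanding Graduate" honorary title, Hohai University undergraduate class of 2021
  • First Prize, Campus Song Call for Entries, 2020 Jiangsu Province College Student Internet Culture Festival
  • "Outstanding Communist Youth League Member of Jiangsu Province" title
  • Nomination Award, "2019 Jiangsu Province College Student of the Year"
  • "Haiyun Fenghua College Student of the Year" title, Hohai University, 2020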
Reviewer / Program Committee
  • ICLR, NeurIPS, CVPR, ICCV, ICML, ACL Rolling Review (ARR), AAAI, ACMMM
  • IEEE TPAMI, ACM TIST, Artificial Intelligence Review
Volunteer
  • AAAI 2024 (Vancouver, Canada)
  • ACL 2024 (Bangkok, Thailand)
Teaching Assistant
  • ELEC 1200 A System View of Communications (2024 Spring, HKUST)

Selected first and co-first author papers. See the full publication list on Google Scholar.


Action100M: A Large-scale Video Action Dataset
Delong Chen, Tejaswi Kasarla, Yejin Bang, Mustafa Shukor, Willy Chung, Jade Yu, Allen Bolourchi, Theo Moutakanni, Pascale Fung
World Modeling Workshop (Mila, 2026)
[facebookresearch/Action100M (300+ stars)]
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Delong Chen*, Mustafa Shukor*, Theo Moutakanni*, Willy Chung*, Jade Yu, Tejaswi Kasarla, Allen Bolourchi, Yann LeCun, Pascale Fung
World Modeling Workshop (Mila, 2026)
Planning with Reasoning using Vision Language World Model
Delong Chen*, Theo Moutakanni*, Willy Chung, Yejin Bang, Ziwei Ji, Allen Bolourchi, Pascale Fung
World Modeling Workshop (Mila, 2026)
WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning
Delong Chen*, Willy Chung*, Yejin Bang, Ziwei Ji, Pascale Fung
ICML 2025 Workshop on Assessing World Models
[facebookresearch/WorldPrediction]
Subobject-level Image Tokenization
Delong Chen, Samuel Cahyawijaya, Jianfeng Liu, Baoyuan Wang, Pascale Fung
ICML 2025
[ChenDelong1999/subobjects] [🤗 AK's Hugging Face Daily Paper] [Demo]
What Makes for Good Image Captions?
Delong Chen, Samuel Cahyawijaya, Etsuko Ishii, Ho Shu Chan, Yejin Bang, Pascale Fung
EMNLP 2025 Findings & NeurIPS 2024 Workshop on Machine Learning and Compression
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya*, Delong Chen*, Yejin Bang*, Leila Khalatbari, Bryan Wilie*, Ziwei Ji, Etsuko Ishii, Pascale Fung*
NAACL 2025 Main
[HLTCHKUST/UniVaR]
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
Fan Liu*, Delong Chen*, Zhangqingyun Guan, Xiaocong Zhou, Jiale Zhu, Jun Zhou
IEEE Transactions on Geoscience and Remote Sensing, 2024
[ChenDelong1999/RemoteCLIP (500+ stars)] [Paperswithcode Leaderboard]
ProtoCLIP: Prototypical Contrastive Language Image Pretraining
Delong Chen, Zhao Wu, Fan Liu, Zaiquan Yang, Huaxi Huang, Ying Tan, Erjin Zhou
IEEE Transactions on Neural Networks and Learning Systems, 2023
[megvii-research/protoclip] [ITRA codebase]

* Equal Contribution
Corresponding Authors

🎻


Delong was awarded a violin performance diploma from the Central Conservatory of Music (中央音乐学院).
He served as concertmaster of the Hohai University Symphony Orchestra from 2019 to 2020. He is also on bilibili, with 20k+ followers.

Internationale ☭


Piano: Qiwen Zhang (张启文). Violin: Delong Chen & Haolin Ouyang

Nanjing-Wuhan 11-University Cloud Ensemble: "Hanyang Gate Garden" (《汉阳门花园》)


Cloud Symphony: Hanyang Gate Garden. Organized a cloud performance by symphony orchestras from 11 universities, covering composition, audio mixing, and video editing. Media coverage: Xinhua News Agency (新华社), People's Daily (人民日报).