My research interests include Computer Vision (CV), Multimodal Large Language Models, and Generative Models.
I am deeply committed to contributing to open-source projects, as I firmly believe they are a cornerstone for the sustainable growth of the AI community.
arxiv Preprint, 2023
[paper] |
[Code] Keywords: Parameter-efficient transfer learning for vision Transformers in various vision tasks (classification, detection, segmentation)
Annual Meeting of the Association for Computational Linguistics (ACL-Industry), 2023
[paper] |
[Code] Keywords: Text-to-image latent diffusion models with rich entity knowledge.