Yuheng Li

Research [ Selected | All ]

Relational Visual Similarity
Thao Nguyen, Sicheng Mo, Krishna Kumar Singh, Yilin Wang, Jing Shi, Nicholas Kolkin, Eli Shechtman, Yong Jae Lee, Yuheng Li
arXiv (arXiv), 2025
[ProjectPage, Paper, Code]

Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration
Sicheng Mo, Thao Nguyen, Richard Zhang, Nick Kolkin, Siddharth Srinivasan Iyer, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li
arXiv (arXiv), 2025
[ProjectPage, Paper]

Learning an Image Editing Model without Image Editing Pairs
Nupur Kumari, Sheng-Yu Wang, Nanxuan Zhao, Yotam Nitzan, Yuheng Li, Krishna Kumar Singh, Richard Zhang, Eli Shechtman, Jun-Yan Zhu, Xun Huang
arXiv (arXiv), 2025
[ProjectPage, Paper]

Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing
Chun-Hsiao Yeh, Yilin Wang, Nanxuan Zhao, Richard Zhang, Yuheng Li, Yi Ma, Krishna Kumar Singh
AAAI Conference on Artificial Intelligence (AAAI), 2025
[ProjectPage, Paper]

X-Fusion: Introducing New Modality to Frozen Large Language Models
Sicheng Mo, Thao Nguyen, Xun Huang, Siddharth Srinivasan Iyer, Yijun Li, Yuchen Liu, Abhishek Tandon, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li
IEEE International Conference on Computer Vision, (ICCV), 2025
🏆 Best paper at CVPR 2025 Workshop: "Transformers for Vision (T4V)
[ProjectPage, Code, Paper]

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia, David Bourgin, Krishna Kumar Singh, Yuheng Li, Yan Kang, Zhan Xu, Niraj K. Jha, Yuchen Liu
IEEE International Conference on Computer Vision, (ICCV), 2025
[ProjectPage, Paper]

Yo'Chameleon: Personalized Vision and Language Generation
Thao Nguyen, Krishna Kumar Singh, Jing Shi, Trung Bui, Yong Jae Lee, Yuheng Li
Conference on Computer Vision and Pattern Recognition (CVPR), 2025
[ProjectPage, Code, Paper]

Yo'LLaVA: Your Personalized Language and Vision Assistant
Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee
Neural Information Processing Systems (NeurIPS), 2024
[ProjectPage, Code, Paper]

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Yuheng Li, Haotian Liu, Mu Cai, Yijun Li, Eli Shechtman, Zhe Lin, Yong Jae Lee, Krishna Kumar Singh,
European Conference on Computer Vision, (ECCV), 2024
[ProjectPage, Paper]

Improved Baselines with Visual Instruction Tuning (LLaVA-1.5)
Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ProjectPage, Code, Paper]

Edit One for All: Interactive Batch Image Editing
Thao Nguyen, Utkarsh Ojha, Yuheng Li, Haotian Liu, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ProjectPage, Code, Paper]

Generate Anything Anywhere in Any Scene
Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee
arxiv, 2023

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Mu Cai*, Zeyi Huang*, Yuheng Li, Haohan Wang, and Yong Jae Lee
(*equal contribution)
IEEE Winter Conference on Applications of Computer Vision (WACV), 2025, 2023

Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee
Neural Information Processing Systems (NeurIPS), 2023
[ProjectPage, Code, Paper]

What Knowledge Gets Distilled in Knowledge Distillation?
Utkarsh Ojha*, Yuheng Li*, Anirudh Sundara Rajan*, Yingyu Liang, Yong Jae Lee
(*equal contribution)
Neural Information Processing Systems (NeurIPS), 2023

GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li*, Yong Jae Lee*
(*equal advising)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[arXiv] [code] [Project Page] [Demo] [Youtube]

Towards Universal Fake Image Detectors that Generalize Across Generative Models
Utkarsh Ojha*, Yuheng Li*, Yong Jae Lee
(*equal contribution)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Delving Deeper into Anti-aliasing in ConvNets
Xueyan Zou, Fanyi Xiao, Zhiding Yu, Yuheng Li, and Yong Jae Lee
International Journal of Computer Vision (IJCV), 2022

Contrastive Learning for Diverse Disentangled Foreground Generation
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh
Proceedings of the European Conference on Computer Vision (ECCV), 2022

GIRAFFE HD: A High-Resolution 3D-aware Generative Model
Yang Xue, Yuheng Li, Krishna Kumar Singh, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[arXiv] [code]

Collaging Class-specific GANs for Semantic Image Synthesis
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh
IEEE International Conference on Computer Vision (ICCV), 2021
[arXiv] [project]

PartGAN: Unsupervised Part Decomposition for Image Generation and Segmentation
Yuheng Li, Krishna Kumar Singh, Yong Jae Lee
British Machine Vision Conference (BMVC), 2021

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
Yuheng Li, Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[arXiv] [code]