🤔
My current research interest lies in multi-modal representation learning.
-
SJTU
- Shanghai
Highlights
- Pro
Pinned Loading
-
Tencent/VersaViT
Tencent/VersaViT PublicVersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization
Python 6
-
jinxiang-liu/UFE-AVS
jinxiang-liu/UFE-AVS PublicOfficial code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

