KemingWu/README.md

Hi there 👋


  • 👋 Hi, I’m Keming Wu (Charles Wu), a Ph.D. student at Tsinghua University (2025.8-present).
  • ✨ I’m interested in Computer Vision, Generative AI, and Vision Language Models.
  • 🏚️ To find out more about me, see my Google Scholar.
  • 📮 Contact me: wukeming0608@gmail.com

Pinned

  1. EvolvingLMMs-Lab/lmms-engine

    A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

    Python · 758 stars · 35 forks

  2. EvolvingLMMs-Lab/lmms-eval

    One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

    Python · 4k stars · 557 forks

  3. TIGER-AI-Lab/EditReward

    EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]

    Python · 138 stars · 6 forks

  4. EvolvingLMMs-Lab/OpenMMReasoner

    [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

    Python · 156 stars · 7 forks

  5. EvolvingLMMs-Lab/LongVT

    [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

    Python · 218 stars · 13 forks

  6. HybridLayout

    [ICCV 2025] Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics.

    Jupyter Notebook · 19 stars