XingruiWang/README.md

Pinned

  1. XModBench (Public)

    XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

    Python 6

  2. Spatial457 (Public)

    [CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning.

    Python 20 3

  3. KeyVID (Public)

    Official code for the paper "KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation".

    Python 6

  4. 3D-Aware-VQA (Public)

    Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"

    Jupyter Notebook 21

  5. open-compass/VLMEvalKit (Public)

    Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

    Python 4.2k 725

  6. EvolvingLMMs-Lab/lmms-eval (Public)

    One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

    Python 4.3k 607