Log inSign up
Zirui "Colin" Wang
189 posts
Image
user avatar
Zirui "Colin" Wang
@zwcolin
Research Intern @MetaAI; CS PhD Student @Berkeley_AI and @BerkeleySky; prev @Princeton_NLP, @HDSIUCSD, @VoioInc multimodal interaction
Berkeley, CA
ziruiw.net
Joined January 2015
663
Following
1,416
Followers
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Mar 11, 2025
    Life update: I'll be joining Berkeley EECS as a PhD student starting in fall 2025, playing around with multimodal models and llms, being part of Sky Lab & BAIR, and enjoying the unreal™️ weather 🏖️ CA has to offer!
    Image
    Image
    28K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Feb 1, 2025
    While DeepSeek R1 has been flexing 💪🏻, how are VLMs progressing in 𝐫𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠? ⚠️ Major Shift: the latest 𝐨𝐩𝐞𝐧-𝐰𝐞𝐢𝐠𝐡𝐭 Qwen2.5-VL has beaten the first GPT-4o and is now on par with the latest ChatGPT-4o! 😲 But what about o1-like models? Can they enhance
    Image
    33K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Jun 27, 2024
    🤨 Are Multimodal Large Language Models really as 𝐠𝐨𝐨𝐝 at 𝐜𝐡𝐚𝐫𝐭 𝐮𝐧𝐝𝐞𝐫𝐬𝐭𝐚𝐧𝐝𝐢𝐧𝐠 as existing benchmarks such as ChartQA suggest? 🚫 Our ℂ𝕙𝕒𝕣𝕏𝕚𝕧 benchmark suggests NO! 🥇Humans achieve ✨𝟖𝟎+% correctness. 🥈Sonnet 3.5 outperforms GPT-4o by 10+ points,
    Image
    00:00
    48K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Jul 26, 2024
    🤖 Welcome 𝐆𝐏𝐓-𝟒𝐨 𝐌𝐢𝐧𝐢 and 𝐈𝐧𝐭𝐞𝐫𝐧𝐕𝐋𝟐 𝐋𝐋𝐚𝐌𝐀-𝟑 𝟕𝟔𝐁 to the CharXiv (charxiv.github.io)  leaderboard for chart understanding! As concurrently released models, GPT-4o Mini is 𝐛𝐞𝐚𝐭𝐞𝐧 𝐛𝐲 𝐭𝐡𝐞 𝐨𝐩𝐞𝐧-𝐰𝐞𝐢𝐠𝐡𝐭 𝐨𝐧𝐞. 🎊 Congratulations to
    Image
    16K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Dec 7, 2023
    We present 🧩 TokenCompose, a text-to-image latent diffusion model trained with fine-grained grounding objectives for enhanced compositionality and photorealism. 🌐 Website: mlpc-ucsd.github.io/TokenCompose/ 📃 Paper: huggingface.co/papers/2312.03… 🖥️ Code: github.com/mlpc-ucsd/Toke… 🧵[1/n]
    Image
    00:00
    16K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Jul 15, 2024
    🎉Exciting news in Multimodal LLMs!  We're excited to see that 𝐈𝐧𝐭𝐞𝐫𝐧𝐕𝐋 𝐂𝐡𝐚𝐭 𝐕𝟐.𝟎 and 𝐂𝐚𝐦𝐛𝐫𝐢𝐚𝐧 now lead the 𝐂𝐡𝐚𝐫𝐗𝐢𝐯 leaderboard (charxiv.github.io) in chart understanding for open-weight models. 🤔What leads to their success? Here's some of
    Image
    17K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Dec 11, 2024
    🚨 I'll be presenting CharXiv this Friday morning at #neurips and Sunday at the MAR workshop. I'm 🤗 to connect with new friends and chat about developing/enhancing multimodal models (text-to-image, VLMs, etc) and their evaluations! Let's meet up at the conference :)
    Image
    3.7K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Aug 7, 2024
    Just finished response to authors' rebuttal for all papers that had a rebuttal in my batch. I hope these in-time responses give people more time/rounds for healthy and meaningful discussions on their papers! 👀 #NeurIPS
    Image
    2.9K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Apr 14, 2025
    i've been working on my masters' thesis and finally got something worth mentioning for the broader impact of the research work i did last year -- it's not another benchmark but an eval that people and devs care about and i'm ready to build more of them :p
    Image
    1.6K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Sep 26, 2024
    I'm honored to join the Siebel Scholars '25 cohort and committed to make AI systems empirically usable and useful in future years to come :).
    user avatar
    Princeton Computer Science
    @PrincetonCS
    Sep 26, 2024
    Congrats to @chochijimat, @danfriedman0, @sunniesuhyoung, @SadhikaMalladi and @zwcolin on being named 2025 @SiebelScholars! Now in its 24th year, the Siebel Scholars Program awards fellowships to students based on academic achievement and leadership. bit.ly/3ZLtrv4
    graphic of five Siebel Scholars
    3K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Mar 3, 2024
    Replying to @IEEESpectrum @_akhaliq and @arankomatsuzaki
    The original article claimed a *correlation*: "the correlation between influencer tweets and citation count," but your tweet statement made it sound like a *causation*. This is not rigorous at all. One can also say that influencers are forward-thinking in selecting papers.
    1.6K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Feb 26, 2024
    Today we are excited to share that 🧩 TokenCompose has been accepted to #CVPR2024. See you soon in Seattle!
    user avatar
    Zirui "Colin" Wang
    @zwcolin
    Dec 7, 2023
    We present 🧩 TokenCompose, a text-to-image latent diffusion model trained with fine-grained grounding objectives for enhanced compositionality and photorealism. 🌐 Website: mlpc-ucsd.github.io/TokenCompose/ 📃 Paper: huggingface.co/papers/2312.03… 🖥️ Code: github.com/mlpc-ucsd/Toke… 🧵[1/n]
    Image
    00:00
    1.8K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Jan 21, 2023
    Our paper (and my first RL paper ever): "On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning" got accepted @iclr_conf! 🧵(1/3)
    1.9K
  • user avatar
    Zirui "Colin" Wang
    @zwcolin
    Dec 14, 2024
    I'll present CharXiv at tmr's Multimodal Algorithmic Reasoning workshop for a spotlight talk at 11:45am followed by a poster session at 2:15pm in West Building Exhibit Hall A. If you are interested in or working on developing/evaluating multimodal models, let's connect there!
    Image
    2.1K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement