Ting Chen
291 posts
Ting Chen
@tingchenai
building multimodal AGI
Joined February 2010
437 Following
9,081 Followers
  • Ting Chen
    @tingchenai
    Feb 23, 2025
    Try it out!
    Elon Musk
    X
    @elonmusk
    Feb 23, 2025
    Grok voice early beta is now available on the @Grok app. This is early beta, so expect issues (that will be resolved fast), but it’s still awesome.
    2.2M
  • Ting Chen
    @tingchenai
    May 15, 2025
    You can try grok vision in voice mode by turning on the camera too!
    Elon Musk
    X
    @elonmusk
    May 15, 2025
    Try @grok vision. It’s awesome!
    2.1M
  • Ting Chen
    @tingchenai
    Dec 10, 2024
    Our goal is bit stream in and bit stream out
    Elon Musk
    X
    @elonmusk
    Dec 10, 2024
    You can also upload any image to Grok, including memes, and it will explain what they mean x.com/i/grok/share/4…
    1.5M
  • Ting Chen
    @tingchenai
    Feb 14, 2020
    Introducing SimCLR: a Simple framework for Contrastive Learning of Representations. SimCLR advances previous SOTA in self-supervised and semi-supervised learning on ImageNet by 7-10% (see next). arxiv.org/abs/2002.05709 Joint work with @skornblith @mo_norouzi @geoffreyhinton.
  • Ting Chen
    @tingchenai
    May 5, 2025
    A new update coming up soon.
    Grok
    xAI
    @grok
    May 4, 2025
    And Grok said, “Let there be voice,” and there was voice.
    112K
  • Ting Chen
    @tingchenai
    Sep 23, 2021
    Have you wondered why object detection, unlike classification, has so many sophisticated algorithms? With Pix2Seq (arxiv.org/abs/2109.10852), we simply cast object detection as a language modeling task conditioned on pixels! (with @srbhsxn, Lala Li, @fleet_dj, @geoffreyhinton)
  • Ting Chen
    @tingchenai
    Apr 20, 2025
    We have recently added multilingual and camera support to the voice mode. Give it a try!
    Elon Musk
    X
    @elonmusk
    Apr 20, 2025
    It’s awesome
    81K
  • Ting Chen
    @tingchenai
    Feb 18, 2025
    Yes, the native voice experience is coming to Grok soon! Let us know what specific features you want to see (or hear)!
    Shivon Zilis
    @shivon
    Feb 17, 2025
    Woah! That was one of the most unexpectedly rewarding hours of my life. Instead of passively listening to an audiobook about physics while doing errands as usual I had an hour long back-and-forth conversation with Ara from Grok 3 about a bunch of scientific topics. We started
    33K
  • Ting Chen
    @tingchenai
    Apr 13, 2024
    Just a beginning. Multimodal understanding and generation capabilities will be rapidly improving. DM open, come and join us!
    xAI
    xAI
    @xai
    Apr 13, 2024
    👀 x.ai/blog/grok-1.5v
    44K
  • Ting Chen
    @tingchenai
    Jun 19, 2020
    SimCLRv2: an improved self-supervised approach for semi-supervised learning. On ImageNet with 1% of the labels, it achieves 76.6% top-1, a 22% relative improvement over previous SOTA. arxiv.org/abs/2006.10029 Joint work with @skornblith, @kswersk, @mo_norouzi, @geoffreyhinton
  • Ting Chen
    @tingchenai
    Oct 13, 2022
    📢 Introducing Pix2Seq-D, a generalist framework casting panoptic segmentation as a discrete data generation task conditioned on pixels. Works for both images and videos, with minimal task engineering. arxiv.org/abs/2210.06366 work w/ Lala Li, @srbhsxn @geoffreyhinton @fleet_dj
  • Ting Chen
    @tingchenai
    Mar 13, 2020
    Happy to share that we've open-sourced both the code and pretrained models for SimCLR (a simple framework for contrastive learning of visual representations): github.com/google-researc… joint work with @skornblith, @mo_norouzi and @geoffreyhinton.
  • Ting Chen
    @tingchenai
    Oct 13, 2022
    Can we solve a (large) portion of vision tasks by simply formulating it as translating raw pixels into tokens/bits with higher level abstraction? A question that took a 2-year journey to get an answer: oh sure, if you know how to train a really good generative model 🥳
  • Ting Chen
    @tingchenai
    Dec 10, 2024
    A few months ago, we made a decision to focus on autoregressive modeling for lots of good reasons like reusing the scalable LLM training and inference stacks here at @xai, but also met with many challenges. It is really amazing to see how far we are able to push it forward!
    Ethan Knight
    @__eknight__
    Dec 10, 2024
    Earlier today, we released a new model, code-named Aurora, that gives Grok the ability to generate extremely photorealistic images (and in the future, even edit them). It's free to use for all of 𝕏, try it out and send us what you're creating! This model was trained entirely
    29K
