Ting Chen (@tingchenai) / X

Ting Chen

291 posts

Ting Chen

@tingchenai

building multimodal AGI

Joined February 2010

Ting Chen
@tingchenai
Feb 23, 2025
Try it out!
Elon Musk
@elonmusk
Feb 23, 2025
Grok voice early beta is now available on the @Grok app. This is early beta, so expect issues (that will be resolved fast), but it’s still awesome.
2.2M
Ting Chen
@tingchenai
May 15, 2025
You can try grok vision in voice mode by turning on the camera too!
Elon Musk
@elonmusk
May 15, 2025
Try @grok vision. It’s awesome!
2.1M
Ting Chen
@tingchenai
Dec 10, 2024
Our goal is bit stream in and bit stream out
Elon Musk
@elonmusk
Dec 10, 2024
You can also upload any image to Grok, including memes, and it will explain what they mean x.com/i/grok/share/4…
1.5M
Ting Chen
@tingchenai
Feb 14, 2020
Introducing SimCLR: a Simple framework for Contrastive Learning of Representations. SimCLR advances previous SOTA in self-supervised and semi-supervised learning on ImageNet by 7-10% (see next). arxiv.org/abs/2002.05709 Joint work with @skornblith @mo_norouzi @geoffreyhinton.
Ting Chen
@tingchenai
May 5, 2025
A new update coming up soon.
Grok
@grok
May 4, 2025
And Grok said, “Let there be voice,” and there was voice.
112K
Ting Chen
@tingchenai
Sep 23, 2021
Have you wondered why object detection, unlike classification, has so many sophisticated algorithms? With Pix2Seq (arxiv.org/abs/2109.10852), we simply cast object detection as a language modeling task conditioned on pixels! (with @srbhsxn, Lala Li, @fleet_dj, @geoffreyhinton)
Ting Chen
@tingchenai
Apr 20, 2025
We have recently added multilingual and camera support to the voice mode. Give it a try!
Elon Musk
@elonmusk
Apr 20, 2025
It’s awesome
81K
Ting Chen
@tingchenai
Feb 18, 2025
Yes, the native voice experience is coming to Grok soon! Let us know what specific features you want to see (or hear)!
Shivon Zilis
@shivon
Feb 17, 2025
Woah! That was one of the most unexpectedly rewarding hours of my life. Instead of passively listening to an audiobook about physics while doing errands as usual I had an hour long back-and-forth conversation with Ara from Grok 3 about a bunch of scientific topics. We started
33K
Ting Chen
@tingchenai
Apr 13, 2024
Just a beginning. Multimodal understanding and generation capabilities will be rapidly improving. DM open, come and join us!
xAI
@xai
Apr 13, 2024
👀 x.ai/blog/grok-1.5v
44K
Ting Chen
@tingchenai
Jun 19, 2020
SimCLRv2: an improved self-supervised approach for semi-supervised learning. On ImageNet with 1% of the labels, it achieves 76.6% top-1, a 22% relative improvement over previous SOTA. arxiv.org/abs/2006.10029 Joint work with @skornblith, @kswersk, @mo_norouzi, @geoffreyhinton
Ting Chen
@tingchenai
Oct 13, 2022
📢Introducing Pix2Seq-D, a generalist framework casting panoptic segmentation as a discrete data generation task conditioned on pixels. Works for both images and videos, with minimal task engineering. arxiv.org/abs/2210.06366 work w/ Lala Li, @srbhsxn @geoffreyhinton @fleet_dj
00:00
Ting Chen
@tingchenai
Mar 13, 2020
Happy to share that we've open-sourced both the code and pretrained models for SimCLR (a simple framework for contrastive learning of visual representations): github.com/google-researc… joint work with @skornblith, @mo_norouzi and @geoffreyhinton.
Ting Chen
@tingchenai
Oct 13, 2022
Can we solve a (large) portion of vision tasks by simply formulating it as translating raw pixels into tokens/bits with higher level abstraction? A question that took a 2-year journey to get an answer: oh sure, if you know how to train a really good generative model🥳
Ting Chen
@tingchenai
Dec 10, 2024
A few months ago, we made a decision to focus on autoregressive modeling for lots of good reasons like reusing the scalable LLM training and inference stacks here at @xai, but also met with many challenges. It is really amazing to see how far we are able to push it forward!
Ethan Knight
@__eknight__
Dec 10, 2024
Earlier today, we released a new model, code-named Aurora, that gives Grok the ability to generate extremely photorealistic images (and in the future, even edit them). It's free to use for all of 𝕏, try it out and send us what you're creating! This model was trained entirely
29K