Yi Tay
4,001 posts
Yi Tay
@YiTayML
research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.
mixture-of-locations
yitay.net
Joined October 2016
87 Following
57K Followers
  • Pinned
    Yi Tay
    @YiTayML
    Dec 4, 2025
    Happy to share that the @GoogleDeepMind Gemini team is starting a new research team in Singapore! This new team will be focused on advanced reasoning, LLM/RL and improving bleeding edge SOTA models such as Gemini, Gemini Deep Think and beyond. 🔥 This team will be led by yours
    322K
  • Yi Tay
    @YiTayML
    Mar 5, 2024
    Long overdue but here's a new blogpost on training LLMs in the wilderness from the ground up 😄🧐 In this blog post, I discuss: 1. Experiences in procuring compute & variance in different compute providers. Our biggest finding/surprise is that variance is super high and it's
    918K
  • Yi Tay
    @YiTayML
    Nov 18, 2025
    Gemini 3! This is our most intelligent model that brings any idea to life. 😻 This is the best model in the world, by a crazy wide margin! Aside from a huge increase across absolutely everything, look at its coding capabilities and quality of aesthetics and fidelity.
    221K
  • Yi Tay
    @YiTayML
    Mar 2, 2023
    New open source Flan-UL2 20B checkpoints :) - Truly open source 😎 No forms! 🤭 Apache license 🔥 - Best OS model on MMLU/Big-Bench hard 🤩 - Better than Flan-T5 XXL & competitive to Flan-PaLM 62B. - Size ceiling of Flan family just got higher! Blog:
    yitay.net
    A New Open Source Flan 20B with UL2 — Yi Tay
    Releasing the new open source Flan-UL2 20B model.
    452K
  • Yi Tay
    @YiTayML
    Jul 21, 2025
    Our IMO gold model is not just an "experimental reasoning" model. It is way more general purpose than anyone would have expected. This general deep think model is going to be shipped so stay tuned! 🔥
    Melvin Johnson
    @melvinjohnsonp
    Jul 21, 2025
    So happy to see this incredible achievement. Huge congrats to @lmthang, @quocleix, @YiTayML and the IMO team on the result. This was a great collaboration across teams to build a general Gemini DeepThink model that can also get gold at IMO.
    354K
  • Yi Tay
    @YiTayML
    Nov 25, 2024
    Personal / life update: I have returned to @GoogleDeepMind to work on AI & LLM research. It was an exciting 1.5 years at @RekaAILabs and I truly learned a lot from this pretty novel experience. I wrote a short note about my experiences and transition on my personal blog here
    yitay.net
    Returning to Google DeepMind — Yi Tay
    Returning to Google and recounting my experiences as a startup co-founder.
    325K
  • Yi Tay
    @YiTayML
    Oct 3, 2023
    It’s been a short 6 months since I left Google Brain and it has been a uniquely challenging yet interesting experience to build everything from the ground up in an entirely new environment (e.g., the wilderness) Today, we’re excited to announce the first version of the
    Reka
    @RekaAILabs
    Oct 3, 2023
    We are excited to announce the 1st version of our multimodal assistant, Yasa-1, a language assistant with visual and auditory sensors that can take actions via code execution 🪄. Yasa-1 can understand text, images, videos, sounds & more! 🚀 Check out more details below👇
    479K
  • Yi Tay
    @YiTayML
    Jun 12, 2023
    Hot take 🔥: Lots of buzz these days about new foundation open-source models but what if I told you there have been no real advance since 2019's T5 models 😀 Take a look at this table from this new InstructEval paper: arxiv.org/abs/2306.04757. Some thoughts/observations: 1.
    503K
  • Yi Tay
    @YiTayML
    Jul 25, 2022
    "Scaling laws vs Model Architectures" from @GoogleAI. Lessons: - Not all arch scale the same way. - Vanilla Transformer does pretty well 😀 - Touching the attention too much is "dangerous". 😔 - Perf at base may not translate to large+ scale. pdf: arxiv.org/abs/2207.10551
  • Yi Tay
    @YiTayML
    Mar 30, 2023
    Over the past 3.3 years at Google, I have been blessed with so many wonderful friendships and experiences. I have grown so much. However, it’s time to move on to a new adventure! I wrote a blogpost about my wonderful experience here:
    yitay.net
    Leaving Google Brain — Yi Tay
    Documenting my 3.3 years at Google Research and Brain.
    393K
  • Yi Tay
    @YiTayML
    Apr 15, 2024
    It's been a wild ride. Just 20 of us, burning through thousands of H100s over the past months, we're glad to finally share this with the world! 💪 One of the goals we’ve had when starting Reka was to build cool innovative models at the frontier. Reaching GPT-4/Opus level was a
    Reka
    @RekaAILabs
    Apr 15, 2024
    Meet Reka Core, our best and most capable multimodal language model yet. 🔮 It’s been a busy few months training this model and we are glad to finally ship it! 💪 Core has a lot of capabilities, and one of them is understanding video --- let’s see what Core thinks of the 3 body
    217K
  • Yi Tay
    @YiTayML
    Jun 27, 2023
    We’re coming out of stealth with $58M in funding to build generative models and advance AI research at @RekaAILabs 🔥🚀 Language models and their multimodal counterparts are already ubiquitous and massively impactful everywhere. That said, we are still at the beginning of this
    Reka funding announcement to build generative models
    261K
  • Yi Tay
    @YiTayML
    Sep 16, 2020
    Inspired by the dizzying number of efficient Transformers ("x-formers") models that are coming out lately, we wrote a survey paper to organize all this information. Check it out at arxiv.org/abs/2009.06732. Joint work with @m__dehghani @dara_bahri and @metzlerd. @GoogleAI 😀😃
  • Yi Tay
    @YiTayML
    Feb 18, 2022
    Excited to share our latest work at @GoogleAI on "Transformer Memory as a Differentiable Search Index"! TL;DR? We parameterize a search system with only a single Transformer model 😎. Everything in the corpus is encoded in the model! 🙌 Paper: arxiv.org/abs/2202.06991
