Log inSign up
Gradium
137 posts
Image
user avatar
Gradium
@GradiumAI
The voice layer for modern apps and agents. Real-time, scalable voice APIs: TTS, STT, turn-taking & voice cloning. Devs: build → gradium.ai/#models
gradium.ai
Joined September 2025
1
Following
3,628
Followers
  • Pinned
    user avatar
    Gradium
    @GradiumAI
    Dec 2, 2025
    Gradium is out of stealth to solve voice. We raised $70M and after only 3 months we’re releasing our transcription and synthesis products to power the next generation of voice AI.
    Image
    00:00
    473K
  • user avatar
    Gradium
    @GradiumAI
    2h
    Gradium STT and TTS now powers @adoptbuddy , the emotional companion robot from @BlueFrogRobotic, deployed in schools, hospitals, and senior care.
    Image
    538
  • user avatar
    Gradium
    @GradiumAI
    Jun 15
    Our on-device TTS model Phonon out there crushing it on TTFA and WER.
    user avatar
    Pratim🥑
    @BhosalePratim
    Jun 13
    Long flights always give me more ideas to think about what's missing around us. Few prompts later, here's Scribble Story. On-device fully local pipeline to convert scribblings into a short story you can listen to. Using @GradiumAI Phonon and @Alibaba_Qwen
    Image
    00:00
    781
  • user avatar
    Gradium
    @GradiumAI
    Jun 15
    60+ new voices live in the Gradium catalogue. English, Spanish, French, German, and Portuguese, with eight regional accents across them. gradium.ai
    Image
    GIF
    1.8K
  • user avatar
    Gradium
    @GradiumAI
    Jun 11
    ¡Hola Barcelona☀️
    user avatar
    Nicolas Grenié
    @picsoung
    Jun 11
    Next speaker are @ConstanceGriso and Timothé from @GradiumAI talking about Phonon, their on device model Really cool demo
    Image
    895
  • user avatar
    Gradium
    @GradiumAI
    Jun 10
    We upgraded Gradium TTS for the cases voice agents can't get wrong: phone numbers, codes, email addresses read back right the first time. Couple of examples: English: 97% on emails, top of the field. French: leads every competitor we benchmarked. Samples + methodology →
    Image
    00:00
    8.4K
  • user avatar
    Gradium
    @GradiumAI
    Jun 10
    In this joint work with @kyutai_labs, we design a reward model for conversational dynamics to teach full-duplex models how a human behaves in conversation, using cues to know when to interrupt, backchannel or stay silent.
    user avatar
    kyutai
    @kyutai_labs
    Jun 10
    New paper: Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models We use RL to post-train speech models (Moshi and PersonaPlex) to talk more like a human: to know when to respond, when to wait, and when to nod along with “yeah”s and “okay”s when listening.
    Image
    00:00
    3.2K
  • user avatar
    Gradium
    @GradiumAI
    Jun 9
    We'll be at @VivaTech next week showcasing our models. Come find us at Booth 7.2 | 2F13 with @awscloud all week, and on the @LaFrenchTech booth on Wednesday. @neilzegh is giving two talks: Wed 17th, 5:20pm, @nvidia Stage 1 and on Fri, 10am, Théâtre AWS
    Image
    458
  • user avatar
    Gradium
    @GradiumAI
    Jun 9
    Learn how to build an audiobook voice agent using Gradium and @pipecat_ai Gradium's TTS handles the narration and Pipecat's built-in WebRTC transport delivers the audio to the browser.
    Image
    00:00
    11K
  • user avatar
    Gradium
    @GradiumAI
    Jun 5
    Reasoning LLMs typically take 2-3 seconds to start emitting tokens. In a voice agent, that's 2-3 seconds of silence after the user finishes speaking. The @MiniMax_AI team just shipped a community contribution to Gradbot with two models running in parallel. MiniMax-M2-her
    Image
    GitHub - gradium-ai/gradbot: Open source framework to vibecode and prototype voice agents with...
    From github.com
    5.6K
  • user avatar
    Gradium
    @GradiumAI
    Jun 4
    A full house at the @joinhexa office in Paris yesterday. Our CTO @olivierteboul joined the discussion by sharing why low latency matters for voice agents and how Gradium models support enterprise use cases for voice AI.
    Image
    Image
    795
  • user avatar
    Gradium
    @GradiumAI
    Jun 2
    "I'd like to cancel my flight from Boston to..." You pause to check a date. The agent cuts in: "Got it, where to?" Now you're talking over it to finish your own sentence. That's acoustic turn detection. Semantic VAD waits because it knows you're not done: gradium.ai/blog/semantic-…
    Image
    3.2K
  • Gradium reposted
    user avatar
    Neil Zeghidour
    @neilzegh
    May 28
    Yesterday we hosted our first Voice AI Dinner in Berlin. Where should be the next one?
    Image
    2.2K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement