Gradium (@GradiumAI) / X

Gradium

137 posts

@GradiumAI

The voice layer for modern apps and agents. Real-time, scalable voice APIs: TTS, STT, turn-taking & voice cloning. Devs: build → gradium.ai/#models

gradium.ai

Joined September 2025

3,628 Followers

Pinned
Gradium
@GradiumAI
Dec 2, 2025
Gradium is out of stealth to solve voice. We raised $70M and after only 3 months we’re releasing our transcription and synthesis products to power the next generation of voice AI.
473K
Gradium
@GradiumAI
2h
Gradium STT and TTS now power @adoptbuddy, the emotional companion robot from @BlueFrogRobotic, deployed in schools, hospitals, and senior care.
538
Gradium
@GradiumAI
Jun 15
Our on-device TTS model Phonon is out there crushing it on TTFA and WER.
Pratim🥑
@BhosalePratim
Jun 13
Long flights always give me more ideas to think about what's missing around us. A few prompts later, here's Scribble Story: an on-device, fully local pipeline that converts scribbles into a short story you can listen to. Using @GradiumAI Phonon and @Alibaba_Qwen
781
Gradium
@GradiumAI
Jun 15
60+ new voices live in the Gradium catalogue. English, Spanish, French, German, and Portuguese, with eight regional accents across them. gradium.ai
1.8K
Gradium
@GradiumAI
Jun 11
Hello, Barcelona! ☀️
Nicolas Grenié
@picsoung
Jun 11
Next speakers are @ConstanceGriso and Timothé from @GradiumAI, talking about Phonon, their on-device model. Really cool demo.
895
Gradium
@GradiumAI
Jun 10
We upgraded Gradium TTS for the cases voice agents can't get wrong: phone numbers, codes, and email addresses, read back right the first time. A couple of examples: English: 97% on emails, top of the field. French: leads every competitor we benchmarked. Samples + methodology →
8.4K
Gradium
@GradiumAI
Jun 10
In this joint work with @kyutai_labs, we design a reward model for conversational dynamics to teach full-duplex models how a human behaves in conversation, using cues to know when to interrupt, backchannel or stay silent.
kyutai
@kyutai_labs
Jun 10
New paper: Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models We use RL to post-train speech models (Moshi and PersonaPlex) to talk more like a human: to know when to respond, when to wait, and when to nod along with “yeah”s and “okay”s when listening.
3.2K
Gradium
@GradiumAI
Jun 9
We'll be at @VivaTech next week showcasing our models. Come find us at Booth 7.2 | 2F13 with @awscloud all week, and on the @LaFrenchTech booth on Wednesday. @neilzegh is giving two talks: Wed 17th, 5:20pm, @nvidia Stage 1 and on Fri, 10am, Théâtre AWS
458
Gradium
@GradiumAI
Jun 9
Learn how to build an audiobook voice agent using Gradium and @pipecat_ai. Gradium's TTS handles the narration, and Pipecat's built-in WebRTC transport delivers the audio to the browser.
11K
Gradium
@GradiumAI
Jun 5
Reasoning LLMs typically take 2-3 seconds to start emitting tokens. In a voice agent, that's 2-3 seconds of silence after the user finishes speaking. The @MiniMax_AI team just shipped a community contribution to Gradbot with two models running in parallel. MiniMax-M2-her
GitHub - gradium-ai/gradbot: Open source framework to vibecode and prototype voice agents with...
From github.com
5.6K
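The post above is cut off before it explains how the two models cooperate, so the following is only a generic sketch of the idea of running a fast model alongside a slow reasoning model to avoid dead air. The functions `fast_ack` and `slow_reasoner` are hypothetical stand-ins with simulated delays. They are not Gradbot or MiniMax APIs, and this is not a description of the actual contribution.

```python
import asyncio


async def fast_ack(user_text: str) -> str:
    # Hypothetical stand-in for a small, low-latency model (simulated delay).
    await asyncio.sleep(0.01)
    return "Let me check that for you."


async def slow_reasoner(user_text: str) -> str:
    # Hypothetical stand-in for a reasoning model that is slow to start
    # emitting tokens (simulated delay, much shorter than the real 2-3 s).
    await asyncio.sleep(0.05)
    return f"Here is a considered answer to: {user_text}"


async def respond(user_text: str) -> list[str]:
    # Start the slow model immediately so it runs in the background.
    reasoning = asyncio.create_task(slow_reasoner(user_text))
    # Speak the fast acknowledgement while the slow model is still working.
    spoken = [await fast_ack(user_text)]
    # Then speak the full answer once it is ready.
    spoken.append(await reasoning)
    return spoken


def run_demo(user_text: str) -> list[str]:
    return asyncio.run(respond(user_text))
```

In a real agent each string would be streamed to TTS as soon as it is available, so the user hears the acknowledgement while the reasoning model is still working.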
Gradium
@GradiumAI
Jun 4
A full house at the @joinhexa office in Paris yesterday. Our CTO @olivierteboul joined the discussion by sharing why low latency matters for voice agents and how Gradium models support enterprise use cases for voice AI.
795
Gradium
@GradiumAI
Jun 2
"I'd like to cancel my flight from Boston to..." You pause to check a date. The agent cuts in: "Got it, where to?" Now you're talking over it to finish your own sentence. That's acoustic turn detection. Semantic VAD waits because it knows you're not done: gradium.ai/blog/semantic-…
3.2K
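To make the contrast in the post above concrete, here is a toy sketch. Acoustic turn detection is modelled as a plain silence threshold, and the "semantic" check as a crude heuristic that treats a transcript ending in a dangling function word as unfinished. The word list, thresholds, and function names are all assumptions for illustration. A production semantic VAD would use a trained model, and this is not Gradium's implementation.

```python
# Hypothetical list of words that rarely end a complete utterance.
INCOMPLETE_ENDINGS = {"to", "from", "and", "or", "the", "a", "my", "for", "with", "on"}


def acoustic_end_of_turn(silence_ms: int, threshold_ms: int = 500) -> bool:
    # Acoustic detection: the turn ends as soon as silence is long enough.
    return silence_ms >= threshold_ms


def looks_unfinished(transcript: str) -> bool:
    # Toy semantic cue: the last word is a dangling function word.
    words = transcript.lower().rstrip(" .").split()
    return bool(words) and words[-1] in INCOMPLETE_ENDINGS


def semantic_end_of_turn(
    transcript: str,
    silence_ms: int,
    threshold_ms: int = 500,
    max_wait_ms: int = 3000,
) -> bool:
    # Never end the turn before the acoustic threshold is reached.
    if not acoustic_end_of_turn(silence_ms, threshold_ms):
        return False
    # Keep waiting if the sentence looks unfinished, up to a hard cap.
    if looks_unfinished(transcript) and silence_ms < max_wait_ms:
        return False
    return True
```

With the post's example, a 700 ms pause after "I'd like to cancel my flight from Boston to..." triggers the acoustic detector, while the semantic check keeps waiting until the sentence is completed or the cap is reached.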
Gradium reposted
Neil Zeghidour
@neilzegh
May 28
Yesterday we hosted our first Voice AI Dinner in Berlin. Where should the next one be?
2.2K