Baseten (@baseten) / X

Baseten

2,470 posts

Baseten

@baseten

Inference is everything.

San Francisco and New York

Joined March 2021

Pinned
Baseten
@baseten
May 13
Intelligence should be defined by the people closest to the work. Intelligence should be owned by all of us. Let’s build a many model future!
Tuhin Srivastava
@tuhinone
May 13
Article
A many model future
Obsessives have always moved the world forward. They are responsible for our most beloved products, proudest scientific achievements, most moving art, the greatest leaps in what we're capable of....
33K
Baseten
@baseten
8h
"That's when they come to open-source models, that's when they come to Baseten, that's when they come to post-train models on Baseten, to be able to do it better, faster, and cheaper. That's when you get both intelligence everywhere and unit economics that make sense for your
Tuhin Srivastava
@tuhinone
8h
Thanks to @EdLudlow for having us on Bloomberg Tech yesterday to talk about our latest fundraise and the growing number of companies owning their open and specialized models.
00:00
3.1K
Baseten
@baseten
11h
Excited to be a day 0 launch partner for BioNeMo, NVIDIA's new, fully-open agent toolkit for scientific workflows! All 10 BioNeMo NIMs are available in our model library. Learn more in our announcement: baseten.co/blog/nvidia-bi…
00:35
NVIDIA Healthcare
@NVIDIAHealth
15h
Science is entering a new era - one where AI agents can do scientific work. 🧬 Today NVIDIA is launching the BioNeMo Agent Toolkit - an open, agent-ready toolkit that gives any AI agent callable tools for protein structure prediction, molecular docking, generative chemistry,
3.3K
Baseten reposted
Philip Kiely
@philipkiely
Jun 23
Article
How we built the world’s fastest API for GLM-5.2
GLM-5.2 is the biggest news in open models since DeepSeek-R1. It’s easy to see why. GLM-5.2 delivers comparable performance to GPT 5.5 and Opus 4.8 at a fraction of the cost, generally 70-80% less...
389K
Baseten reposted
Alex Ker 🔭
@thealexker
Jun 22
Tutorial on how to use GLM-5.2 in Claude Code (bookmark this) ~4.5x faster & ~5x cheaper compared to Opus 4.8! 1. Install the latest Claude Code npm install -g @Anthropic-ai/claude-code 2. Create an account at baseten.co. 3. Grab an API Key from
37K
Baseten reposted
Amir Haghighat
@amiruci
Jun 22
We have the fastest GLM-5.2 deployment on the market: >280 tok/s and <0.8s ttft, according to Artificial Analysis. This same performance carries across all post-trained variants. These aren’t vanity metrics. Optimizations like these save our customers tens of millions of dollars
Amir Haghighat
@amiruci
Jun 22
We closed our Series F today at a $13B valuation. Our inference business grew 20x in the last year. I want to explain why: The growth comes from a shift I think is permanent: companies want to own their intelligence layer. Instead of relying exclusively on closed models, teams
88K
Baseten reposted
Tuhin Srivastava
@tuhinone
Jun 22
The GLM moment is going to be bigger than the DeepSeek moment. Baseten has the fastest inference on the best open-weight model. >280 tps and <0.8 ttft.
Tuhin Srivastava
@tuhinone
Jun 22
Article
Announcing our Series F
Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
22K
Baseten
@baseten
Jun 22
The best open model with the best performance: GLM-5.2 runs at >280 TPS and <0.8s TTFT on Baseten. Try it here: baseten.co/library/glm-52/
00:00
Tuhin Srivastava
@tuhinone
Jun 22
Article
Announcing our Series F
Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
29K
Baseten reposted
Gamma
@GammaApp
Jun 22
Our co-founder and CPO @thatsjonsense is speaking on June 25th on "How to Choose a Model" alongside @sarahmsachs @oneill_c @baseten @NotionHQ. RSVP in thread!
7.3K
Baseten
@baseten
Jun 22
We’re excited to announce our $1.5B Series F. Baseten exists to help companies own their intelligence and run AI products in production with speed, reliability, and control. As we enter this next chapter, three things are clear: 1. Customers like Abridge, Clay, Cursor, Decagon,
Tuhin Srivastava
@tuhinone
Jun 22
Article
Announcing our Series F
Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
57K
Baseten reposted
Droid
@droid
Jun 19
Who's also using Droid with @baseten inference?
Dhruv Singal
@alphatozeta8148
Jun 19
@baseten model performance team is absolutely cracked. @Zai_org GLM 5.2 is now 4x faster running at full 1M context! Already available to use in your favorite coding harnesses, here it is COOKING in @FactoryAI Droid and @opencode Docs for how to get it in comments
00:00
11K
Baseten reposted
Dhruv Singal
@alphatozeta8148
Jun 19
@baseten model performance team is absolutely cracked. @Zai_org GLM 5.2 is now 4x faster running at full 1M context! Already available to use in your favorite coding harnesses, here it is COOKING in @FactoryAI Droid and @opencode Docs for how to get it in comments
00:00
37K
Baseten
@baseten
Jun 18
Most supervised fine-tuning (SFT) studies run on generic data. Ours run on production tasks, paired with evals our team spends weeks building. New post-training research from our team here.
Charlie O'Neill
@oneill_c
Jun 18
1/ We fine-tune a lot of customer models, so we decided to systematically try and figure out some best practices for finetuning. SFT isn't sexy, but it's still important. We vary one SFT lever at a time across 2 model families, dense + MoE to 235B, on 4 real-world customer
3.4K