Log inSign up
Baseten
2,470 posts
Image
user avatar
Baseten
@baseten
Inference is everything.
San Francisco and New York
baseten.co
Joined March 2021
349
Following
12.7K
Followers
  • Pinned
    user avatar
    Baseten
    @baseten
    May 13
    Intelligence should be defined by the people closest to the work. Intelligence should be owned by all of us. Let’s build a many model future!
    user avatar
    Tuhin Srivastava
    @tuhinone
    May 13
    Article cover image
    Article
    A many model future
    Obsessives have always moved the world forward. They are responsible for our most beloved products, proudest scientific achievements, most moving art, the greatest leaps in what we're capable of....
    33K
  • user avatar
    Baseten
    @baseten
    8h
    "That's when they come to open-source models, that's when they come to Baseten, that's when they come to post-train models on Baseten, to be able to do it better, faster, and cheaper. That's when you get both intelligence everywhere and unit economics that make sense for your
    user avatar
    Tuhin Srivastava
    @tuhinone
    8h
    Thanks to @EdLudlow for having us on Bloomberg Tech yesterday to talk about our latest fundraise and the growing number of companies owning their open and specialized models.
    Image
    00:00
    3.1K
  • user avatar
    Baseten
    @baseten
    11h
    Excited to be a day 0 launch partner for BioNeMo, NVIDIA's new, fully-open agent toolkit for scientific workflows! All 10 BioNeMo NIMs are available in our model library. Learn more in our announcement: baseten.co/blog/nvidia-bi…
    Image
    Image
    00:35
    user avatar
    NVIDIA Healthcare
    NVIDIA
    @NVIDIAHealth
    15h
    Science is entering a new era - one where AI agents can do scientific work. 🧬 Today NVIDIA is launching the BioNeMo Agent Toolkit - an open, agent-ready toolkit that gives any AI agent callable tools for protein structure prediction, molecular docking, generative chemistry,
    3.3K
  • Baseten reposted
    user avatar
    Philip Kiely
    Baseten
    @philipkiely
    Jun 23
    Article cover image
    Article
    How we built the world’s fastest API for GLM-5.2
    GLM-5.2 is the biggest news in open models since DeepSeek-R1. It’s easy to see why. GLM-5.2 delivers comparable performance to GPT 5.5 and Opus 4.8 at a fraction of the cost, generally 70-80% less...
    389K
  • Baseten reposted
    user avatar
    Alex Ker 🔭
    Baseten
    @thealexker
    Jun 22
    Tutorial on how to use GLM-5.2 in Claude Code (bookmark this) ~4.5x faster & ~5x cheaper compared to Opus 4.8! 1. Install the latest Claude Code npm install -g @Anthropic-ai/claude-code 2. Create an account at baseten.co. 3. Grab an API Key from
    Image
    37K
  • Baseten reposted
    user avatar
    Amir Haghighat
    Baseten
    @amiruci
    Jun 22
    We have the fastest GLM-5.2 deployment on the market: >280 tok/s and <0.8s ttft, according to Artificial Analysis. This same performance carries across all post-trained variants. These aren’t vanity metrics. Optimizations like these save our customers tens of millions of dollars
    Image
    user avatar
    Amir Haghighat
    Baseten
    @amiruci
    Jun 22
    We closed our Series F today at a $13B valuation. Our inference business grew 20x in the last year. I want to explain why: The growth comes from a shift I think is permanent: companies want to own their intelligence layer. Instead of relying exclusively on closed models, teams
    88K
  • Baseten reposted
    user avatar
    Tuhin Srivastava
    @tuhinone
    Jun 22
    The GLM moment is going to be bigger than the DeepSeek moment. Baseten has the fastest inference on the best open-weight model. >280 tps and <0.8 ttft.
    Image
    user avatar
    Tuhin Srivastava
    @tuhinone
    Jun 22
    Article
    Announcing our Series F
    Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
    22K
  • user avatar
    Baseten
    @baseten
    Jun 22
    The best open model with the best performance: GLM-5.2 runs at >280 TPS and <0.8s TTFT on Baseten. Try it here: baseten.co/library/glm-52/
    Image
    00:00
    user avatar
    Tuhin Srivastava
    @tuhinone
    Jun 22
    Article
    Announcing our Series F
    Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
    29K
  • Baseten reposted
    user avatar
    Gamma
    @GammaApp
    Jun 22
    Our co-founder and CPO @thatsjonsense is speaking on June 25th on "How to Choose a Model" alongside @sarahmsachs @oneill_c @baseten @NotionHQ. RSVP in thread!
    Image
    7.3K
  • user avatar
    Baseten
    @baseten
    Jun 22
    We’re excited to announce our $1.5B Series F. Baseten exists to help companies own their intelligence and run AI products in production with speed, reliability, and control. As we enter this next chapter, three things are clear: 1. Customers like Abridge, Clay, Cursor, Decagon,
    Image
    user avatar
    Tuhin Srivastava
    @tuhinone
    Jun 22
    Article
    Announcing our Series F
    Today, we are thrilled to announce Baseten’s $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital, co-led by Sands Capital and Wellington Management, with participation...
    57K
  • Baseten reposted
    user avatar
    Droid
    Factory
    @droid
    Jun 19
    Who's also using Droid with @baseten inference?
    user avatar
    Dhruv Singal
    Baseten
    @alphatozeta8148
    Jun 19
    @baseten model performance team is absolutely cracked. @Zai_org GLM 5.2 is now 4x faster running at full 1M context! Already available to use in your favorite coding harnesses, here it is COOKING in @FactoryAI Droid and @opencode Docs for how to get it in comments
    Image
    00:00
    11K
  • Baseten reposted
    user avatar
    Dhruv Singal
    Baseten
    @alphatozeta8148
    Jun 19
    @baseten model performance team is absolutely cracked. @Zai_org GLM 5.2 is now 4x faster running at full 1M context! Already available to use in your favorite coding harnesses, here it is COOKING in @FactoryAI Droid and @opencode Docs for how to get it in comments
    Image
    00:00
    37K
  • user avatar
    Baseten
    @baseten
    Jun 18
    Most supervised fine-tuning (SFT) studies run on generic data. Ours run on production tasks, paired with evals our team spends weeks building. New post-training research from our team here.
    user avatar
    Charlie O'Neill
    Baseten
    @oneill_c
    Jun 18
    1/ We fine-tune a lot of customer models, so we decided to systematically try and figure out some best practices for finetuning. SFT isn't sexy, but it's still important. We vary one SFT lever at a time across 2 model families, dense + MoE to 235B, on 4 real-world customer
    Image
    3.4K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement