Log inSign up
Fireworks AI
1,189 posts
Image
user avatar
Fireworks AI
@FireworksAI_HQ
From Inference to Intelligence
fireworks.ai
Joined September 2022
240
Following
22.8K
Followers
  • Fireworks AI reposted
    user avatar
    Richy Chen
    Fireworks AI
    @richychn
    Jun 19
    also hearing from other customers that GLM 5.2 approaches GPT 5.5 on their evals + use cases
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 19
    "...at least as good as Opus 4.8 and GPT 5.5."
    Image
    00:00
    6.7K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 19
    "...at least as good as Opus 4.8 and GPT 5.5."
    Image
    00:00
    37K
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 19
    Shout out to @jeremyphoward for the analysis.
    2.1K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 19
    There’s only one way to find out:
    user avatar
    Jeremy Howard
    @jeremyphoward
    Jun 18
    Wow. @Zai_org GLM 5.2 is a marvel! It is *at least* as good as Opus 4.8 and GPT 5.5. It's super fast, inexpensive, and not too verbose. It responds with nuance and judgement, & handles long context VERY well. I've never experienced an open weights model like this before.
    Image
    GLM 5.2 API & Playground | Fireworks AI
    From fireworks.ai
    12K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 18
    Recently, @BoardroomClub1 and @Yossigarmazi hosted @the_bunny_chen to talk open source, RFT, and why the infrastructure layer matters as much as the model itself. Listen now:
    Image
    open.spotify.com
    Open Source Models Could Outperform Frontier Models | Fireworks AI Co-Founder | Apex Series
    Boardroom Club · Episode
    5.4K
  • Fireworks AI reposted
    user avatar
    LangChain
    @LangChain
    Jun 17
    Replying to @LangChain and @Alibaba_Qwen
    We teamed up with @FireworksAI_HQ to answer the following question… How can we cost-effectively mine important signals from every single trace, while maintaining frontier performance? Read our LangChain Labs study ⤵️
    Image
    Building a 100x Cheaper Trace Judge with Fireworks
    From langchain.com
    5.9K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 17
    We're just getting photos back from New York @Techweek_ and we can't get enough of them. Thanks to everyone who came up to the roof to party with us. We'll see you next year!
    Image
    Image
    Image
    Image
    2.6K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 17
    Kimi 2.7 is now fully trainable on Fireworks. Feed your data into Kimi and build a moat that beats frontier models at lower costs. SFT. DPO. RL. Managed clicks or raw API with huge context, giant LoRA ranks & all the pro knobs. Get started: app.fireworks.ai/dashboard/fine…
    Image
    14K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 16
    GLM 5.2 is live on Fireworks, day zero. 1M-token context, coding‑first frontier model, independently validated on SWE‑bench, Terminal‑Bench, GPQA and AIME. Served on our own infrastructure the moment @Zai_org opened the weights. What’s under the hood ↓
    Image
    17K
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 16
    You may have seen other platforms announce day-zero support within minutes of weights dropping, but there is a key difference between inference providers and routers. Routers forward your GLM 5.2 calls to someone else’s endpoint (typically the model lab’s own API). Fireworks
    Image
    2.2K
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 16
    Benchmarks are just the starting point. The only eval that matters is the one on your codebase, your prompts, your latency SLOs. Ready to build? Drop  accounts/fireworks/models/glm-5p2  into your existing OpenAI/Anthropic‑compatible client and try it on real work. Learn
    Image
    1.7K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 15
    This is stepping stone for enabling customers to generate training data from traces and the lean into continuous post training and own their AI with their own data moat. We tip our hat to the @LangChain team for the incredible work.
    user avatar
    LangChain
    @LangChain
    Jun 15
    New LangChain Labs x @FireworksAI_HQ study
    11K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 15
    You can always choose to make your own frontier.
    user avatar
    Lin Qiao
    Fireworks AI
    @lqiao
    Jun 15
    Article
    Owning vs. Renting Intelligence
    Mythos got shut down this week. Whether you agreed with the decision or not is almost beside the point. A company built on top of intelligence it didn't control suddenly found itself exposed to...
    7.7K
  • Fireworks AI reposted
    user avatar
    Richy Chen
    Fireworks AI
    @richychn
    Jun 13
    the same tech that powered Kimi K2.6 Fast on Fire Pass now powers Kimi K2.7 Fast
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 13
    Replying to @FireworksAI_HQ
    Available now on Fireworks serverless. → Standard tier (pay per token) → Priority tier for critical workloads → Fast path coming soon Pricing: $0.95 / 1M input, $4 / 1M output, $0.19 / 1M cache hits. 256K context. Full details here: fireworks.ai/blog/kimi-k2p7…
    Image
    9.7K
  • Fireworks AI reposted
    user avatar
    Eric Glyman
    Ramp
    @eglyman
    Jun 12
    public benchmarks are saturated. every frontier model has trained against them, and the leaderboard tells you near nothing. we built ours from inside ramp — code no model has seen, graded against the bar our engineers ship to. every company running on AI needs its own.
    user avatar
    Ramp Labs
    Ramp
    @RampLabs
    Jun 12
    Today we’re releasing Ramp SWE-Bench: a private, production-grounded coding benchmark created from real engineering problems we've faced at Ramp.
    Image
    00:00
    47K
  • user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 13
    Moonshot released K2.7 Code, the latest in their K2 line of coding models, and it's live on Fireworks Day 0, on serverless and the API. It produces roughly 30% fewer reasoning tokens than K2.6 while scoring higher on Moonshot’s coding benchmarks. For agentic coding work, that
    Image
    164K
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 13
    In long agent loops, reasoning tokens get reused as context on every following turn. Shorter reasoning means smaller contexts downstream, faster generations, and fewer retries. K2.7 Code reduces that overhead without giving up quality, which lowers the real cost per completed
    2.3K
    user avatar
    Fireworks AI
    @FireworksAI_HQ
    Jun 13
    Available now on Fireworks serverless. → Standard tier (pay per token) → Priority tier for critical workloads → Fast path coming soon Pricing: $0.95 / 1M input, $4 / 1M output, $0.19 / 1M cache hits. 256K context. Full details here: fireworks.ai/blog/kimi-k2p7…
    Image
    12K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement