Fireworks AI (@FireworksAI

Fireworks AI

1,189 posts

Fireworks AI

@FireworksAI_HQ

From Inference to Intelligence

fireworks.ai

Joined September 2022

Fireworks AI reposted
Richy Chen
@richychn
Jun 19
also hearing from other customers that GLM 5.2 approaches GPT 5.5 on their evals + use cases
Fireworks AI
@FireworksAI_HQ
Jun 19
"...at least as good as Opus 4.8 and GPT 5.5."
00:00
6.7K
Fireworks AI
@FireworksAI_HQ
Jun 19
"...at least as good as Opus 4.8 and GPT 5.5."
00:00
37K
Fireworks AI
@FireworksAI_HQ
Jun 19
Shout out to @jeremyphoward for the analysis.
2.1K
Fireworks AI
@FireworksAI_HQ
Jun 19
There’s only one way to find out:
Jeremy Howard
@jeremyphoward
Jun 18
Wow. @Zai_org GLM 5.2 is a marvel! It is *at least* as good as Opus 4.8 and GPT 5.5. It's super fast, inexpensive, and not too verbose. It responds with nuance and judgement, & handles long context VERY well. I've never experienced an open weights model like this before.
GLM 5.2 API & Playground | Fireworks AI
From fireworks.ai
12K
Fireworks AI
@FireworksAI_HQ
Jun 18
Recently, @BoardroomClub1 and @Yossigarmazi hosted @the_bunny_chen to talk open source, RFT, and why the infrastructure layer matters as much as the model itself. Listen now:
open.spotify.com
Open Source Models Could Outperform Frontier Models | Fireworks AI Co-Founder | Apex Series
Boardroom Club · Episode
5.4K
Fireworks AI reposted
LangChain
@LangChain
Jun 17
Replying to @LangChain and @Alibaba_Qwen
We teamed up with @FireworksAI_HQ to answer the following question… How can we cost-effectively mine important signals from every single trace, while maintaining frontier performance? Read our LangChain Labs study ⤵️
Building a 100x Cheaper Trace Judge with Fireworks
From langchain.com
5.9K
Fireworks AI
@FireworksAI_HQ
Jun 17
We're just getting photos back from New York @Techweek_ and we can't get enough of them. Thanks to everyone who came up to the roof to party with us. We'll see you next year!
2.6K
Fireworks AI
@FireworksAI_HQ
Jun 17
Kimi 2.7 is now fully trainable on Fireworks. Feed your data into Kimi and build a moat that beats frontier models at lower costs. SFT. DPO. RL. Managed clicks or raw API with huge context, giant LoRA ranks & all the pro knobs. Get started: app.fireworks.ai/dashboard/fine…
14K
Fireworks AI
@FireworksAI_HQ
Jun 16
GLM 5.2 is live on Fireworks, day zero. 1M-token context, coding‑first frontier model, independently validated on SWE‑bench, Terminal‑Bench, GPQA and AIME. Served on our own infrastructure the moment @Zai_org opened the weights. What’s under the hood ↓
17K
Fireworks AI
@FireworksAI_HQ
Jun 16
You may have seen other platforms announce day-zero support within minutes of weights dropping, but there is a key difference between inference providers and routers. Routers forward your GLM 5.2 calls to someone else’s endpoint (typically the model lab’s own API). Fireworks
2.2K
Fireworks AI
@FireworksAI_HQ
Jun 16
Benchmarks are just the starting point. The only eval that matters is the one on your codebase, your prompts, your latency SLOs. Ready to build? Drop accounts/fireworks/models/glm-5p2 into your existing OpenAI/Anthropic‑compatible client and try it on real work. Learn
1.7K
Fireworks AI
@FireworksAI_HQ
Jun 15
This is stepping stone for enabling customers to generate training data from traces and the lean into continuous post training and own their AI with their own data moat. We tip our hat to the @LangChain team for the incredible work.
LangChain
@LangChain
Jun 15
New LangChain Labs x @FireworksAI_HQ study
11K
Fireworks AI
@FireworksAI_HQ
Jun 15
You can always choose to make your own frontier.
Lin Qiao
@lqiao
Jun 15
Article
Owning vs. Renting Intelligence
Mythos got shut down this week. Whether you agreed with the decision or not is almost beside the point. A company built on top of intelligence it didn't control suddenly found itself exposed to...
7.7K
Fireworks AI reposted
Richy Chen
@richychn
Jun 13
the same tech that powered Kimi K2.6 Fast on Fire Pass now powers Kimi K2.7 Fast
Fireworks AI
@FireworksAI_HQ
Jun 13
Replying to @FireworksAI_HQ
Available now on Fireworks serverless. → Standard tier (pay per token) → Priority tier for critical workloads → Fast path coming soon Pricing: $0.95 / 1M input, $4 / 1M output, $0.19 / 1M cache hits. 256K context. Full details here: fireworks.ai/blog/kimi-k2p7…
9.7K
Fireworks AI reposted
Eric Glyman
@eglyman
Jun 12
public benchmarks are saturated. every frontier model has trained against them, and the leaderboard tells you near nothing. we built ours from inside ramp — code no model has seen, graded against the bar our engineers ship to. every company running on AI needs its own.
Ramp Labs
@RampLabs
Jun 12
Today we’re releasing Ramp SWE-Bench: a private, production-grounded coding benchmark created from real engineering problems we've faced at Ramp.
00:00
47K
Fireworks AI
@FireworksAI_HQ
Jun 13
Moonshot released K2.7 Code, the latest in their K2 line of coding models, and it's live on Fireworks Day 0, on serverless and the API. It produces roughly 30% fewer reasoning tokens than K2.6 while scoring higher on Moonshot’s coding benchmarks. For agentic coding work, that
164K
Fireworks AI
@FireworksAI_HQ
Jun 13
In long agent loops, reasoning tokens get reused as context on every following turn. Shorter reasoning means smaller contexts downstream, faster generations, and fewer retries. K2.7 Code reduces that overhead without giving up quality, which lowers the real cost per completed
2.3K
Fireworks AI
@FireworksAI_HQ
Jun 13
Available now on Fireworks serverless. → Standard tier (pay per token) → Priority tier for critical workloads → Fast path coming soon Pricing: $0.95 / 1M input, $4 / 1M output, $0.19 / 1M cache hits. 256K context. Full details here: fireworks.ai/blog/kimi-k2p7…
12K