Applied Compute (@appliedcompute) / X

Applied Compute

182 posts

Applied Compute

@appliedcompute

The Best AI is Built Not Bought

San Francisco

Joined July 2012

Pinned
Applied Compute
@appliedcompute
Apr 8
Article
Applied Compute Raises $80M to Help Enterprises Advance from Generalized to Specific Intelligence
Models keep getting smarter, but there's a massive gap between raw intelligence and actual productivity on specific tasks inside companies. Delivering real value requires knowing how to perform those...
211K
Applied Compute
@appliedcompute
19m
We partnered with @harvey to post-train the state-of-the-art legal agent on their LAB benchmark. It surpasses Opus 4.8 Max and GPT-5.5 xhigh.
00:00
408
Applied Compute
@appliedcompute
19m
Replying to @appliedcompute
We rebuilt the agent harness to operate well in challenging long context environments. Legal source documents are huge, with the 90th-percentile LAB task carrying nearly 100k tokens and some exceeding 200k. We added compaction so the model summarizes its own transcript and
72
Applied Compute
@appliedcompute
19m
Read the full report:
appliedcompute.com
Training a State-of-the-Art Legal Agent with Harvey
How Applied Compute post-trained GLM-5.1 into the strongest available model on Harvey's Legal Agent Benchmark through full-stack optimization.
59
Applied Compute reposted
Gabe Pereyra
@gabepereyra
Jun 17
Model strategy for @harvey: We are working on the first model in our legal foundation model series, inspired by @cursor_ai's Composer. Two goals: 1. Allow us to serve frontier intelligence across our product surface areas at an affordable price and a strong security posture.
208K
Applied Compute
@appliedcompute
Jun 16
Preserving entropy is critical for continued training; in modern post-training recipes, entropy is often a fixed resource that gets exhausted over the course of a training run, making it difficult for the model to improve and learn on new tasks. Adaptive entropy control methods
17K
Applied Compute
@appliedcompute
Jun 16
Replying to @appliedcompute
The collapse also shows up in the answers themselves. Under various metrics of intra-prompt diversity, a policy trained with GRPO leads to less diverse responses than a trained with adaptive entropy control. Moreover, we observe that entropy allows response diversity to be tuned,
782
Applied Compute
@appliedcompute
Jun 16
Read the full research report:
Continued Training with Entropy Preserving RL
From appliedcompute.com
588
Applied Compute
@appliedcompute
Jun 15
The workflows that make you different shouldn't run on the same general models everyone else rents. Our co-founder @rhythmrg on when to train your own.
Rhythm Garg
@rhythmrg
Jun 15
Article
Should you post-train your own model?
General frontier models, both open and closed, are improving quickly. In many cases, they are the right starting point. If you are building a 0-to-1 prototype, trying to understand a workflow, or...
4.2K
Applied Compute reposted
Yash Patil
@ypatil125
Jun 14
When we started Applied Compute this was our thesis in a nutshell. "Companies need to turn their workflows, domain knowledge, and accumulated judgment into AI systems that improve with each use. Private evals should capture whether a model is actually improving against outcomes
Satya Nadella
@satyanadella
Jun 14
Article
A frontier without an ecosystem is not stable
I’ve been thinking a lot about the future of the firm in an AI-driven economy. This transition is different than any previous platform shift. In the past, we used digital systems to enhance human...
79K
Applied Compute
@appliedcompute
Jun 12
"A great eval needs to understand every correct answer, and every way one can go catastrophically wrong." @BrendanFoody from @mercor_ai shared with our CEO @ypatil125 how evals are deceptively the hardest part of post-training. Our team at Applied Compute solves this by
00:00
3.6K
Applied Compute
@appliedcompute
Jun 11
“RL is remarkably data efficient. You can specialize a model on exactly what your business needs, with surprisingly little data.” @BrendanFoody sat with our CEO @ypatil125 to discuss how RL flipped the equation from quantity to quality, so the proprietary data only you have can
00:00
5.2K
Applied Compute reposted
Sahar Zadeh
@sahar__zadeh
Jun 10
Article
Moats Need Models
For most of the last two years, the model was treated as a commodity input. You picked a frontier API, wrapped it in a clever harness, and built your product in the layer above. The model was a...
11K
Applied Compute
@appliedcompute
Jun 10
After working with both frontier labs and enterprises across industries, @mercor_ai CEO @BrendanFoody joined our CEO @ypatil125 to discuss why proprietary data and custom models are what keep a company competitive at the frontier.
00:00
18K
Applied Compute
@appliedcompute
Jun 4
@nvidia’s Nemotron 3 Ultra handles software-engineering tasks at a fraction of the per-task cost of frontier models. So we trained a router to send each coding task to the cheapest model that can successfully solve it, cutting inference cost while holding frontier-level quality.
40K
Applied Compute
@appliedcompute
Jun 4
Replying to @appliedcompute
The models are complementary. The trained router sends 73% of tasks to @NVIDIAAI's efficient Nemotron 3 Ultra and routes the long tail to GPT 5.5 and Opus 4.7 on tasks where frontier performance at a premium is worth the tradeoff. Since the router is agentic, it can call tools
1.3K
Applied Compute
@appliedcompute
Jun 4
Read the full research report:
Training an Agentic Router for Optimal Cost-Performance on SWE Tasks
From appliedcompute.com
652