Braintrust (@braintrust) / X

Braintrust

753 posts

Braintrust

@braintrust

The observability layer for production AI.

Joined August 2023

Pinned
Braintrust
@braintrust
Jun 1
Topics is now GA on all plans. Continuously find the patterns worth investigating across your production traffic.
00:00
2.7K
Braintrust
@braintrust
12h
The Agent Open panel lineup is live. On Jun 30th in San Francisco Braintrust and friends host a conversation with leaders building the infrastructure for shipping quality agents. Then, it's time for pickleball. First come, first serve. Literally.
278
Braintrust
@braintrust
12h
Join us → luma.com/the-agent-open
156
Braintrust
@braintrust
15h
There have been six generations of AI agents: - A simple prompt that asks a model a question. - A fixed pipeline that retrieves context and puts it into the prompt to get a result. - A react loop, in which the model decides what tools to call and in what order. - A
00:00
359
Braintrust
@braintrust
Jun 19
When you're building AI systems, you need to know what prompt your LLM received, what it returned, and how many tokens it used. And you need to log tool calls, retrieval, reasoning, and handoffs between subagents. OpenTelemetry is an OSS framework for capturing that data using
00:00
567
Braintrust
@braintrust
Jun 18
How do you make AI traces readable for non-engineers? Custom trace views in Braintrust transform a raw trace into a format that a subject matter expert can understand. For example, you can turn a customer support trace into a ticket card with the entire conversation, the
00:00
235
Braintrust
@braintrust
Jun 18
Braintrust now integrates with Azure AI Foundry, giving you access to OpenAI models and the full Azure model catalog, including Grok, Claude, and DeepSeek. Configure it through API key, Entra ID, or workload identity federation for secure cloud provider integrations.
215
Braintrust
@braintrust
Jun 18
Read the setup guide → braintrustdata.link/azure-foundry-…
137
Braintrust
@braintrust
Jun 17
The success or failure of an agent is measured differently depending on the needs of the business. Braintrust lets you define custom facets and track every production trace against what really matters. Braintrust is presenting a workshop on how to: - Define the dimensions that
298
Braintrust
@braintrust
Jun 17
Join us → braintrustdata.link/observability-…
227
Braintrust
@braintrust
Jun 17
Your AI tools should understand your log schemas, help debug failed evals, and answer questions about your model performance. Braintrust’s MCP server makes this possible with Cursor, Claude, VS Code, and more.
237
Braintrust
@braintrust
Jun 17
Read more →
AI that knows your data - Blog - Braintrust
From braintrust.dev
169
Braintrust
@braintrust
Jun 16
AI governance is entering a new phase. The EU AI Act enforces legal accountability for any company with EU customers, and ISO 42001 compliance is becoming a requirement in enterprise procurement. The teams best prepared for these governance frameworks are the teams that already
00:00
326
Braintrust
@braintrust
Jun 16
Every AI team is building with different frameworks, different model providers, and different languages. Braintrust is designed for this reality. Instrument once via SDKs or OpenTelemetry, and get consistent traces, evals, and debugging across your entire stack.
201
Braintrust
@braintrust
Jun 16
Read more →
How to use Braintrust with any framework or provider - Blog - Braintrust
From braintrust.dev
151
Braintrust reposted
claire vo 🖤
@clairevo
Jun 15
"What you do is you prioritize the top few benchmarks and then you probably bullshit the rest." We talk a lot about how AI is making coding easier for non-technical folks, but don't hear much about how the most elite engineers are delegating their most technically complex work.
00:00
18K
Braintrust
@braintrust
Jun 15
Because LLMs are non-deterministic, a sudden change in your eval score could just be due to variance. A binomial test tells you the probability of getting a certain result by chance, so you can be confident that an eval score isn't random.
00:00
209
Braintrust
@braintrust
Jun 12
How does your team rank when it comes to shipping quality AI products? Braintrust's AI quality assessment maps your current practices to the next useful step, whether you're still manually checking outputs or already running online scores in production.
00:00
361
Braintrust
@braintrust
Jun 12
Take the assessment →
AI quality assessment - Braintrust
From braintrust.dev
183