Braintrust
753 posts
Braintrust
@braintrust
The observability layer for production AI.
braintrust.dev
Joined August 2023
55 Following · 6,799 Followers
  • Pinned
    Braintrust
    @braintrust
    Jun 1
    Topics is now GA on all plans. Continuously find the patterns worth investigating across your production traffic.
    2.7K
  • Braintrust
    @braintrust
    12h
    The Agent Open panel lineup is live. On Jun 30th in San Francisco, Braintrust and friends host a conversation with leaders building the infrastructure for shipping quality agents. Then it's time for pickleball. First come, first serve. Literally.
    278
    Braintrust
    @braintrust
    12h
    Join us → luma.com/the-agent-open
    156
  • Braintrust
    @braintrust
    15h
    There have been six generations of AI agents:
    - A simple prompt that asks a model a question.
    - A fixed pipeline that retrieves context and puts it into the prompt to get a result.
    - A ReAct loop, in which the model decides what tools to call and in what order.
    - A
    359
  • Braintrust
    @braintrust
    Jun 19
    When you're building AI systems, you need to know what prompt your LLM received, what it returned, and how many tokens it used. And you need to log tool calls, retrieval, reasoning, and handoffs between subagents. OpenTelemetry is an OSS framework for capturing that data using
    567
  • Braintrust
    @braintrust
    Jun 18
    How do you make AI traces readable for non-engineers? Custom trace views in Braintrust transform a raw trace into a format that a subject matter expert can understand. For example, you can turn a customer support trace into a ticket card with the entire conversation, the
    235
  • Braintrust
    @braintrust
    Jun 18
    Braintrust now integrates with Azure AI Foundry, giving you access to OpenAI models and the full Azure model catalog, including Grok, Claude, and DeepSeek. Configure it through API key, Entra ID, or workload identity federation for secure cloud provider integrations.
    215
    Braintrust
    @braintrust
    Jun 18
    Read the setup guide → braintrustdata.link/azure-foundry-…
    137
  • Braintrust
    @braintrust
    Jun 17
    The success or failure of an agent is measured differently depending on the needs of the business. Braintrust lets you define custom facets and track every production trace against what really matters. Braintrust is presenting a workshop on how to: - Define the dimensions that
    298
    Braintrust
    @braintrust
    Jun 17
    Join us → braintrustdata.link/observability-…
    227
  • Braintrust
    @braintrust
    Jun 17
    Your AI tools should understand your log schemas, help debug failed evals, and answer questions about your model performance. Braintrust’s MCP server makes this possible with Cursor, Claude, VS Code, and more.
    237
    Braintrust
    @braintrust
    Jun 17
    Read more →
    AI that knows your data - Blog - Braintrust
    From braintrust.dev
    169
  • Braintrust
    @braintrust
    Jun 16
    AI governance is entering a new phase. The EU AI Act enforces legal accountability for any company with EU customers, and ISO 42001 compliance is becoming a requirement in enterprise procurement. The teams best prepared for these governance frameworks are the teams that already
    326
  • Braintrust
    @braintrust
    Jun 16
    Every AI team is building with different frameworks, different model providers, and different languages. Braintrust is designed for this reality. Instrument once via SDKs or OpenTelemetry, and get consistent traces, evals, and debugging across your entire stack.
    201
    Braintrust
    @braintrust
    Jun 16
    Read more →
    How to use Braintrust with any framework or provider - Blog - Braintrust
    From braintrust.dev
    151
  • Braintrust reposted
    claire vo 🖤
    @clairevo
    Jun 15
    "What you do is you prioritize the top few benchmarks and then you probably bullshit the rest." We talk a lot about how AI is making coding easier for non-technical folks, but don't hear much about how the most elite engineers are delegating their most technically complex work.
    18K
  • Braintrust
    @braintrust
    Jun 15
    Because LLMs are non-deterministic, a sudden change in your eval score could just be due to variance. A binomial test tells you the probability of seeing a result at least that extreme by chance alone, so you can judge whether a change in an eval score reflects a real difference rather than noise.
    209
  • Braintrust
    @braintrust
    Jun 12
    How does your team rank when it comes to shipping quality AI products? Braintrust's AI quality assessment maps your current practices to the next useful step, whether you're still manually checking outputs or already running online scores in production.
    361
    Braintrust
    @braintrust
    Jun 12
    Take the assessment →
    AI quality assessment - Braintrust
    From braintrust.dev
    183
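
The binomial test mentioned in the Jun 15 post can be sketched in a few lines of standard-library Python. This is a minimal illustration, not Braintrust's implementation: it assumes each eval case is an independent pass/fail trial, and the function names and the numbers in the example (a baseline pass rate of 0.80 and a new run passing 70 of 100 cases) are hypothetical.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k passes in n independent trials with pass rate p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def binomial_test(passes: int, trials: int, baseline_rate: float) -> float:
    """Exact two-sided p-value for a pass count under a baseline rate.

    Sums the probability of every outcome that is no more likely than
    the observed one, assuming the true pass rate equals baseline_rate.
    """
    observed = binomial_pmf(passes, trials, baseline_rate)
    # Small relative tolerance so float rounding does not drop tied outcomes.
    cutoff = observed * (1 + 1e-9)
    total = 0.0
    for k in range(trials + 1):
        pmf = binomial_pmf(k, trials, baseline_rate)
        if pmf <= cutoff:
            total += pmf
    return min(1.0, total)


# Hypothetical example: the baseline passes 80% of cases,
# and a new run passes 70 of 100 cases.
p_value = binomial_test(passes=70, trials=100, baseline_rate=0.80)
print(f"p-value: {p_value:.4f}")
```

A small p-value (a common but arbitrary threshold is 0.05) suggests the drop is unlikely to be variance alone. A large one means the run is consistent with the baseline rate.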
