Fun story from our internal testing on Claude 3 Opus. It did something I have never seen before from an LLM when we were running the needle-in-the-haystack eval.
For background, this tests a model’s recall ability by inserting a target sentence (the "needle") into a corpus of
Alex Albert