Bug0 reposted this
Every model since GPT-4 could write hundreds of unit tests effectively. That's not what's new with Claude Fable. What's new is that nobody's reading the code anymore. Fable ships the whole feature: implementation, tests, the lot. Earlier models wrote code you reviewed line by line. Now the diff is 4,000 lines, it arrived in twenty minutes, and reviewing it properly would take longer than writing it yourself. So teams skim it, see green tests, and merge. And before someone says "use a second model to write the tests": that fixes the wrong problem. Unit tests, whoever writes them, check that the code does what the code says. The bugs I've watched cost teams real money this year lived somewhere else entirely. Real auth meeting a real browser. A third-party API timing out at 2am while the checkout flow just... waited. None of that was in the diff. None of it ever is. The human checkpoint didn't get automated out of the pipeline. It got skipped. Someone still has to answer whether code nobody on your team read is safe to put in front of customers. We're seeing this play out in our pipeline at Bug0. The fastest-growing demand isn't for another testing tool, it's for our forward-deployed engineers: teams shipping AI-written code who want an actual human, outside the codebase, accountable for "safe to ship." The faster a team has gone with AI, the sooner they call. Full essay on where the QA work went: https://lnkd.in/grq6BTfn