Moss (YC F25) reposted this
I've been telling everyone Moss does sub-10ms retrieval. Time to prove it. So we did. We published full benchmark results on our GitHub repo this week. Open, reproducible, run them yourself. Here's why this mattered to me personally: 1/ I got tired of seeing infrastructure companies throw around performance numbers with no way to verify. I didn't want Moss to be one of those companies. If we're going to make the claim, the proof should be public. 2/ These aren't cherry-picked. We tested across real workloads, different dataset sizes, different query patterns. The results hold across the board. 3/ The number matters because of what it unlocks. Sub-10ms is the difference between a voice AI agent that feels instant and one that pauses. Between a search experience that's seamless and one where the user notices the wait. This was a bet we made early on. That speed would be the moat. That if we obsessed over single-digit millisecond retrieval, everything else would follow. The repo is open. The numbers speak for themselves. Link in comments.