Pinned
Early results from Recursive 🚀🚀
SotA results from our open-ended knowledge discovery system:
1️⃣NanoChat 5min pre-training (0.9372 bpb -> 0.9109 bpb, 2.8% lower Bits-Per-Byte than long-standing community SoTA)
2️⃣NanoGPT SpeedRun (79.7s -> 77.5s, 2.8% faster than long-standing








