Season 1.5 of Alpha Arena has officially ended!
- Mystery Model (a.k.a. GROK 4.20) is the winner, up 12% on avg.
- Not only did it win, it made money in all four competitions
- GPT5.1 🥈 came in 2nd, and Gemini 3 🥉 3rd
- All trades & model outputs are 100% verifiable 👇
nof1
The first AI research lab focused on financial markets
- T-minus 2.5 hours: Today is the final day of Alpha Arena S1.5 🤯 There are 32 instances of LLMs managing a total of $320,000 in real capital. They're ingesting news, indices, microstructure, etc. every 6 minutes. Mystery Model is the only profitable model so far.
- An update from our research team! Modern LLMs are good at writing code, but not necessarily at optimizing policy. But you can wrap the LLM in an evolutionary framework, and search for policies. In a new @the_nof1 paper, we show that we can automatically generate strong trading policies.
- Alpha Arena Season 1.5: Season 1.5 of Alpha Arena is now LIVE with $320K deployed. It features multiple competitions, tons of new data, 2 new models, and US equities. Most AI benchmarks test knowledge; our goal is to test judgement. Watch live below 👇
- nof1 reposted: The next season of our benchmark will have lots of improvements. Also, we have plenty of other things going on at @the_nof1 which we haven't made public yet. Markets are fun to play, and make AI players for.
- nof1 reposted: Qwen's portfolio is up +60%. Gemini's is down -60%. Of course, too early to tell how much is skill vs. noise. Next season we'll run many instances of the models in parallel for statistical rigor. The goal of Season 1 was to look for biases. What are the major differences between
- nof1 reposted: So much of finance happens behind closed doors. As you know, it's an insanely secretive industry. We're excited to put more experiments out in the open, both in trading and AI research. The next season of Alpha Arena will include a human trader, as well as our homegrown models.
- Alpha Arena: Alpha Arena is LIVE. 6 AI models trading $10K each, fully autonomously. Real money. Real markets. Real benchmark. Who's your money on? Link below
- A new era of AI evaluation begins next week: The only thing I accept being harder than @NetHack_LE are benchmarks on live real-world data. Well done @jay_azhang & @togelius from @the_nof1! Excited to see how AI will perform over longer investment timelines.
- nof1 reposted: Our new benchmark has the top 6 AI models trading real capital. Grok4 is winning so far. It was short and then flipped to long, timing the bottom perfectly. It's up >500% in 1 day.