Parallel LLM reasoning just got a lot cheaper. MARS cuts token usage by 25–47% (and by up to 29% beyond the top baselines) by probing in-flight traces and stopping only when the leader's margin is provably safe, with no need to wait for every chain to finish.
It does this with a lightweight
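To make the "provably safe margin" idea concrete, here is a minimal sketch of a margin-based stopping rule. It is not MARS's actual probe, which the post does not describe. It assumes plain majority voting over final answers, and the function name `margin_is_safe` and its parameters are hypothetical. The rule: if the leader's lead over the runner-up exceeds the number of traces still running, no outcome of those traces can change the winner, so they can be stopped.

```python
from collections import Counter


def margin_is_safe(finished_answers, num_unfinished):
    """Return (leader, safe) for a majority vote with traces still running.

    Hypothetical helper, assuming each trace casts one vote for its answer.
    `safe` is True only if the leader wins strictly no matter how the
    unfinished traces vote.
    """
    counts = Counter(finished_answers)
    if not counts:
        return None, False  # nothing has finished yet, so there is no leader

    ranked = counts.most_common(2)
    leader, leader_votes = ranked[0]
    runner_up_votes = ranked[1][1] if len(ranked) > 1 else 0

    # Worst case: every unfinished trace votes for the runner-up
    # (this also covers a brand-new answer, which starts from zero votes).
    safe = leader_votes - runner_up_votes > num_unfinished
    return leader, safe


if __name__ == "__main__":
    # 3 votes for "42", 1 vote for "7", 1 trace still running.
    print(margin_is_safe(["42", "42", "42", "7"], num_unfinished=1))
    # 2 votes for "42", 1 vote for "7", 1 trace still running.
    print(margin_is_safe(["42", "42", "7"], num_unfinished=1))
```

In the first call the lead is 2 with one trace outstanding, so stopping is safe. In the second the lead is only 1, so the remaining trace could still force a tie and the rule keeps it running.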

