Jon Richens (@jonathanrichens) / X

Jon Richens

144 posts

Jon Richens

@jonathanrichens

Research scientist in AI safety @GoogleDeepMind

Joined August 2020

Pinned
Jon Richens
@jonathanrichens
Jun 4, 2025
Are world models necessary to achieve human-level agents, or is there a model-free short-cut? Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
186K
Jon Richens
@jonathanrichens
Feb 19, 2024
Excited to share our new paper arxiv.org/abs/2402.10877 (Oral, ICLR 2024, w/ @tom4everitt, @GoogleDeepMind). In it we answer the question, do agents need to learn causal world models? arxiv.org/abs/2402.10877. 🧵
73K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Turns out there’s a neat answer to this question. We prove that any agent capable of generalizing to a broad range of simple goal-directed tasks must have learned a predictive model capable of simulating its environment. And this model can always be recovered from the agent.
8.2K
Jon Richens
@jonathanrichens
May 7, 2024
Our paper, 'robust agents learn causal world models' got an honourable mention in the outstanding paper awards at #ICLR2024. Check out our talk in 20 mins in hall A3, or come chat with @tom4everitt and I at the poster session after!
ICLR
@iclr_conf
May 7, 2024
Announcing the #ICLR2024 Outstanding Paper Awards: blog.iclr.cc/2024/05/06/icl… Shoutout to the awards committee: @eunsolc, @katjahofmann, @liu_mingyu, @nanjiang_cs, @guennemann, @optiML, @tkipf, @CevherLIONS
4.1K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Specifically, we show it’s possible to recover a bounded error approximation of the environment transition function from any goal-conditional policy that satisfies a regret bound across a wide enough set of simple goals, like steering the environment into a desired state.
5.3K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
… and many more! Check out our paper arxiv.org/pdf/2506.01622, or come chat to me at #ICML2025. Joint work @GoogleDeepMind with @dabelcs, @alexis_bellot_, @tom4everitt
3.6K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
No model-free path. If you want to train an agent capable of a wide range of goal-directed tasks, you can’t avoid the challenge of learning a world model. And to improve performance or generality, agents need to learn increasingly accurate and detailed world models.
5.1K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
And to achieve lower regret, or more complex goals, agents must learn increasingly accurate world models. Goal-conditioned policies are informationally equivalent to world models! But only for goals over mutli-step horizons, myopic agents do not need to learn world models.
4.7K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
World models are foundational to goal-directedness in humans, but are hard to learn in messy open worlds. We're now seeing generalist, model-free agents (Gato, PaLM-E, Pi-0…). Do these agents learn implicit world models, or have they found another way to generalize to new tasks?
7.3K
Jon Richens
@jonathanrichens
Nov 25, 2022
Nice to see our work 'counterfactual harm' is a highlighted @DeepMind paper at #NeurIPS2022 this year. Interesting omen that 3 of the 9 highlighted papers use causality (all of them in the responsible AI category).
Google DeepMind
@GoogleDeepMind
Nov 25, 2022
Going to @NeurIPSConf? We’ll be presenting our latest research including: 🔵 Language models Chinchilla and Flamingo 🔵 New papers on algorithmic advances and optimising #RL 🔵 How we’re developing ethical and fair AI systems And much more: dpmd.ai/neurips-tw #NeurIPS2022
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Fundamental limitations on agency. In environments where the dynamics are provably hard to learn, or where long-horizon prediction is infeasible, the capabilities of agents are fundamentally bounded.
4.3K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Causality. In previous work we showed a causal world model is needed for robustness. It turns out you don’t need as much causal knowledge of the environment for task generalization. There is a causal hierarchy, but for agency and agent capabilities, rather than inference!
4K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Extracting world knowledge from agents. We derive algorithms that recover a world model given the agent’s policy and goal (policy + goal -> world model). These algorithms complete the triptych of planning (world model + goal -> policy) and IRL (world model + policy -> goal).
4K
Jon Richens
@jonathanrichens
Jun 4, 2025
Replying to @jonathanrichens
Emergent capabilities. To minimize training loss across many goals, agents must learn a world model, which can solve tasks the agent was not explicitly trained on. Simple goal-directedness gives rise to many capabilities (social cognition, reasoning about uncertainty, intent…).
3.5K