Log inSign up
Jon Richens
144 posts
user avatar
Jon Richens
@jonathanrichens
Research scientist in AI safety @GoogleDeepMind
Joined August 2020
359
Following
1,129
Followers
  • Pinned
    user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Are world models necessary to achieve human-level agents, or is there a model-free short-cut? Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
    Image
    186K
  • user avatar
    Jon Richens
    @jonathanrichens
    Feb 19, 2024
    Excited to share our new paper arxiv.org/abs/2402.10877 (Oral, ICLR 2024, w/ @tom4everitt, @GoogleDeepMind). In it we answer the question, do agents need to learn causal world models? arxiv.org/abs/2402.10877. 🧵
    Image
    73K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Turns out there’s a neat answer to this question. We prove that any agent capable of generalizing to a broad range of simple goal-directed tasks must have learned a predictive model capable of simulating its environment. And this model can always be recovered from the agent.
    Image
    8.2K
  • user avatar
    Jon Richens
    @jonathanrichens
    May 7, 2024
    Our paper, 'robust agents learn causal world models' got an honourable mention in the outstanding paper awards at #ICLR2024. Check out our talk in 20 mins in hall A3, or come chat with @tom4everitt and I at the poster session after!
    user avatar
    ICLR
    @iclr_conf
    May 7, 2024
    Announcing the #ICLR2024 Outstanding Paper Awards: blog.iclr.cc/2024/05/06/icl… Shoutout to the awards committee: @eunsolc, @katjahofmann, @liu_mingyu, @nanjiang_cs, @guennemann, @optiML, @tkipf, @CevherLIONS
    4.1K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Specifically, we show it’s possible to recover a bounded error approximation of the environment transition function from any goal-conditional policy that satisfies a regret bound across a wide enough set of simple goals, like steering the environment into a desired state.
    Image
    5.3K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    … and many more! Check out our paper arxiv.org/pdf/2506.01622, or come chat to me at #ICML2025. Joint work @GoogleDeepMind with @dabelcs, @alexis_bellot_, @tom4everitt
    3.6K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    No model-free path. If you want to train an agent capable of a wide range of goal-directed tasks, you can’t avoid the challenge of learning a world model. And to improve performance or generality, agents need to learn increasingly accurate and detailed world models.
    5.1K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    And to achieve lower regret, or more complex goals, agents must learn increasingly accurate world models. Goal-conditioned policies are informationally equivalent to world models! But only for goals over mutli-step horizons, myopic agents do not need to learn world models.
    4.7K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    World models are foundational to goal-directedness in humans, but are hard to learn in messy open worlds. We're now seeing generalist, model-free agents (Gato, PaLM-E, Pi-0…). Do these agents learn implicit world models, or have they found another way to generalize to new tasks?
    7.3K
  • user avatar
    Jon Richens
    @jonathanrichens
    Nov 25, 2022
    Nice to see our work 'counterfactual harm' is a highlighted @DeepMind paper at #NeurIPS2022 this year. Interesting omen that 3 of the 9 highlighted papers use causality (all of them in the responsible AI category).
    user avatar
    Google DeepMind
    @GoogleDeepMind
    Nov 25, 2022
    Going to @NeurIPSConf? We’ll be presenting our latest research including: 🔵 Language models Chinchilla and Flamingo 🔵 New papers on algorithmic advances and optimising #RL 🔵 How we’re developing ethical and fair AI systems And much more: dpmd.ai/neurips-tw #NeurIPS2022
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Fundamental limitations on agency. In environments where the dynamics are provably hard to learn, or where long-horizon prediction is infeasible, the capabilities of agents are fundamentally bounded.
    4.3K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Causality. In previous work we showed a causal world model is needed for robustness. It turns out you don’t need as much causal knowledge of the environment for task generalization. There is a causal hierarchy, but for agency and agent capabilities, rather than inference!
    Image
    4K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Extracting world knowledge from agents. We derive algorithms that recover a world model given the agent’s policy and goal (policy + goal -> world model). These algorithms complete the triptych of planning (world model + goal -> policy) and IRL (world model + policy -> goal).
    Image
    4K
  • user avatar
    Jon Richens
    @jonathanrichens
    Jun 4, 2025
    Replying to @jonathanrichens
    Emergent capabilities. To minimize training loss across many goals, agents must learn a world model, which can solve tasks the agent was not explicitly trained on. Simple goal-directedness gives rise to many capabilities (social cognition, reasoning about uncertainty, intent…).
    Image
    3.5K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement