Thomas Fel (@thomas_fel

Thomas Fel

1,626 posts

Thomas Fel

@thomas_fel_

Interpretability, Visual Intelligence @GoodfireAI. Prev: @Harvard, @Google, @BrownUniversity (@tserre lab). Crêpe lover.

San Francisco, CA

Joined February 2017

Pinned
Thomas Fel
@thomas_fel_
May 7
Happy to share my first post since joining Goodfire. Neural geometry has been my obsession for years, and our team here is building a really serious research agenda around it. I can't wait to share the series of papers coming over the next few weeks... Brace for shapes 🍩
Goodfire
@GoodfireAI
May 7
Neural networks might speak English, but they think in shapes. Understanding their rich *neural geometry* is key to understanding how they work – and to debugging and controlling them with precision. Starting today, we’re releasing a series of posts on this research agenda. 🧵
00:00
78K
Thomas Fel
@thomas_fel_
Oct 14, 2025
🕳️🐇Into the Rabbit Hull – Part I (Part II tomorrow) An interpretability deep dive into DINOv2, one of vision’s most important foundation models. And today is Part I, buckle up, we're exploring some of its most charming features.
00:00
64K
Thomas Fel
@thomas_fel_
Sep 28, 2025
Phenomenology → principle → method. From observed phenomena in representations (conditional orthogonality) we derive a natural instantiation. And it turns out to be an old friend: Matching Pursuit! 📄 arxiv.org/abs/2506.03093 See you in San Diego, @NeurIPSConf 🎉
28K
Thomas Fel
@thomas_fel_
Oct 29, 2024
🎭Recent work shows that models’ inductive biases for 'simpler' features may lead to shortcut learning. What do 'simple' vs 'complex' features look like? What roles do they play in generalization? Our new paper explores these questions. arxiv.org/pdf/2407.06076 #Neurips2024
44K
Thomas Fel
@thomas_fel_
Jul 20, 2025
Great excuse to share something I really love: 1-Lipschitz nets. They give clean theory, certs for robustness, the right loss for W-GANs, even nicer grads for explainability!! Yet are still niche. Here’s a speed-run through some of my favorite papers on the field. 🧵👇
cider
@jeffreycider
Jul 20, 2025
optimization theorem: "assume a lipschitz constant L..." the lipschitz constant:
60K
Thomas Fel
@thomas_fel_
Oct 15, 2025
🕳️🐇Into the Rabbit Hull – Part II Continuing our interpretation of DINOv2, the second part of our study concerns the geometry of concepts and the synthesis of our findings toward a new representational phenomenology: the Minkowski Representation Hypothesis
57K
Thomas Fel
@thomas_fel_
Mar 13, 2025
Train your vision SAE on Monday, then again on Tuesday, and you'll find only about 30% of the learned concepts match. ⚓ We propose Archetypal SAE which anchors concepts in the real data’s convex hull, delivering stable and consistent dictionaries. arxiv.org/pdf/2502.12892…
43K
Thomas Fel
@thomas_fel_
Sep 4, 2023
👋 Explain big vision model with 𝐂𝐑𝐀𝐅𝐓 🪄🐰 A method that 𝙖𝙪𝙩𝙤𝙢𝙖𝙩𝙞𝙘𝙖𝙡𝙡𝙮 extracts the most important concepts for your favorite pre-trained vision model. e.g., we automatically discover the most important concepts on a ResNet50 for rabbits: eyes, ears, fur. 🧶
56K
Thomas Fel
@thomas_fel_
Feb 22, 2024
👋👨‍🍳🍵 After a year of cooking up a secret project, I'm thrilled to officially reveal: The 𝐋𝐄𝐍𝐒 𝐏𝐫𝐨𝐣𝐞𝐜𝐭. By combining modern tools of Explainable AI, how much can we explain a ResNet50? 🧶
00:00
42K
Thomas Fel
@thomas_fel_
Sep 17, 2025
Check out our COLM 2025 (oral) 🎤 SAEs reveal that VLM embedding spaces aren’t just "image vs. text" cones. They contain stable conceptual directions, some forming surprising bridges across modalities. 1/2
10K
Thomas Fel
@thomas_fel_
Mar 25, 2025
An underrated paper. “Just because the information is here doesn’t mean it’s easily extractable.” Think about AES-256 a file —data is there, but the route back is complex. This is what we used to understand which features are easy vs. complex for NN arxiv.org/abs/2407.06076
Jack Morris
@jxmnop
Mar 24, 2025
# A new type of information theory this paper is not super well-known but has changed my opinion of how deep learning works more than almost anything else it says that we should measure the amount of information available in some representation based on how *extractable* it is,
7.4K
Thomas Fel
@thomas_fel_
Feb 5, 2023
𝐇𝐨𝐥𝐢𝐬𝐭𝐢𝐜𝐚𝐥𝐥𝐲 𝐄𝐱𝐩𝐥𝐚𝐢𝐧𝐚𝐛𝐥𝐞 𝐕𝐢𝐬𝐢𝐨𝐧 𝐓𝐫𝐚𝐧𝐬𝐟𝐨𝐫𝐦𝐞𝐫 arxiv.org/abs/2301.08669 Another important work for explainability. The authors reformulate the B-cos network for transformers by showing that each usual Vision Transformer blocks...
13K
Thomas Fel
@thomas_fel_
Apr 3, 2024
Personal update: I'm truly honored and thrilled to announce that I'll be joining Harvard this September as one of the @KempnerInst Research Fellows! 🥳
Kempner Institute at Harvard University
@KempnerInst
Apr 2, 2024
Thrilled to announce the 2024 recipients of #KempnerInstitute Research Fellowships: Thomas Fel, Mikail Khona, Bingbin Liu, Isabel Papadimitriou, Noor Sajid, & Aaron Walsman! bit.ly/4aBQ6MS @Napoolar @KhonaMikail @BingbinL @isabelpapad @nsajidt @aaronwalsman
12K
Thomas Fel
@thomas_fel_
Dec 10, 2023
Super excited to be at @NeurIPSConf this year, presenting 4 articles focused on explainability! 😉 If you're attending the conference and would like to chat or take a coffee feel free to reach out! 👋☕
12K