Log inSign up
Eric J. Michaud
315 posts
Image
user avatar
Eric J. Michaud
@ericjmichaud_
Trying to make deep neural networks among the best understood objects in the universe. šŸ’»šŸ¤–šŸ§ šŸ‘½šŸ”­šŸš€
Berkeley
ericjmichaud.com
Joined February 2015
1,266
Following
4,143
Followers
  • Pinned
    user avatar
    Eric J. Michaud
    @ericjmichaud_
    Jan 13
    How does scaling up neural networks change what they learn? Despite its importance, our understanding of this question remains nascent. I've written a long post reflecting on my model of neural scaling and its relationship to interpretability, etc.: ericjmichaud.com/quanta
    Image
    00:00
    361K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Nov 21, 2023
    Replying to @dwarkesh_sp
    tl;dr: Maybe learning simple things (basic knowledge, heuristics, etc) actually lowers the loss more than learning sophisticated things (algorithms associated with higher cognition that we really care about), and the sophisticated things will eventually be learned as scaling
    user avatar
    xuan (ɕɄɛn / sh-yen)
    @xuanalogue
    Jun 8, 2023
    I respect Jacob a lot but I find it really difficult to engage with predictions of LLM capabilities that presume some version of the scaling hypothesis will continue to hold - it just seems highly implausible given everything we already know about the limits of transformers!
    149K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Feb 10, 2024
    Our group has a new preprint out, in which we make some very tentative steps towards translating trained neural networks into code: arxiv.org/abs/2402.05110 Quick summary/thoughts 🧵:
    Image
    106K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    May 22, 2025
    Today, the most competent AI systems in almost *any* domain (math, coding, etc.) are broadly knowledgeable across almost *every* domain. Does it have to be this way, or can we create truly narrow AI systems? In a new preprint, we explore some questions relevant to this goal...
    Image
    GIF
    61K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Oct 31, 2024
    If you cluster language model features (from SAEs) into two groups based on whether they tend to fire together in the same document, you find two "lobes" of features that also turn out to be geometrically distinct! Math vs. prose features separate in this t-SNE plot...
    Image
    00:00
    Image
    user avatar
    David D. Baek
    @dbaek__
    Oct 29, 2024
    1/6 New paper! ā€œThe Geometry of Concepts: Sparse Autoencoder Feature Structure.ā€ We find that the concept universe of SAE features has interesting structure at three levels: 1) ā€œatomicā€ small-scale, 2) ā€œbrainā€ intermediate-scale, and 3) ā€œgalaxyā€ large-scale!
    157K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Jun 2, 2025
    I've moved to SF and am working at @GoodfireAI this summer! Excited to be here and to spend time with many friends, old and new.
    Image
    28K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Oct 12, 2024
    Earlier this year I tried writing a doc to encourage the MIT Department of Physics to create a new PhD specialization in ML. The doc ended up being a little vague and so I didn't push it. But in light of the Physics Nobel this week, I thought I'd share it here:
    Image
    Image
    31K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Mar 24, 2023
    Understanding the origin of neural scaling laws and the emergence of new capabilities with scale is key to understanding what deep neural networks are learning. In our new paper, @tegmark, @ZimingLiu11, @uzpg_ and I develop a theory of neural scaling. 🧵:
    arXiv logo
    arxiv.org
    The Quantization Model of Neural Scaling
    We propose the Quantization Model of neural scaling laws, explaining both the observed power law dropoff of loss with model and data size, and also the sudden emergence of new capabilities with...
    81K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Nov 29, 2023
    *The Space of LLM Learning Curves* The mean loss improves smoothly over LLM training. But this averages over very many loss curves on individual tokens. I've made some interactive visualizations for exploring the per-token curves: ericjmichaud.com/llm-curve-visu… Demo & observations:
    Image
    00:00
    12K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Apr 5, 2025
    These discussions about AI timelines & risk often seem to hinge on a basic difference in intuition for what intelligence is and how powerful it is...
    user avatar
    Dwarkesh Patel
    @dwarkesh_sp
    Apr 3, 2025
    The @slatestarcodex & @DKokotajlo episode. Scott and Daniel break down every month from now until the 2027 intelligence explosion. Misaligned hive minds, Xi and Trump waking up, automated Ilyas accelerating AI progress. I went in quite skeptical. But I learned a tremendous
    Image
    00:00
    23K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Oct 26, 2022
    New preprint out with @ZimingLiu11 and @tegmark on "Precision Machine Learning". In this paper, we consider what becomes involved when you care about the difference between approximating a function with error 0.001 vs 0.00000000000001 error. arxiv.org/abs/2210.13447
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    May 24, 2024
    Our group has a new preprint out with some observations about the structure of language model features. I'll share some additional reflections on what motivated this work, how this relates to some phenomena in SAEs, some uncertainties, and its implications for interpretability:
    user avatar
    Josh Engels
    @JoshAEngels
    May 24, 2024
    1/10 New paper! "Not All Language Model Features Are Linear." Prior work says language model features are linear, but we find some that are multi-dimensional! How can we auto-find these multi-d features? Do models really use them? What even is a multi-d feature? Answers below🧵
    Image
    7.8K
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Jun 18, 2022
    Last month, we put out a preprint led by @ZimingLiu11 on the phenomena of "grokking" in deep learning. Here's a blog post with some videos and additional discussion to accompany the paper: ericjmichaud.com/grokking-squar…
    Image
    GIF
  • user avatar
    Eric J. Michaud
    @ericjmichaud_
    Nov 19, 2024
    Since the internal structure of neural networks, through training, comes to reflect the structure of the external world, advances in interpretability and our understanding of neural computation more generally could have a huge impact across science over the coming years...
    user avatar
    Elana Simon
    @ElanaPearl
    Nov 18, 2024
    🧬What are protein language models (PLMs) actually learning about biology? Our paper introduces InterPLM - a framework that reveals interpretable features in PLMs using sparse autoencoders, giving us a window into how these models represent protein structure and function. 🧵(1/9)
    Image
    3.6K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsĀ·PrivacyĀ·CookiesĀ·AccessibilityĀ·Ads InfoĀ·Ā© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement