Log inSign up
Andy Keller
340 posts
Image
user avatar
Andy Keller
@t_andy_keller
Postdoctoral Fellow at The Kempner Institute at Harvard University -- Somewhere between Brains & Bits. PhD at UvA, Intern @ Apple MLR, Prev @ Intel AI & Nervana
AKAndyKeller.github.io
Joined March 2014
1,513
Following
4,403
Followers
  • Pinned
    user avatar
    Andy Keller
    @t_andy_keller
    Jul 22, 2025
    Why do video models handle motion so poorly? It might be lack of motion equivariance. Very excited to introduce: Flow Equivariant RNNs (FERNNs), the first sequence models to respect symmetries over time. Paper: arxiv.org/abs/2507.14793 Blog: kempnerinstitute.harvard.edu/research/deepe… 1/🧵
    Image
    GIF
    58K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    In the physical world, almost all information is transmitted through traveling waves -- why should it be any different in your neural network? Super excited to share recent work with the brilliant @mozesjacobs: "Traveling Waves Integrate Spatial Information Through Time" 1/14
    Image
    GIF
    760K
  • user avatar
    Andy Keller
    @t_andy_keller
    Jul 26, 2023
    Traveling waves are known to exist throughout the brain in a variety of forms — there are many hypotheses, but their exact computational role is debated. Together with @wellingmax we built an RNN which exhibits traveling waves to see what it could do. Here’s what we think: 1/7
    Image
    GIF
    122K
  • user avatar
    Andy Keller
    @t_andy_keller
    May 10, 2024
    Traveling waves are indicative of conserved quantities. In the brain, there is undeniable evidence for traveling waves of neural activity -- but what is the brain trying to conserve? In our ICLR paper with @wellingmax, @_mullerlab, & @sejnowski, we ask: could it be memory? 🌊/9
    Image
    GIF
    76K
  • user avatar
    Andy Keller
    @t_andy_keller
    Sep 7, 2021
    Together with @wellingmax, we think deep learning needs more organization and structure... topographic organization and equivariant structure 😁 Introducing our new paper: Topographic VAEs learn Equivariant Capsules 📃arxiv.org/abs/2109.01394 🧬github.com/AKAndykeller/T… 1/6
    Overview of the Topographic Variational Autoencoder with shifting temporal coherence. The combined color/rotation transformation in input space τ becomes encoded as a Roll within the capsule dimension. The model is thus able decode unseen sequence elements by encoding a partial sequence and Rolling activations within the capsules. We see this resembles a commutative diagram.
  • user avatar
    Andy Keller
    @t_andy_keller
    Nov 17, 2020
    Excited to share my first paper! arxiv.org/abs/2011.07248 Self Normalizing Flows -- An efficient training method for unconstrained normalizing flows. Joint work w/ the ever supportive @jornpeters, @priyankjaini, @emiel_hoogeboom, Patrick Forré & @wellingmax 1/5
    Image
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    Just as ripples in water carry information across a pond, traveling waves of activity in the brain have long been hypothesized to carry information from one region of cortex to another*; but how can a neural network actually leverage this information?*cell.com/neuron/fulltex… 2/14
    Image
    Traveling Waves in Visual Cortex
    From cell.com
    27K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    For all the technical details and more ablations, please see our paper recently accepted in workshop-form at ICLR Re-Align, and full pre-print on ArXiv. Code: github.com/KempnerInstitu… Paper: arxiv.org/abs/2502.06034 Hope to see you in Singapore! 🇸🇬 Fin/
    Image
    GitHub - KempnerInstitute/traveling-waves-integrate: Repository to create traveling waves integrate...
    From github.com
    13K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    Inspired by Mark Kac’s famous question, "Can one hear the shape of a drum?" we thought: Maybe a neural network can use wave dynamics to integrate spatial information and effectively "hear" visual shapes... To test this, we tried feeding images of squares to a wave-based RNN: 3/14
    Image
    GIF
    27K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    We found that, in-line with theory, we could reliably predict the area of the drum analytically by looking at the fundamental frequency of oscillations of each neuron in our hidden state. But is this too simple? How much further can we take it if we add learnable parameters? 4/14
    Image
    GIF
    22K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    If you want more visualizations, a bit more depth, and even some audio of what different images 'sound' like to our models, please check out our @KempnerInst blog-post! kempnerinstitute.harvard.edu/research/deepe… 13/14
    Image
    Traveling Waves Integrate Spatial Information Through Time - Kempner Institute
    From kempnerinstitute.harvard.edu
    12K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    We made wave dynamics flexible by adding learned damping and natural frequency encoders, allowing hidden state dynamics to adapt based on the input stimulus. On simple polygon images, we found the model learned to use these parameters to produce shape-specific wave dynamics: 6/14
    Image
    GIF
    22K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    Looking at the Fourier transform of the resulting neural oscillations at each point in the hidden state, we then saw that the model learned to produce different frequency spectra for each shape, meaning each neuron really was able to 'hear' which shape it was a part of! 7/14
     Plot of five representative frequency bins from the FFT of the dynamics of our wave-RNN on the shape task. We see different shapes pop out in different bins, indicating that they 'sound' different, and allowing the model to uniquely classify each shape. On the right we plot the average FFT for each pixel, separated by each shape, over the whole dataset, showing that different shapes do have measurably different frequency spectra, even in this average case.
    17K
  • user avatar
    Andy Keller
    @t_andy_keller
    Mar 10, 2025
    Replying to @t_andy_keller
    Overall, we believe this is the first step of many towards creating neural networks with alternative methods of information integration, beyond those that we have currently such as network depth, bottlenecks, or all-to-all connectivity, like in Transformer self-attention. 12/14
    13K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement