vincent!
328 posts
vincent!
@vvhuang_
understanding llms @TransluceAI, writing mindslice.substack.com
previously: hotel manager @MIT
sf
vvhuang.com
Joined November 2020
503 Following
1,678 Followers
  • Pinned
    vincent!
    @vvhuang_
    Dec 18, 2025
    We trained a decoder to read the internal activations of an LLM and answer questions about what the model will think about or do next. We find that this decoder can understand LLM behaviors, even when the model itself is confused! (for instance, if the model has been jailbroken)
    Transluce
    @TransluceAI
    Dec 18, 2025
    Transluce is developing end-to-end interpretability approaches that directly train models to make predictions about AI behavior. Today we introduce Predictive Concept Decoders (PCD), a new architecture that embodies this approach.
    21K
  • vincent!
    @vvhuang_
    Oct 15, 2023
    many of the smartest people in my life got really good at arguing / winning fights, and this has actually made their beliefs *less* correct over time, because it's really hard for others to correct them when they're wrong
    12K
  • vincent!
    @vvhuang_
    Mar 20, 2023
    super excited to be joining @genintelligent! i wrote a bit about my job search / how i decided what to do after school here: mindslice.substack.com/p/choosing
    9.3K
  • vincent!
    @vvhuang_
    Sep 2, 2024
    i’ve decided to take a break from working on codegen at @Imbue and am looking to explore a bit + figure out what to focus on 🙂 please send cool papers to read, problems that are not being thought about enough, people you think i should meet!!
    16K
  • vincent!
    @vvhuang_
    Dec 13, 2022
    college learning hack - make a list of interesting classes with final projects and show up to the last lectures of the semester to watch all the project presentations, regardless of whether you’re enrolled or not
    i tried this for the first time today and really enjoyed it 🙂
  • vincent!
    @vvhuang_
    Nov 5, 2023
    there aren’t many good resources for learning about the effects of AI / automation on labor / markets so I put together my own reading list, with notes for the papers I’ve already finished:
    docs.google.com
    AI x Labor Lit Review
    This document summarizes vincent’s explorations of the literature surrounding AI/automation and labor markets. Primary focus is on understanding economic trends and evidence, not policy proposals. If...
    11K
  • vincent!
    @vvhuang_
    Feb 28, 2023
    every few months i’m tempted to tweet a lot and become a twitter influencer. but then i remember that time two years ago when a friend i respect highly looked at someone’s profile and said “there’s no way anyone who tweets this often can actually be productive”
    5.7K
  • vincent!
    @vvhuang_
    Oct 15, 2024
    learned more physics working on this for 1 week than the entire preceding year 🫡 check out our experiments on functional ultrasound and the acoustoelectric effect!!!
    marley 📐
    @_marleyx
    Oct 15, 2024
    Can we invent new brain-computer interface modalities? @raffi_hotter and I got 9 friends together and built a lab at home to test two totally new imaging methods: acoustoelectric imaging & functional ultrasound through the skull 🧵 story that involves nV measurements, pretty
    4.7K
  • vincent!
    @vvhuang_
    Oct 23, 2024
    i think we have the most compelling explanation so far for why LLMs make mistakes like 9.11>9.9 🙂
    1) we labeled every neuron in Llama3
    2) when Llama says 9.11>9.9 we see influential groups of neurons about dates and bible verses
    3) zeroing those allows Llama to answer correctly
    Kevin Meng
    @mengk20
    Oct 23, 2024
    why do language models think 9.11 > 9.9? at @TransluceAI we stumbled upon a surprisingly simple explanation - and a bugfix that doesn't use any re-training or prompting. turns out, it's about months, dates, September 11th, and... the Bible?
    6K
  • vincent!
    @vvhuang_
    May 14, 2022
    brief writeup of a zero-knowledge crypto project I worked on this semester 🙂
    0xPARC
    @0xPARC
    May 13, 2022
    [New Post] zkPairing: zkSNARKs for Elliptic Curve Pairings
    @jonathanpwang, @vvhuang_, and @theyisun present elliptic curve pairings in circom--unlocking BLS signatures, recursive verification, polynomial commitment verification, and more in groth16 (1/n) 0xparc.org/blog/zk-pairin…
  • vincent!
    @vvhuang_
    Jul 23, 2024
    the way zuck added this apple complaint in the middle of the llama3.1 announcement 😆
    2.8K
  • vincent!
    @vvhuang_
    Aug 24, 2022
    i got better at writing complex code this summer and i think almost all the improvement came from deciding to write more things down. it’s a very simple workflow change which results in a lot more working memory and foresight
  • vincent!
    @vvhuang_
    Feb 11, 2024
    working on tensor parallelism has really made me appreciate how beautiful torch.autograd is
    like, you can put distributed operations inside your model and all the gradients still get computed correctly during backprop! with very little code required! (code from megatron)
    3.6K
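
The Megatron code attached to the post above is not preserved in this text, so the following is only a rough, pure-Python sketch of the general idea, not Megatron's or torch's actual implementation. It assumes a column-parallel linear layer whose input is copied to every rank in the forward pass (an identity op) and whose input gradients are summed across ranks in the backward pass (an all-reduce). All names here (`forward`, `grad_input`, `split_columns`, `all_reduce_sum`) are hypothetical, and the "ranks" are just list entries in a single process.

```python
# Pure-Python simulation of tensor (column) parallelism for y = x @ W.
# Assumption: no torch and no real process group; "ranks" are list entries.

def forward(x, W):
    """Compute y[j] = sum_i x[i] * W[i][j]."""
    return [sum(x[i] * W[i][j] for i in range(len(x))) for j in range(len(W[0]))]

def grad_input(W, grad_y):
    """Compute dL/dx[i] = sum_j W[i][j] * grad_y[j]."""
    return [sum(W[i][j] * grad_y[j] for j in range(len(grad_y))) for i in range(len(W))]

def split_columns(W, world_size):
    """Give each simulated rank an equal, contiguous slice of W's columns."""
    cols = len(W[0]) // world_size
    return [[row[r * cols:(r + 1) * cols] for row in W] for r in range(world_size)]

def all_reduce_sum(per_rank):
    """Stand-in for a sum all-reduce: elementwise sum over all ranks."""
    return [sum(vals) for vals in zip(*per_rank)]

x = [1.0, 2.0]
W = [[1.0, 2.0, 3.0, 4.0],
     [5.0, 6.0, 7.0, 8.0]]

# Reference: unsharded forward and backward, with loss L = sum(y).
y_full = forward(x, W)
grad_full = grad_input(W, [1.0] * len(y_full))

# Sharded forward: x is used unchanged on every rank (identity in forward).
shards = split_columns(W, world_size=2)
y_shards = [forward(x, W_r) for W_r in shards]

# Sharded backward: each rank only sees its own columns, so it produces a
# partial dL/dx; summing the partials across ranks recovers the full gradient.
partials = [grad_input(W_r, [1.0] * len(y_r)) for W_r, y_r in zip(shards, y_shards)]
grad_sharded = all_reduce_sum(partials)

print(grad_full, grad_sharded)
```

In a real torch setup this pairing would typically live in a custom autograd function, so that the backward all-reduce is triggered automatically during backprop; the sketch only checks the arithmetic that makes that correct.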
  • vincent!
    @vvhuang_
    Jan 19, 2023
    wrote a bit about my experiences with applied crypto and how most young people aren't able to properly explore their interests
    thanks @amirbolous for sharing :)
    amir
    @amirbolous
    Jan 18, 2023
    @vvhuang_ with an absolute banger of a post, part below stuck out in particular. if u are a young person interested in "weird" things, reminder that there are always people, places, programs, and things out there for you to show u there is a real alternative
    3.1K
