Log inSign up
cider
795 posts
user avatar
cider
@jeffreycider
my purpose in life is to forget linear algebra 2x a year
San Francisco, CA
Joined September 2019
656
Following
2,536
Followers
  • Pinned
    user avatar
    cider
    @jeffreycider
    May 31, 2022
    linear transformations stretch euclidean space ReLU folds euclidean space neural networks are just repeated origami on high-dimensional laffy taffy
  • user avatar
    cider
    @jeffreycider
    Jul 16, 2025
    Replying to @tenobrus
    "Scheiße"
    118K
  • user avatar
    cider
    @jeffreycider
    Apr 18, 2023
    "neural networks need to be adversarially robust like the human visual cortex. like you shouldn't be able to change a few pixels and completely change the semantic meaning of an image" the human visual cortex:
    Image
    776K
  • user avatar
    cider
    @jeffreycider
    Apr 21, 2023
    we have a material that's 1. chemically inert (ie safe for everything including food) 2. made of the crust's most abundant element 3. easily created with stone age tech 4. harder than steel 5. transparent also if it gets really hot it becomes a semiconductor praise gaia
    96K
  • user avatar
    cider
    @jeffreycider
    Jun 16, 2023
    watching a new ML grad student say that their research direction is using neuroscience as inspiration to make new architectures (can't interfere, it's a canon event)
    83K
  • user avatar
    cider
    @jeffreycider
    Oct 11, 2024
    Replying to @ScarletAstrorum
    there is only one way through, and it is to eat enough seitan every day to annihilate the celiac population of a small european country
    37K
  • user avatar
    cider
    @jeffreycider
    Jun 3, 2025
    first chess tournament in 17 years i was so scared of getting demolished by a 2nd grader but joke's on me my elo was literally not high enough to get paired with any children
    15K
  • user avatar
    cider
    @jeffreycider
    Sep 25, 2023
    "linear algebra has no surprises, if it seems true it probably is" you underestimate my dumb bitch energy. also if you weren't surprised by the wigner semicircle law kindly go fuck yourself and then give me your geometric intuition 🙏
    Image
    83K
  • user avatar
    cider
    @jeffreycider
    Jul 20, 2025
    optimization theorem: "assume a lipschitz constant L..." the lipschitz constant:
    Image
    Image
    user avatar
    Laker Newhouse
    @LakerNewhouse
    Jul 19, 2025
    [1/9] We created a performant Lipschitz transformer by spectrally regulating the weights—without using activation stability tricks: no layer norm, QK norm, or logit softcapping. We think this may address a “root cause” of unstable training.
    106K
  • user avatar
    cider
    @jeffreycider
    Jan 2, 2024
    nn layers align their singular vectors each matrix syncs to its neighbor, its rotation neatly clicking into the basis directions of the next rotation. like two gears precision-machined to be partners LLMs are swiss watches, ticking in a billion-dimensional pocket universe
    Image
    Image
    33K
  • user avatar
    cider
    @jeffreycider
    May 5, 2022
    overheard in roonchat: imma outsource flirting on dating apps to GPT-3 bro GPT-3 is curve-fitted over all reddit posts with >3 upvotes. you're handing your sex life to the perfect robo-redditor; i wouldn't even hand it a grocery list
  • user avatar
    cider
    @jeffreycider
    Jan 13, 2023
    Replying to @ChristophMolnar and @QVagabond
    yeah writers can't actually write, they're all just librarians with really good index lookup in the library of babel
    13K
  • user avatar
    cider
    @jeffreycider
    Jan 1, 2024
    DPO's method of removing RL from RLHF is so based > previously "forced" to sample trajectories with RL bc direct optimization would require an intractable partition fn Z(x) > observe that the bradley-terry model has a few extra degrees of freedom > simply set Z(x)=1
    Image
    Image
    93K
  • user avatar
    cider
    @jeffreycider
    Apr 23, 2023
    zoomers were born into a post-SVM world. "kernel trick" means nothing to someone who has never known ML without self-supervised representations. the kids only respect scale therefore i propose to rename "neural tangent kernel" to "infinite-width neural net (Taylor's version)"
    Image
    24K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement