cider (@jeffreycider) / X

cider

795 posts

cider

@jeffreycider

my purpose in life is to forget linear algebra 2x a year

San Francisco, CA

Joined September 2019

Pinned
cider
@jeffreycider
May 31, 2022
linear transformations stretch euclidean space ReLU folds euclidean space neural networks are just repeated origami on high-dimensional laffy taffy
cider
@jeffreycider
Jul 16, 2025
Replying to @tenobrus
"Scheiße"
118K
cider
@jeffreycider
Apr 18, 2023
"neural networks need to be adversarially robust like the human visual cortex. like you shouldn't be able to change a few pixels and completely change the semantic meaning of an image" the human visual cortex:
776K
cider
@jeffreycider
Apr 21, 2023
we have a material that's 1. chemically inert (ie safe for everything including food) 2. made of the crust's most abundant element 3. easily created with stone age tech 4. harder than steel 5. transparent also if it gets really hot it becomes a semiconductor praise gaia
96K
cider
@jeffreycider
Jun 16, 2023
watching a new ML grad student say that their research direction is using neuroscience as inspiration to make new architectures (can't interfere, it's a canon event)
83K
cider
@jeffreycider
Oct 11, 2024
Replying to @ScarletAstrorum
there is only one way through, and it is to eat enough seitan every day to annihilate the celiac population of a small european country
37K
cider
@jeffreycider
Jun 3, 2025
first chess tournament in 17 years i was so scared of getting demolished by a 2nd grader but joke's on me my elo was literally not high enough to get paired with any children
15K
cider
@jeffreycider
Sep 25, 2023
"linear algebra has no surprises, if it seems true it probably is" you underestimate my dumb bitch energy. also if you weren't surprised by the wigner semicircle law kindly go fuck yourself and then give me your geometric intuition 🙏
83K
cider
@jeffreycider
Jul 20, 2025
optimization theorem: "assume a lipschitz constant L..." the lipschitz constant:
Laker Newhouse
@LakerNewhouse
Jul 19, 2025
[1/9] We created a performant Lipschitz transformer by spectrally regulating the weights—without using activation stability tricks: no layer norm, QK norm, or logit softcapping. We think this may address a “root cause” of unstable training.
106K
cider
@jeffreycider
Jan 2, 2024
nn layers align their singular vectors each matrix syncs to its neighbor, its rotation neatly clicking into the basis directions of the next rotation. like two gears precision-machined to be partners LLMs are swiss watches, ticking in a billion-dimensional pocket universe
33K
cider
@jeffreycider
May 5, 2022
overheard in roonchat: imma outsource flirting on dating apps to GPT-3 bro GPT-3 is curve-fitted over all reddit posts with >3 upvotes. you're handing your sex life to the perfect robo-redditor; i wouldn't even hand it a grocery list
cider
@jeffreycider
Jan 13, 2023
Replying to @ChristophMolnar and @QVagabond
yeah writers can't actually write, they're all just librarians with really good index lookup in the library of babel
13K
cider
@jeffreycider
Jan 1, 2024
DPO's method of removing RL from RLHF is so based > previously "forced" to sample trajectories with RL bc direct optimization would require an intractable partition fn Z(x) > observe that the bradley-terry model has a few extra degrees of freedom > simply set Z(x)=1
93K
cider
@jeffreycider
Apr 23, 2023
zoomers were born into a post-SVM world. "kernel trick" means nothing to someone who has never known ML without self-supervised representations. the kids only respect scale therefore i propose to rename "neural tangent kernel" to "infinite-width neural net (Taylor's version)"
24K