Shreyas Kapur (@shreyaskapur) / X

Shreyas Kapur

53 posts

Shreyas Kapur

@shreyaskapur

PhD student @berkeley_ai. Prev. undergrad @MIT, intern @Waymo @GoogleDeepMind

Berkeley, CA

Joined June 2012

Pinned
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
My first PhD paper!🎉We learn *diffusion* models for code generation that learn to directly *edit* syntax trees of programs. The result is a system that can incrementally write code, see the execution output, and debug it. 🧵1/n
00:00
742K
Shreyas Kapur
@shreyaskapur
Mar 21, 2025
I've been waiting 10 years to make this.
00:00
785K
Shreyas Kapur
@shreyaskapur
Mar 21, 2025
Replying to @shreyaskapur
Built with Google Gemini Flash 2.0 Image generation :D
27K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
We develop an analogous version of “noise” for syntax trees inspired by the computer security literature on fuzzing🎲. And we teach our model to reverse this noise⏪. 2/n
25K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
We managed to get part of our project running in the browser, Website🌎: tree-diffusion.github.io Paper📄: arxiv.org/abs/2405.20519 Code🖥️: github.com/revalo/tree-di… Thanks for my wonderful collaborator @jenner_erik, and advisor Stuart Russell! n/n 🧵
16K
Shreyas Kapur
@shreyaskapur
Jun 4, 2024
I had a lot of fun working on this. I didn't believe that a chess playing neural net could learn to do look-ahead just in its weights, so I was definitely the non-believer in this project.
Erik Jenner
@jenner_erik
Jun 4, 2024
♟️Do chess-playing neural nets rely purely on simple heuristics? Or do they implement algorithms involving *look-ahead* in a single forward pass? We find clear evidence of 2-turn look-ahead in a chess-playing network, using techniques from mechanistic interpretability! 🧵
00:00
23K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
Our implementation works on a given context-free grammar. Here is an example of our model diffusing a smaller “SVG”-like language. 4/n
00:00
19K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
A model that *edits* code makes it really easy to combine it with a search algorithm🔎. 3/n
00:00
19K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
This is what that language looks like, 5/n
16K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
Of course, our architecture is also a Vision-Language Transformer that is trained to edit code via tree diffusion. 7/n
15K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
We show how our approach outperforms previous methods, including rejection sampling a Vision-Language Transformer that is specifically trained on these tasks (CSGNet in this figure). 6/n
15K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @shreyaskapur
These languages are small, and we only show this approach on a fairly narrow inverse-graphics task. In the future, we hope to show that this approach may potentially work more generally with languages with loops and variables. 8/n
16K
Shreyas Kapur
@shreyaskapur
Jun 3, 2024
Replying to @sdtoyer
😂I'm glad you asked Sam! We've been working on a modern, functional, and performant library for graphics and diagrams in Python called iceberg,
GitHub - revalo/iceberg: A compositional diagramming and animation library as an eDSL in Python
From github.com
3.9K
Shreyas Kapur
@shreyaskapur
Dec 15, 2024
Come check out my tree diffusion poster at the system 2 reasoning at scale workshop at NeurIPS!
Shalev
@Shalev_lif
Dec 15, 2024
Best poster moment at #NeurIPS2024
2.8K