Gabriel Synnaeve (@syhw) / X

Gabriel Synnaeve

9,110 posts

@syhw

Nerd & Dad. RL & CodeGen research since before it was cool.

Paris

Joined October 2009

Gabriel Synnaeve
@syhw
Sep 24, 2025
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…
Gabriel Synnaeve
@syhw
Jun 9, 2023
We've just released MusicGen, and there is a @huggingface demo now, here is a thread about me playing with it just right now. huggingface.co/spaces/faceboo… A 🧵👇
MusicGen - a Hugging Face Space by facebook
Gabriel Synnaeve
@syhw
Oct 9, 2025
This is an excellent history of LLMs, doesn't miss seminal papers I know. Reminds you we're standing on the shoulders of giants, and giants are still being born today. gregorygundersen.com/blog/2025/10/0…
Gabriel Synnaeve
@syhw
Oct 4, 2024
Reinforcement learning with execution feedback (RLEF). Lots of sweat went into this one, but what works in principle works in practice: for code generation we can turn compute into training data: arxiv.org/abs/2410.02089 This works for LLMs, but will lead to world models.
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their...
Gabriel Synnaeve
@syhw
Sep 30, 2025
Everything I know in RL in one tweet: exploration>exploitation, easy to leverage off-policy positive rewards, hard to leverage off-policy negative rewards, update the policy often, focus on throughput, self-play or find asymmetric grounding, clip everything but check statistics.
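One possible reading of "clip everything but check statistics" is to clip a quantity (for example an importance ratio) while also recording how often the clip is active. The sketch below is only an illustration under that assumption: the helper name `clip_with_stats`, the bounds 0.8 and 1.2, and the sample ratios are hypothetical and do not come from the post.

```python
def clip_with_stats(values, low, high):
    """Clip each value to [low, high] and report the fraction that was clipped.

    Hypothetical helper for illustration; not taken from any library.
    """
    clipped = [min(max(v, low), high) for v in values]
    n_clipped = sum(1 for v in values if v < low or v > high)
    frac_clipped = n_clipped / len(values) if values else 0.0
    return clipped, frac_clipped


# Made-up importance ratios between a new and an old policy.
ratios = [0.5, 0.95, 1.0, 1.1, 1.6]
clipped, frac_clipped = clip_with_stats(ratios, 0.8, 1.2)
print(clipped, frac_clipped)
```

Logging a statistic like `frac_clipped` alongside the clipped values makes it visible when the clip is active on most samples, which would suggest the bounds or the update size deserve a second look.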
Gabriel Synnaeve
@syhw
Jun 18, 2024
Multi-token prediction models are here
facebook/multi-token-prediction · Hugging Face
Gabriel Synnaeve
@syhw
Oct 21, 2024
Want to do research in code generation with LLMs and wonky deep learning from the 90s? We're recruiting one Master student (M2) intern for 2025 at FAIR Paris in my team metacareers.com/jobs/106871446…
Gabriel Synnaeve
@syhw
Dec 15, 2020
The wav2letter Santa has brought 50k hours of read speech in 8 languages in CC-BY 4.0:
- dataset: openslr.org/94/
- paper: arxiv.org/abs/2012.03411
- pretrained models: github.com/facebookresear…
Gabriel Synnaeve
@syhw
Apr 17, 2024
To all the defeatists who think there is nothing else but scale:
* 5 years between Self-Attention Is All You Need and FlashAttention
* Transformers still require warmup.
Researchers: get back to work! The future is bright :)
Gabriel Synnaeve
@syhw
Apr 5, 2023
Do you need to quantize models? Try diffq, `pip install diffq` and
GitHub - facebookresearch/diffq: DiffQ performs differentiable quantization using pseudo quantiza...
Gabriel Synnaeve
@syhw
Sep 24, 2025
Replying to @syhw
4/ Here is an example of the Code World Model tracing the execution of the piece of code counting the "r"s in "strawberry". Think of it like a neural `pdb` that you can set to any initial frame state, and that reasoning can query as a tool in token space.
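For readers unfamiliar with what an execution trace looks like, here is a conventional (non-neural) sketch of tracing the "count the r's" example using Python's standard `sys.settrace`. This is not the Code World Model or its trace format. The function `count_r` and the helper `trace_locals` are hypothetical names chosen for illustration. It records, for each executed line, the line offset within the function and a snapshot of the local variables.

```python
import sys


def count_r(word):
    count = 0
    for ch in word:
        if ch == "r":
            count += 1
    return count


def trace_locals(func, *args):
    """Run func(*args) and record (line offset, locals) before each executed line."""
    steps = []
    code = func.__code__

    def tracer(frame, event, arg):
        # Only follow frames that belong to the traced function.
        if frame.f_code is not code:
            return None
        if event == "line":
            offset = frame.f_lineno - code.co_firstlineno
            steps.append((offset, dict(frame.f_locals)))
        return tracer

    previous = sys.gettrace()
    sys.settrace(tracer)
    try:
        result = func(*args)
    finally:
        sys.settrace(previous)
    return result, steps


result, steps = trace_locals(count_r, "strawberry")
for offset, local_vars in steps[:4]:
    print(offset, local_vars)
print("result:", result)
```

Each recorded step is a small frame state (line plus locals), which is the kind of information a debugger such as `pdb` exposes interactively.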
Gabriel Synnaeve
@syhw
Aug 24, 2023
Happy to be releasing Code Llama! We've built it on Llama 2 and improved it for code use cases. In particular it supports infilling out of the box, and was trained with sequences up to 16k tokens. Looking forward to what the community will build with it! 1/7
Gabriel Synnaeve
@syhw
Sep 24, 2025
Replying to @syhw
2/ When humans plan, we imagine the possible outcomes of different actions. When we reason about code we simulate part of its execution in our head. The current generation of LLMs struggles to do this. What kind of research will an explicitly trained code world model enable?
Gabriel Synnaeve
@syhw
Apr 20, 2021
Flashlight's v0.3 release: a lightweight, modern C++ deep learning autograd-based library with SOTA models in speech recognition, language modeling, and vision: github.com/flashlight/fla… dataloading/model/training/docs to follow [1/5]
Release v0.3 · flashlight/flashlight