Log inSign up
Gabriel Synnaeve
9,110 posts
Image
user avatar
Gabriel Synnaeve
@syhw
Nerd & Dad. RL & CodeGen research since before it was cool.
Paris
syhw.github.io
Joined October 2009
1,472
Following
16.8K
Followers
  • user avatar
    Gabriel Synnaeve
    @syhw
    Sep 24, 2025
    (🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…
    921K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Jun 9, 2023
    We've just released MusicGen, and there is a @huggingface demo now, here is a thread about me playing with it just right now. huggingface.co/spaces/faceboo… A 🧵👇
    Image
    MusicGen - a Hugging Face Space by facebook
    From huggingface.co
    698K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Oct 9, 2025
    This is an excellent history of LLMs, doesn't miss seminal papers I know. Reminds you we're standing on the shoulders of giants, and giants are still being born today. gregorygundersen.com/blog/2025/10/0…
    128K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Oct 4, 2024
    Reinforcement learning with execution feedback (RLEF). Lots of sweat went into this one, but what works in principle works in practice: for code generation we can turn compute into training data: arxiv.org/abs/2410.02089 This works for LLMs, but will lead to world models.
    arXiv logo
    arxiv.org
    RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
    Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their...
    63K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Sep 30, 2025
    Everything I know in RL in one tweet: exploration>exploitation, easy to leverage off-policy positive rewards, hard to leverage off-policy negative rewards, update the policy often, focus on throughput, self-play or find asymmetric grounding, clip everything but check statistics.
    34K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Jun 18, 2024
    Multi-token prediction models are here
    Image
    facebook/multi-token-prediction · Hugging Face
    From huggingface.co
    92K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Oct 21, 2024
    Want to do research in code generation with LLMs and wonky deep learning from the 90s? We're recruiting one Master student (M2) intern for 2025 at FAIR Paris in my team metacareers.com/jobs/106871446…
    58K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Dec 15, 2020
    The wav2letter Santa has brought 50k hours of read speech in 8 languages in CC-BY 4.0: - dataset: openslr.org/94/ - paper: arxiv.org/abs/2012.03411 - pretrained models: github.com/facebookresear…
  • user avatar
    Gabriel Synnaeve
    @syhw
    Apr 17, 2024
    To all the defeatists who think there is nothing else but scale: * 5 years between Self-Attention Is All You Need and FlashAttention * Transformers still require warmup. Researchers: get back to work! The future is bright :)
    77K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Apr 5, 2023
    Do you need to quantize models? Try diffq, `pip install diffq` and
    Image
    GitHub - facebookresearch/diffq: DiffQ performs differentiable quantization using pseudo quantiza...
    From github.com
    33K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Sep 24, 2025
    Replying to @syhw
    4/ Here is an example of the Code World Model tracing the execution of the piece of code counting the "r"s in "strawberry". Think of it like a neural `pdb` that you can set to any initial frame state, and that reasoning can query as a tool in token space.
    Image
    121K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Aug 24, 2023
    Happy to be releasing Code Llama! We've built it on Llama 2 and improved it for code use cases. In particular it supports infilling out of the box, and was trained with sequences up to 16k tokens. Looking forward to what the community will build with it! 1/7
    34K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Sep 24, 2025
    Replying to @syhw
    2/ When humans plan, we imagine the possible outcomes of different actions. When we reason about code we simulate part of its execution in our head. The current generation of LLMs struggles to do this. What kind of research will an explicitly trained code world model enable?
    Image
    34K
  • user avatar
    Gabriel Synnaeve
    @syhw
    Apr 20, 2021
    Flashlight's v0.3 release: a lightweight, modern C++ deep learning autograd-based library with SOTA models in speech recognition, language modeling, and vision: github.com/flashlight/fla… dataloading/model/training/docs to follow [1/5]
    Image
    Release v0.3 · flashlight/flashlight
    From github.com

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement