Log inSign up
Alexandre Défossez
714 posts
user avatar
Alexandre Défossez
@honualx
Leading ambitious research @kyutai_labs. Chief Science Officer @gradiumai.
Paris, France
ai.honu.io
Joined March 2019
527
Following
4,990
Followers
  • user avatar
    Alexandre Défossez
    @honualx
    Sep 18, 2024
    Meet Moshiko and Moshika, the open source Moshi models 📖🟢. Moshi is a 7B text-audio model, capable of doing full-duplex conversations: it can listen and speak at any time. Plus, its inner text monologue improves the generation 💬 All on device🧑‍💻 🔎kyutai.org/Moshi.pdf
    Image
    00:00
    user avatar
    kyutai
    @kyutai_labs
    Sep 18, 2024
    Today, we release several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in Pytorch, Rust and MLX. More details below 🧵 ⬇️ Paper: kyutai.org/Moshi.pdf Repo:
    131K
  • user avatar
    Alexandre Défossez
    @honualx
    Nov 10, 2021
    I'm happy to release the v3 of Demucs for Music Source Separation, with hybrid domain prediction, compressed residual branches and much more. Checkout the code: github.com/facebookresear… Here is a demo for you @jaimealtozano, I'm sure you'll enjoy the improvements!
    Image
    00:00
  • user avatar
    Alexandre Défossez
    @honualx
    Sep 30, 2020
    I recently discovered Perlin noise, a stochastic texture generation algorithm used to make realistic fire, smoke, clouds etc. It was developed by Ken Perlin for the CGI of Disney movie Tron in 1982 🤖 (1/N)
    Image
  • user avatar
    Alexandre Défossez
    @honualx
    Nov 28, 2019
    We have released our platform for source separation in music. We adapt Conv-Tasnet and introduce the Demucs architecture, leading to two state-of-the-art models surpassing all previously known methods such as Wave-U-Net, Open-Unmix or Spleeter.
    Image
    GitHub - facebookresearch/demucs: Code for the paper Hybrid Spectrogram and Waveform Source...
    From github.com
  • user avatar
    Alexandre Défossez
    @honualx
    Dec 15, 2023
    AI is nothing without open source, #keepaiopen 🤗
    758K
  • user avatar
    Alexandre Défossez
    @honualx
    Nov 8, 2023
    We release stereo models for all MusicGen variants (+ a new large melody both mono and stereo): 6 new models available on HuggingFace (thanks @reach_vb). We show how a simple fine tuning procedure with codebook interleaving takes us from boring mono to immersive stereo🎧👇
    Image
    00:00
    116K
  • user avatar
    Alexandre Défossez
    @honualx
    Nov 25, 2020
    I have extended Julius with some extra features: FFT convolutions, FIR filters and decomposition over frequency bands in the waveform domain. All in @PyTorch, differentiable and with CUDA and TorchScript support.
    Image
  • user avatar
    Alexandre Défossez
    @honualx
    Oct 25, 2022
    With @jadecopet, @syhw and @adiyossLC , we are releasing EnCodec, a state-of-the-art neural audio codec supporting both 24 kHz mono audio and 48 kHz stereo, with bandwidth ranging from 1.5 kbps to 24 kbps 🗜️🎤🤖 arxiv.org/pdf/2210.13438…
    Image
    00:00
  • user avatar
    Alexandre Défossez
    @honualx
    Nov 17, 2023
    Really excited to be part of the founding team of @kyutai_labs: at the heart of our mission is doing open source and open science in AI🔬📖. Thanks so much to our founding donators for making this happen 🇪🇺 I’m thrilled to get to work with such a talented team and grow the lab 😊
    Image
    Image
    user avatar
    kyutai
    @kyutai_labs
    Nov 17, 2023
    Announcing Kyutai: a non-profit AI lab dedicated to open science. Thanks to Xavier Niel (@GroupeIliad), Rodolphe Saadé (@cmacgm) and Eric Schmidt (@SchmidtFutures ), we are starting with almost 300M€ of philanthropic support. Meet the team ⬇️
    18K
  • user avatar
    Alexandre Défossez
    @honualx
    Jun 9, 2023
    Today we release MusicGen, a text-to-music auto-regressive model built on EnCodec. It also supports optional melody conditioning based on chroma-gram extraction! It requires only 50 autoregressive steps per second of audio. Really fun to remix known tune in all genre 👇 + 🧵
    user avatar
    Felix Kreuk
    @FelixKreuk
    Jun 9, 2023
    We present MusicGen: A simple and controllable music generation model. MusicGen can be prompted by both text and melody. We release code (MIT) and models (CC-BY NC) for open research, reproducibility, and for the music community: github.com/facebookresear…
    Image
    00:00
    65K
  • user avatar
    Alexandre Défossez
    @honualx
    Dec 15, 2023
    As a PhD student and RS, FAIR was a magical place to be in: - incredible mentoring in all fields of AI🧑‍🏫 - access to resources and having my own research agenda 🧭 - free and encouraged to publish and open source 📖 For a lot of us there it was a transformative experience 🧑🏻‍🚀
    user avatar
    JoshXT
    @JoshXT
    Dec 15, 2023
    Replying to @ylecun
    Meta has definitely been the best thing to happen to AI.
    74K
  • user avatar
    Alexandre Défossez
    @honualx
    Sep 3, 2020
    We are releasing the code for our Interspeech paper "Real Time Enhancement in the Waveform Domain" with @syhw and @adiyossLC . Watch our live demo youtu.be/77cm_MVtLfk. Want to try it? Checkout our repo github.com/facebookresear… (1/2)
  • user avatar
    Alexandre Défossez
    @honualx
    Jun 13, 2023
    Official MusicGen now also supports extended generation (different implem, same idea). Go to our colab to test it. And keep an eye on @camenduru for more cool stuff! Of course, I tested it with an Interstellar deep remix as lo-fi with organic samples :) colab.research.google.com/drive/1fxGqfg9…
    Image
    00:00
    Image
    01:30
    user avatar
    camenduru
    @camenduru
    Jun 13, 2023
    Good news 🥳 Now we can generate more than 30s, Thanks to rkfg ❤ and Oncorporation ❤ github.com/rkfg/audiocraf… github.com/Oncorporation/… Please try it 🐣 github.com/camenduru/Musi… 🦆 🖼 stable diffusion model Freedom Redmond by @artificialguybr
    85K
  • user avatar
    Alexandre Défossez
    @honualx
    Dec 12, 2023
    We do not have a demo booth at #NeurIPS2023 but the MusicGen demo is always online 💻 and all code is open source 📖, with @jadecopet and @FelixKreuk 🎶🥁 huggingface.co/spaces/faceboo…
    38K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement