Log inSign up
Ben Clavié
Mixedbread
3,609 posts
Image
user avatar
Ben Clavié
Mixedbread
@bclavie
regressing linearly on a daily basis. wife guy who does retrieval. research @mixedbreadai, prev answerdotai
Mitaka-shi, Tokyo
ben.clavie.eu
Joined April 2016
1,366
Following
6,726
Followers
  • Pinned
    user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Mar 12
    I'm so excited to introduce this! We've worked on a million different moving parts to produce this. I'm fairly confident it's the best multimodal model that exists, period -- and it's not too shabby at pushing back the LIMITs of retrieval either...
    user avatar
    Mixedbread
    @mixedbreadai
    Mar 12
    Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.
    Image
    145K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Sep 5, 2024
    RAG is increasingly going multi-modal, but document retrieval is tough, and layout gets in your way. But it shouldn't! Introducing 🪤RAGatouille's Vision-equipped, ColPali-powered sibling: 🐭Byaldi With just a few lines of code, search through documents, with no pre-processing.
    Image
    150K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Jan 4, 2024
    The RAG wave is here to stay, but in practice, it's hard to retrieve the right docs w/ embdings, & better IR models are hard to use! Let's fix that: Introducing 🪤RAGatouille, a lib to train&use SotA retrieval model, ColBERT, in just a few lines of code! github.com/bclavie/RAGato…
    Image
    203K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Feb 10, 2025
    What if a [MASK] was all you needed? ModernBERT is great, but we couldn't stop wondering if it could be greater than previous encoders in different ways. Maybe we don't need task-specific heads? Maybe it can do all sort of tasks with only its generative head? Spoilers: Yes
    Image
    165K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Aug 13, 2024
    🎉Happy to finally release answerai-colbert-small-v1: the small but mighty @answerdotai ColBERT. It might not be able to count the number of "r"s in words, but it can definitely find the instructions on how to do that. With just 33M params, it beats even `bge-base` on BEIR!
    Image
    146K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Feb 8, 2025
    Image
    36K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Dec 14, 2024
    Turn on early stopping, and all I see is a successful training loop
    Image
    40K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Jun 9, 2025
    Multimodal RAG: Just use ColPali/DSE then pass your screenshots to the LLM This is the dream, but how well do LLMs read text contained in images? We wanted to know, so we tried a simple thing: do results change on evals when using screenshots rather than text as input? Yes.
    Image
    70K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Mar 6, 2024
    "Just use a reranker for better retrieval" ... Yes, but which one? Someone asked me recently what reranker they should use (with no data to fine-tune it), and I realised just how loaded that question actually was, so I made this (mostly English) "cheatsheet"
    A decision tree mapping our decisions to choose a reranker model. Full accessible version is coming soon.
    40K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Mar 14, 2024
    Document reranking is powerful, but daunting to get started with. Moreover, trying a new approach requires modifying your pipeline, even though it does the same thing! Introducing 🔧rerankers: a lightweight library to provide a unified way to use various reranking methods🧵1/?
    Image
    92K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Dec 19, 2024
    It's finally out! We at @answerdotai, @LightOnIO and friends are releasing ModernBERT 🎉 It does exactly what it says on the tin: It's BERT, but not 2018 BERT, no, it's 2024 BERT, with all the 2024 bells and whistles. They're slot-in replacements for BERT, at both model sizes.
    Image
    88K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Jun 27, 2024
    🥁🥁 New blog post out (link in thread), w/ two aims: 🤓 Providing a clear, hopefully easy-to-read intro to ColBERT, without assuming you've ever used it. 🏊Introducing ColBERT Token Pooling ✨: You can reduce the size of ColBERT indexes by 66% with barely any performance hit!
    Graph showing results curve.
It is titled "Relative Performance (Quantised Vectors, 2-bit)". The x-axis is "Pooling Factor" and the y-axis "Relative Performance (%)".

It presents the results of using token pooling with ColBERT, at Pooling Factors 1 (no pooling), 2, 3, 4, 5 and 6.

It has two Average bars: Average and Average w/o outliers. It shows that the average performance reduction of Pooling by 2 is less than 1%, and pooling by 3 is just 2%. Without outliers, this gets down to ~2% and 4%, respectively.
    80K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Sep 16, 2024
    Time for a new ✨Information Retrieval Blogpost✨ It's about our rerankers library, and the why&how of it. It features this updated "what model should I start with" cheatsheet, as well as an intro to what reranking is and why you should embrace it (and a lot more cool stuff!)
    Image
    27K
  • user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Sep 4, 2024
    Full slides for this talk are here: docs.google.com/presentation/d… Expect a lot of ColBERT and ColPali, with a tiny SLADE and BM25 cameo to give some context. Thanks @jxnlco and @dan_s_becker for having me!
    user avatar
    Ben Clavié
    Mixedbread
    @bclavie
    Sep 4, 2024
    Replying to @mervenoyann
    Couldn't agree more, I literally just ended my talk at @jxnlco's RAG course with this slide an hour ago 😄 Normalise accepting ColPali+VLM is more amazing than it has any right to be and accepting that we don't need overly complex pipelines to do their job.
    Image
    59K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement