Log inSign up
Niels Rogge
3,391 posts
Image
user avatar
Niels Rogge
@NielsRogge
ML Engineer @huggingface. Building paperswithco.de. @KU_Leuven grad. General interest in machine & deep learning. Making AI more accessible for everyone!
Belgium
nielsrogge.github.io
Joined April 2010
727
Following
20.9K
Followers
  • Pinned
    user avatar
    Niels Rogge
    @NielsRogge
    Sep 29, 2022
    Today my Transformers-Tutorials repo hit 2,000 stars on @github! 🤩 Very greatful :) the repo contains many tutorial notebooks on inference + fine-tuning with custom data for Transformers on all kinds of data; text, images, scanned PDFs, videos ⭐
    Image
    GitHub - NielsRogge/Transformers-Tutorials: This repository contains demos I made with the Transf...
    From github.com
  • user avatar
    Niels Rogge
    @NielsRogge
    Jan 21, 2025
    “So there’s this Chinese company called DeepSeek which basically does what OpenAI initially intended to do. They open-sourced a model trained with large-scale reinforcement learning, beating everyone else, and even releasing a paper detailing their process“
    Image
    212K
  • user avatar
    Niels Rogge
    @NielsRogge
    Sep 18, 2025
    The most legendary LLM release is still Mistral-7B for me No context given, just a magnet link First time we got a decent model running locally Feels like yesterday
    Image
    121K
  • user avatar
    Niels Rogge
    @NielsRogge
    Jan 7, 2025
    Wait how big of an AI lab is @deepseek_ai ?
    Image
    252K
  • user avatar
    Niels Rogge
    @NielsRogge
    Oct 24, 2024
    Microsoft silently dropped a new model on the hub 👀 "OmniParser is a general screen parsing tool, which interprets/converts UI screenshot to structured format, to improve existing LLM based UI agents"
    Image
    microsoft/OmniParser · Hugging Face
    From huggingface.co
    171K
  • user avatar
    Niels Rogge
    @NielsRogge
    Oct 18, 2025
    Karpathy: "RL is terrible" Every RL researcher on the Karpathy interview: "I agree with everything he says"
    180K
  • user avatar
    Niels Rogge
    @NielsRogge
    Jun 19, 2025
    "Hugging Face is basically the equivalent of Github in the era of software 2.0" - Karpathy, 2025, colorized
    Image
    85K
  • user avatar
    Niels Rogge
    @NielsRogge
    Jul 10, 2025
    I feel like people who know about distributed training, CUDA kernels, JAX/Flax, GPU clusters, profiling etc have a massive edge currently Maybe I should learn those too
    130K
  • user avatar
    Niels Rogge
    @NielsRogge
    Jan 29, 2025
    Absolutely disgusting post! Literally the opposite of what @huggingface stands for
    user avatar
    Dario Amodei
    Anthropic
    @DarioAmodei
    Jan 29, 2025
    My thoughts on China, export controls and two possible futures darioamodei.com/on-deepseek-an…
    143K
  • user avatar
    Niels Rogge
    @NielsRogge
    Mar 16, 2025
    I learned how to migrate to uv: 1) uv init 2) uv venv 3) source .venv/bin/activate 4) uv add -r requirements.txt Congrats, you can now delete your requirements.txt file
    user avatar
    Han
    @HanchungLee
    Mar 15, 2025
    uv still has a bunch of issues. - it’s not pip - lack of ide and coding agent support - venv created by uv cant be used without uv - poor documentation. there’s no playbook on how to migrate to it. information scattered around like a changelog.
    187K
  • user avatar
    Niels Rogge
    @NielsRogge
    Aug 5, 2022
    OWL-ViT by @GoogleAI is now available @huggingface Transformers. The model is a minimal extension of CLIP for zero-shot object detection given text queries. 🤯 🥳 It has impressive generalization capabilities and is a great first step for open-vocabulary object detection! (1/2)
    Image
    GIF
  • user avatar
    Niels Rogge
    @NielsRogge
    Mar 18, 2025
    Amazing video on multi-head attention, multi-query attention, grouped-query attention and MLA (multi-latent attention), the main innovation of DeepSeek's V3 model Made with @3blue1brown's amazing Manim library
    Image
    59K
  • user avatar
    Niels Rogge
    @NielsRogge
    May 29, 2025
    When people ask me: "Which framework should I use for developing AI agents? Should I use @OpenAI Agent SDK? Google's Agent Development Kit? LangGraph or @pydantic Agent framework?" Here's my honest answer:
    Image
    132K
  • user avatar
    Niels Rogge
    @NielsRogge
    Dec 6, 2024
    Feeling the same tbh
    Image
    82K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement