Tao Hu

Ommer Lab, MCML Junior Member.

prof_pic.jpg

PHD

news

Aug 01, 2025 TREAD and ArtFlow are accepted by ICCV 2025, Congrats to the team~
Mar 11, 2025 MaskFlow on arxiv:sparkles:
Mar 07, 2025 Will give a talk at NxtAim Winter School about “Efficient Architecture for Representation”.
Mar 01, 2025 Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions accepted by CVPR 2025.
Jan 23, 2025 ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge accepted by ICLR 2025.

selected publications

  1. TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
    Felix Krause , Timy Phan , Ming Gui , Stefan Andreas Baumann , Vincent Tao Hu , and Björn Ommer
    In ICCV , 2025
  2. Stochastic Interpolants for Revealing Stylistic Flows across the History of Art
    Pingchuan Ma , Ming Gui , Johannes Schusterbauer , Xiaopei Yang , Olga Grebenkova , Vincent Tao Hu , and Björn Ommer
    In ICCV , 2025
  3. Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
    Stefan Andreas Baumann , Felix Krause , Michael Neumayr , Nick Stracke , Vincent Tao Hu , and Björn Ommer
    In CVPR , 2025
    Prompt Editing in T2I models
  4. MaskFlow: Discrete Flows for Flexible and Efficient Long Video Generation
    Michael Fuest , Vincent Tao Hu , and Björn Ommer
    In Arxiv , 2025
  5. ./mask.jpg
    [MASK] is All You Need
    Vincent Tao Hu , and Björn Ommer
    In Arxiv , 2024
  6. ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model
    Eslam Mohamed BAKR , Liangbing Zhao , Vincent Tao Hu , Matthieu Cord , Patrick Perez , and Mohamed Elhoseiny
    In ICLR , 2025
  7. DepthFM: Fast Monocular Depth Estimation with Flow Matching
    Ming Gui , Johannes S. Fischer , Ulrich Prestel , Pingchuan Ma , Dmytro Kotovenko , Olga Grebenkova , Stefan A. Baumann , Vincent Tao Hu , and Björn Ommer
    In AAAI , 2025
    An exploration of flow matching for blazing fast and zero-shot depth estimation
  8. Does VLM Classification Benefit from LLM Description Semantics?
    Pingchuan Ma , Lennart Rietdorf , Dmytro Kotovenko , Vincent Tao Hu , and Björn Ommer
    In AAAI , 2025
  9. Distillation of Diffusion Features for Semantic Correspondence
    Frank Fundel , Johannes Schusterbauer , Vincent Tao Hu , and Björn Ommer
    In WACV , 2025
  10. Scaling Image Tokenizers with Grouped Spherical Quantization
    Jiangtao Wang , Zhen Qin , Yifan Zhang , Vincent Tao Hu , Björn Ommer , Rania Briq , and Stefan Kesselheim
    2024
  11. Diffusion Models and Representation Learning: A Survey
    Michael Fuest , Pingchuan Ma , Ming Gui , Johannes Fischer , Vincent Tao Hu , and Bjorn Ommer
    In Arxiv , 2024
    The interplay between diffusion models and representation learning
  12. ZigMa: A DiT-style Zigzag Mamba Diffusion Model
    Vincent Tao Hu , Stefan Andreas Baumann , Ming Gui , Olga Grebenkova , Pingchuan Ma , Johannes Fischer , and Bjorn Ommer
    In ECCV , 2024
    a DiT-style Mamba-based diffusion models
  13. Boosting Latent Diffusion with Flow Matching
    Johannes S. Fischer , Ming Gui , Pingchuan Ma , Nick Stracke , Stefan A. Baumann , Vincent Tao Hu , and Bjorn Ommer
    In ECCV , 2024
    flow matching for super-resolution
  14. Guided Flow Vision Transformer from Self-Supervised Diffusion Features
    Vincent Tao Hu , Yunlu Chen , Mathilde Caron , Yuki M. Asano , Cees G.M. Snoek , and Björn Ommer
    In Arxiv , 2024
  15. Motion Flow Matching for Human Motion Synthesis and Editing
    Vincent Tao Hu , Wenzhe Yin , Pingchuan Ma , Yunlu Chen , Basura Fernando , Yuki M. Asano , Efstratios Gavves , Pascal Mettes , Björn Ommer , and Cees G.M. Snoek
    In Arxiv , 2024
  16. Training Class-Imbalanced Diffusion Model Via Overlap Optimization
    Divin Yan , Lu Qi , Vincent Tao Hu , Ming-Hsuan Yang , and Meng Tang
    In arxiv , 2024
  17. ./fm.png
    Latent Space Editing in Transformer-based Flow Matching
    Vincent Tao Hu , David W Zhang , Pascal Mettes , Meng Tang , Deli Zhao , and Cees G.M. Snoek
    In AAAI 2024. Also appear in ICML 2023 Workshop, New Frontiers in Learning, Control, and Dynamical Systems , 2024
  18. ./scribbleseg.png
    Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation
    Jacob Schnell , Jieke Wang , Lu Qi , Vincent Tao Hu , and Meng Tang
    In SyntaGen CVPR workshop , 2024
    Explore diffusion model for data augmention in segmentation task.
  19. ./fm-s2s.png
    Flow Matching for Conditional Text Generation in a Few Sampling Steps
    Vincent Tao Hu , Di Wu , Yuki M. Asano , Pascal Mettes , Basura Fernando , Björn Ommer , and Cees G.M. Snoek
    In EACL , 2024
    Flow Matching for text generation
  20. ./sgdm-why.png
    Self-Guided Diffusion Models
    Tao Hu* , David W Zhang* , Yuki M. Asano , Gertjan J. Burghouts , and Cees G.M. Snoek
    In CVPR , 2023
    A bridge between the community of self-supervised learning and diffusion models. Short version to appear in NeurIPS 2022 Workshop on Score-Based Methods and NeurIPS 2022 Workshop Self-Supervised Learning Theory and Practice.
Image