Introducing *Transfusion* - a unified approach for training models that can generate both text and images. arxiv.org/pdf/2408.11039
Transfusion combines language modeling (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. This