Pinned
Speedrunning ImageNet Diffusion - 360x faster training
There have been many new techniques demonstrating convergence speedups compared to DiT in the past few years, however all of these have been studied in isolation, against increasingly outdated baselines.
I present SR-DiT













