This paper is wild.
Build a camera with a cheap, terrible lens, but train a diffusion model to recreate a much better image.
arxiv.org/abs/2408.07541
I mostly see VCs bothered. They want to believe OpenAI etc. have a secret sauce that will ensure they have high margins forever.
The space is fast-moving and competitive, and they are not as clued in as researchers are, so they were caught off guard.
This is the most exciting paper I've read in a while.
Alternate title could have been: "One weird trick to speed up your depth map inference 200x."
arxiv: arxiv.org/abs/2409.11355
github: github.com/VisualComputin…
Let's go through the details 🧵 1/9
🚨 Major FLUX training update!
A week ago @fal dropped a bombshell with sub-five-minute LoRA training.
Today we doubled the speed.
Same step count, same great quality, but now you can train a LoRA in 2-3 minutes.
$2 for ~2 minutes. Best deal around.