Building Real-Time Editing on FLUX2: Inference Acceleration and Distillation with Reinforcement Learning (Preview)
A systems-and-training walkthrough of how a FLUX2-based editor was pushed toward real-time interaction using cache-aware two-step inference, causal attention distillation, and reward-guided DMDR.