Flux’s Architecture diagram :) Don’t think there’s a paper so had a quick look through their code. Might be useful for understanding current Diffusion architectures : r/LocalLLaMA arXiv Dive: How Flux and Rectified Flow Transformers Work | Oxen.ai