Image Generation¶
Architectures¶
- MMDiT - Multi-Modal Diffusion Transformer architecture
- flow matching - Flow matching for diffusion models
- block causal linear attention - Block causal linear attention mechanism
- DC AE - Deep Compression Autoencoder
- SANA - SANA architecture
- sana denoiser architecture - SANA denoiser design
- transformers v5 - Transformers v5 for diffusion
FLUX Models¶
- flux klein 9b inference - FLUX Klein 9B inference guide and best practices
- flux kontext - FLUX Kontext model
Training & Fine-tuning¶
- diffusion lora training - LoRA training for diffusion models
- lora fine tuning for editing models - LoRA fine-tuning for editing models
- Text to LoRA - Text-to-LoRA generation
- paired training for restoration - Paired training for image restoration
Inference & Optimization¶
- diffusion inference acceleration - Inference acceleration techniques
- tiled inference - Tiled inference for high-resolution generation
- temporal tiling - Tiles as temporal sequence
- low vram inference strategies - Low-VRAM inference strategies
- textual latent interpolation - Textual latent interpolation
Editing & Restoration¶
- Step1X Edit - Step1X-Edit model
- [[ACE++]] - ACE++ editing
- LaMa - Large Mask Inpainting
- image restoration survey - Image restoration survey
- RealRestorer - RealRestorer model
- color checker and white balance - Color checker and white balance correction
- grayscale overlay nn architectures - Neural networks for grayscale overlay prediction
Specialized Models¶
- Calligrapher - Calligrapher model
- PixelSmile - PixelSmile model
- X Dub - X-Dub model
- FLAIR - FLAIR model
- MACRO - MACRO model
- MARBLE - MARBLE model
- ATI - Any Trajectory Instruction
Segmentation¶
- in context segmentation - In-context segmentation