P
6B Parameters
Powerful 6 billion parameter model delivering exceptional image quality and detail that rivals models 10x larger, while maintaining efficiency and speed.
S
Single-Stream DiT
Revolutionary Diffusion Transformer architecture with single-stream processing that achieves unprecedented parameter efficiency and coherent generation.
L
Bilingual Text Rendering
Industry-leading bilingual text rendering with native support for English and Chinese, producing accurate typography where other models fail.
S
Sub-Second Generation
Blazing-fast 8-NFE turbo mode achieves sub-second inference on H800 GPUs and rapid generation on consumer hardware, 3-6x faster than competitors.
E
Natural Language Editing
Z-Image-Edit variant enables precise image-to-image transformations using natural language instructions for creative editing and style transfer.
O
Fully Open Source
Complete open-source release including all three variants (Turbo, Base, Edit), weights, code, and groundbreaking training methodology for community innovation.
M
Consumer-Friendly VRAM
Runs comfortably on consumer devices with 16GB VRAM, and can be optimized for GPUs with as little as 4GB VRAM, making it accessible to all.
P
Photorealistic Quality
Excels at generating highly detailed, photorealistic images with superior detail preservation in distant elements and backgrounds compared to similar-sized models.
C
ControlNet Integration
Full ControlNet support including Union 2.0 for precise control over pose, depth, edges, and more. Perfect for professional workflows requiring accuracy.
F
Fine-Tuning Ready
Z-Image-Base enables full fine-tuning, LoRA training, and distillation. Tools like DiffSynth-Studio provide comprehensive training support with low-VRAM optimization.
U
Universal Upscaler
Use Z-Image as a second-pass upscaler/enhancer for any model (FLUX, SDXL, SD 1.5). Results rival commercial services with fast, integrated workflow processing.
C
Rich Ecosystem
Extensive community support with ComfyUI nodes, Diffusers pipelines, Replicate API, AI Runner integration, and comprehensive documentation across platforms.