r/createimg

This Open Image Model Works Without a Separate VAE
▲ 21 r/createimg+1 crossposts

This Open Image Model Works Without a Separate VAE

Came across a newly released AI image model called HiDream-O1-Image, and its design looks pretty different from most current generators.

The model works directly with pixel-based generation instead of depending on a separate VAE pipeline. It also combines text understanding, image generation, editing, and personalization inside one unified system.

Main highlights:

  • Generates high-resolution images up to 2048×2048
  • Supports text-to-image and image editing
  • Can handle long text rendering and subject-focused generations
  • Includes an internal prompt reasoning system for layouts and details
  • Built with 8B parameters while targeting performance close to larger models

There are currently two versions available:

  • HiDream-O1-Image → recommended around 50 steps
  • HiDream-O1-Image-Dev → recommended around 28 steps

Model links:

u/Substantial-Fee-3910 — 4 days ago