u/Chance-Jaguar-3708

Image 1 —
Image 2 —
Image 3 —

HF REPO : https://huggingface.co/kpsss34/Walkyrie-1.3B-v1.0

Walkyrie is a Text-to-Image diffusion model derived from Wan2.1-T2V-1.3B.
The text encoder (UMT5) was pruned to ~1B parameters and the model was re-trained for image generation, converting the original Text-to-Video architecture into a high-quality Text-to-Image pipeline.

* The node for use with ComfyUI is currently under development. I hope to receive feedback from everyone to encourage this project.

Thanks all,

u/Chance-Jaguar-3708 — 10 days ago

HF REPO : https://huggingface.co/kpsss34/Walkyrie-1.3B-v1.0

Walkyrie-1.3B is a Text-to-Image diffusion model derived from Wan2.1-T2V-1.3B.
The text encoder (UMT5) was pruned to ~1B parameters and the model was re-trained for image generation, converting the original Text-to-Video architecture into a high-quality Text-to-Image pipeline.

⚠️ Early Release — Work in Progress This model has only been trained to approximately 20% of the planned training budget. It is released for testing and community feedback purposes. Quality and stability are expected to improve significantly with further training.

My biggest remaining problem is anatomy, which is a common issue with small-scale models.
### I hope everyone will encourage me to succeed. ###

u/Chance-Jaguar-3708 — 11 days ago