u/AsstronautHistorian

So I just tried training a Z-Image Turbo LORA with over 1,0000 images of subjects with male genitalia, from different angles and zoom lengths.

For context, I've trained many other LORAs successfully, so I have a pretty good grasp on how to make these things work.

I was surprised at how bad the results were with representing the male genitalia. You would think that 1K images from different angles should be enough...and yeah it kinda got the shapes correct but still lots of deformities.

My question is... why? Why is it so hard for the model to replicate something it has 1K images of? Is genitalia the last frontier of anatomy that AI has yet to get a grasp on, like its previous struggle with hands/fingers? Is the "poisoned well" theory a thing (the suspicion that Z-Image Turbo was purposely given bad training data related to genitalia to purposefully censor/make it hard to generate)?

I've seen other people have been able to make OK Loras around this subject, so why am I struggling so badly?

Last thing I'll add is that I've tried messing with different Lora rank sizes (32, 64, 128), Learning Rates, etc. Just seems I'm hitting a wall and not even sure why.

reddit.com
u/AsstronautHistorian — 11 days ago