u/Educational-Bag-8767 — reddlx

Hey everyone,

I’ve been obsessed lately with those high-quality documentary channels that use 3D mannequin-like figures and cinematic dioramas to narrate stories (think historical events or technical breakdowns). I’ve got the scriptwriting and VO side of things down, but I am hitting a massive brick wall when it comes to the visual pipeline.

I’m trying to avoid the "low-effort" AI look and actually get consistent characters and motion. My current struggle is:

Prompt Consistency: Every time I try to generate a "mannequin in a 1920s office," I get something different. One looks like a plastic doll, the other looks like a wax figure. How do you keep the "material" of the mannequins consistent across a whole 10-minute video?
Image-to-Video Motion: When I run these through tools like Runway or Luma, the motion is usually just a weird "gliding" effect or the mannequin’s face starts melting. I’m looking for that specific, intentional camera movement—dolly zooms or slow pans—that makes it feel like a professional documentary.

Are people building these entirely in Blender/Unreal Engine and just using AI for textures, or is there a specific ComfyUI or prompting workflow that handles this? I have a decent rig (RTX 40-series), so I can handle some heavy lifting, but I just can't find a tutorial that isn't just "how to use Midjourney."

Would love to hear from anyone who has cracked the code on the visual pipeline for this specific style.

Hey everyone,

I’m trying to avoid the "low-effort" AI look and actually get consistent characters and motion. My current struggle is:

Prompt Consistency: Every time I try to generate a "mannequin in a 1920s office," I get something different. One looks like a plastic doll, the other looks like a wax figure. How do you keep the "material" of the mannequins consistent across a whole 10-minute video?
Image-to-Video Motion: When I run these through tools like Runway or Luma, the motion is usually just a weird "gliding" effect or the mannequin’s face starts melting. I’m looking for that specific, intentional camera movement—dolly zooms or slow pans—that makes it feel like a professional documentary.

Would love to hear from anyone who has cracked the code on the visual pipeline for this specific style.