u/Educational-Bag-8767

Hey everyone,

I’ve been obsessed lately with those high-quality documentary channels that use 3D mannequin-like figures and cinematic dioramas to narrate stories (think historical events or technical breakdowns). I’ve got the scriptwriting and VO side of things down, but I am hitting a massive brick wall when it comes to the visual pipeline.

I’m trying to avoid the "low-effort" AI look and actually get consistent characters and motion. My current struggle is:

  1. Prompt Consistency: Every time I try to generate a "mannequin in a 1920s office," I get something different. One looks like a plastic doll, the other looks like a wax figure. How do you keep the "material" of the mannequins consistent across a whole 10-minute video?
  2. Image-to-Video Motion: When I run these through tools like Runway or Luma, the motion is usually just a weird "gliding" effect or the mannequin’s face starts melting. I’m looking for that specific, intentional camera movement—dolly zooms or slow pans—that makes it feel like a professional documentary.

Are people building these entirely in Blender/Unreal Engine and just using AI for textures, or is there a specific ComfyUI or prompting workflow that handles this? I have a decent rig (RTX 40-series), so I can handle some heavy lifting, but I just can't find a tutorial that isn't just "how to use Midjourney."

Would love to hear from anyone who has cracked the code on the visual pipeline for this specific style.

reddit.com
u/Educational-Bag-8767 — 17 days ago

Hey everyone,

I’ve been obsessed lately with those high-quality documentary channels that use 3D mannequin-like figures and cinematic dioramas to narrate stories (think historical events or technical breakdowns). I’ve got the scriptwriting and VO side of things down, but I am hitting a massive brick wall when it comes to the visual pipeline.

I’m trying to avoid the "low-effort" AI look and actually get consistent characters and motion. My current struggle is:

  1. Prompt Consistency: Every time I try to generate a "mannequin in a 1920s office," I get something different. One looks like a plastic doll, the other looks like a wax figure. How do you keep the "material" of the mannequins consistent across a whole 10-minute video?
  2. Image-to-Video Motion: When I run these through tools like Runway or Luma, the motion is usually just a weird "gliding" effect or the mannequin’s face starts melting. I’m looking for that specific, intentional camera movement—dolly zooms or slow pans—that makes it feel like a professional documentary.

Are people building these entirely in Blender/Unreal Engine and just using AI for textures, or is there a specific ComfyUI or prompting workflow that handles this? I have a decent rig (RTX 40-series), so I can handle some heavy lifting, but I just can't find a tutorial that isn't just "how to use Midjourney."

Would love to hear from anyone who has cracked the code on the visual pipeline for this specific style.

reddit.com
u/Educational-Bag-8767 — 17 days ago