u/uxl

New open source multimodal model does it all...with only 3B parameters
▲ 682 r/accelerate+2 crossposts

Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework.

  • Efficient at 3B scale. With only 3B active parameters, Lance delivers strong performance across image generation, image editing, and video generation benchmarks.
huggingface.co
u/uxl — 19 hours ago