r/LocalLLaMA top dayJun 9, 2026, 6:43 PM/u/pmttyji

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations

Original: zai-org/SCAIL-2 · Hugging Face

SCAIL-2 is an open-source model that animates characters from driving videos end-to-end, eliminating skeleton or mask intermediates.

SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video in an end-to-end manner. Trained on 60K synthesized motion pairs using SCAIL-Preview, Wan-Animate, and MoCha via a Unified Motion Transfer Interface with RoPE design, the model develops emergent abilities beyond its teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.

SCAIL-2 是由 zai-org 開源發布於 Hugging Face 的角色動畫模型,核心突破在於實現了「端對端」(End-to-end)的角色驅動動畫,徹底擺脫過去方案對中間姿勢表示的依賴。

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on r/LocalLLaMA top day →

Summaries are AI-generated; the original article is authoritative.