Latest in AI

Showing:video-generationDesignersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations
r/LocalLLaMA top day48 days agoRelease
SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video in an end-to-end manner. Trained on 60K synthesized motion pairs using SCAIL-Preview, Wan-Animate, and MoCha via a Unified Motion Transfer Interface with RoPE design, the model develops emergent abilities beyond its teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.
Grok Imagine Video 1.5 on AI Gateway
Vercel Changelog55 days agoRelease
Vercel’s changelog points to Grok Imagine Video 1.5 becoming available through AI Gateway. The public model page lists the preview model as xai/grok-imagine-video-1.5-preview and marks it primarily for image-to-video generation. Because the source text is unavailable, concrete claims about quality, speed, audio, editing, or text-to-video improvements should not be inferred.
How to Prompt Grok Imagine Video 1.5
Replicate Blog68 days agoTutorial
xAI has released Grok Imagine Video 1.5, a model that animates a still image into a short video clip. It generates synchronized audio during the same pass, combining visual animation and sound creation in one workflow. The Replicate Blog post focuses on prompting techniques intended to help users get more from the model.
推出 Waypoint-1：來自 Overworld 的即時互動式影片擴散模型★ 75
Hugging Face Blog189 days agoRelease
Overworld has officially launched a new model called "Waypoint-1" on the Hugging Face platform. It is a world model focused on "Real-time Interactive Video…
Fuel your creativity with new generative media models and tools★ 78
Google DeepMind Blog434 days agoNew Tool
Google announced new generative media models and tools at I/O 2025, led by Veo 3 for video, Imagen 4 for images, and Flow for AI filmmaking. Veo 3 adds audio generation, while Imagen 4 improves detail, typography, aspect ratios, and up to 2K output. Google also expanded Lyria 2 and Lyria RealTime access, while continuing SynthID watermarking and launching SynthID Detector.