Latest in AI

Showing:video-generationClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Tiangong AI Redefines World Models with Matrix-Game 3.5 at BAAI Conference 2026
量子位 QbitAI43 days agoRelease
At the 2026 BAAI Zhiyuan Conference, Tiangong AI announced Matrix-Game 3.5, framing it as a redefinition of world models in AI rather than a routine update. The company disclosed what it characterizes as the latest technical breakthroughs in the Matrix-Game product line, which targets interactive virtual-world simulation. The announcement positions Tiangong AI as a serious competitor in the fast-moving world-model space alongside international and domestic Chinese AI labs.
Avataar AI Launches Low-Cost Varya Video Model for India
TechCrunch AI46 days agoRelease
Avataar AI has launched Varya, a video generation model built from Alibaba’s open Wan 2.2 model and distilled for faster, cheaper output. The company says Varya can generate 5-second 720p clips on an NVIDIA H200 in 45 seconds, versus 1,230 seconds for Wan 2.2. Avataar plans to release the model and training data through India’s AI Kosh portal while offering hosted access at about $0.005 per second.
SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations
r/LocalLLaMA top day48 days agoRelease
SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video in an end-to-end manner. Trained on 60K synthesized motion pairs using SCAIL-Preview, Wan-Animate, and MoCha via a Unified Motion Transfer Interface with RoPE design, the model develops emergent abilities beyond its teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.
Grok Imagine Video 1.5 on AI Gateway
Vercel Changelog55 days agoRelease
Vercel’s changelog points to Grok Imagine Video 1.5 becoming available through AI Gateway. The public model page lists the preview model as xai/grok-imagine-video-1.5-preview and marks it primarily for image-to-video generation. Because the source text is unavailable, concrete claims about quality, speed, audio, editing, or text-to-video improvements should not be inferred.
Why Video Agent Models Are Next — Ethan He, xAI Grok Imagine
Latent Space56 days agoOpinion
Latent Space interviews Ethan He, who led Grok Imagine at xAI, about building the product in three months. The episode contrasts video generation with world models and explores why video agent models may become an important next step. It also argues that Grok Imagine remains underrated, while the supplied description does not include architecture details or benchmark results.
How to Prompt Grok Imagine Video 1.5
Replicate Blog68 days agoTutorial
xAI has released Grok Imagine Video 1.5, a model that animates a still image into a short video clip. It generates synchronized audio during the same pass, combining visual animation and sound creation in one workflow. The Replicate Blog post focuses on prompting techniques intended to help users get more from the model.
推出 Waypoint-1：來自 Overworld 的即時互動式影片擴散模型★ 75
Hugging Face Blog189 days agoRelease
Overworld has officially launched a new model called "Waypoint-1" on the Hugging Face platform. It is a world model focused on "Real-time Interactive Video…
Fuel your creativity with new generative media models and tools★ 78
Google DeepMind Blog434 days agoNew Tool
Google announced new generative media models and tools at I/O 2025, led by Veo 3 for video, Imagen 4 for images, and Flow for AI filmmaking. Veo 3 adds audio generation, while Imagen 4 improves detail, typography, aspect ratios, and up to 2K output. Google also expanded Lyria 2 and Lyria RealTime access, while continuing SynthID watermarking and launching SynthID Detector.
Hugging Face 釋出 vid_ds_scripts：一站式構建影片生成高品質資料集★ 75
Hugging Face Blog531 days agoNew Tool
With the rise of open-source video generation models such as LTX-Video, HunyuanVideo, and CogVideoX, building high-quality training datasets has become the…