Noiz AI, HKUST & Tsinghua Open-Source Audio Generation Model: 4 Steps, 0.24s on One GPU
量子位 QbitAI·4 hours ago·Paper
Noiz AI has partnered with Hong Kong University of Science and Technology (HKUST) and Tsinghua University to open-source a large audio generation model. The model's standout claims are efficiency: just four sampling steps to produce audio, with inference completing in 0.24 seconds on a single GPU. The open-source release brings research-grade, low-latency audio synthesis within reach of developers and researchers globally.