Latest in AI

Showing: audio-generation

Noiz AI, HKUST & Tsinghua Open-Source Audio Generation Model: 4 Steps, 0.24s on One GPU
量子位 QbitAI · 43 days ago · Paper
Noiz AI has partnered with Hong Kong University of Science and Technology (HKUST) and Tsinghua University to open-source a large audio generation model. The model's standout claim is efficiency: it needs just four sampling steps to produce audio, and inference completes in 0.24 seconds on a single GPU. The open-source release makes research-grade, low-latency audio synthesis available to developers and researchers globally.
Eleven Music is here ★ 74
ElevenLabs Blog · 50 days ago · New Tool
ElevenLabs announced Eleven Music, a product for generating studio-grade music from natural language prompts. It supports control over genre, style, structure, vocals or instrumentals, multilingual output, and edits to sections or entire songs. The company says it was built with labels, publishers, and artists, is cleared for most commercial uses, and is available on the website with Music API documentation.
Eleven v3 is Now Generally Available
ElevenLabs Blog · 50 days ago · Release
ElevenLabs published a blog post announcing that Eleven v3 is now generally available. The article body was not provided, so the only confirmed detail is the availability milestone; no specific feature, pricing, API, language, or performance changes are confirmed. Developers and creators using voice AI should review the official post before making adoption decisions.
ElevenLabs raises $500M Series D at $11B valuation ★ 74
ElevenLabs Blog · 50 days ago · Business
ElevenLabs announced a $500 million Series D at an $11 billion valuation, more than triple its valuation from a year earlier. The round was led by Sequoia Capital, with A16Z, ICONIQ, Lightspeed, BOND, and others participating. The company says it will invest in ElevenAgents, ElevenCreative, ElevenAPI, voice agents, conversational models, dubbing, audio research, and international expansion.
How to Prompt Grok Imagine Video 1.5
Replicate Blog · 68 days ago · Tutorial
xAI has released Grok Imagine Video 1.5, a model that animates a still image into a short video clip. It generates synchronized audio during the same pass, combining visual animation and sound creation in one workflow. The Replicate Blog post focuses on prompting techniques intended to help users get more from the model.
Generating Music from Chord Progressions and Text Prompts with MusicGen-Chord
Replicate Blog · 993 days ago · Release
The Replicate platform has officially launched MusicGen-Chord, a significant feature upgrade to Meta's open-source music generation model MusicGen. Traditional…
AudioLDM 2 Speed Optimization Guide: How to Make Text-to-Audio and Music Generation Even Faster ⚡️ ★ 75
Hugging Face Blog · 1,063 days ago · Tutorial
AudioLDM 2 is an advanced open-source text-to-audio and text-to-music generation model. However, under its default settings, the model's inference speed is…
Quickly Deploying a MusicGen Music Generation API with Inference Endpoints
Hugging Face Blog · 1,089 days ago · Tutorial
This article provides a detailed walkthrough of how to quickly deploy Meta's open-source MusicGen music generation model using Hugging Face Inference…