Latest in AI

Eleven Music is here ★ 74
ElevenLabs Blog · 50 days ago · New Tool
ElevenLabs announced Eleven Music, a product for generating studio-grade music from natural language prompts. It supports control over genre, style, structure, vocals or instrumentals, multilingual output, and edits to sections or entire songs. The company says it was built with labels, publishers, and artists, is cleared for most commercial uses, and is available on the website with Music API documentation.
Introducing Music v2 ★ 72
ElevenLabs Blog · 50 days ago · Release
ElevenLabs says Music v2 improves vocals, instrumentation, arrangement, and multilingual generation across genres. The model adds finer control through inpainting, section-by-section song building, reference matching, and more complex vocal or stylistic handling. It powers ElevenMusic and ElevenCreative now, with ElevenAPI access coming soon, and is positioned for musicians, developers, brands, and content teams.
Google launches Gemini 3.5 Flash at I/O: fully integrated across its products, but API prices rise significantly ★ 85
Simon Willison's Weblog · 69 days ago · Release
Google officially unveiled Gemini 3.5 Flash at its 2026 I/O conference. Unlike previous launches, this new model skipped the `-preview` stage and went directly…
OpenAI launches GPT-Realtime-2, GPT-Translate, and GPT-Whisper: new SOTA real-time voice APIs ★ 85
Latent Space · 81 days ago · Release
OpenAI has continued to expand the reach of its GPT-5 technology, officially launching three new voice and audio APIs: GPT-Realtime-2, GPT-Translate, and…
Vercel AI SDK adds WhatsApp adapter support for easily building WhatsApp AI bots ★ 72
Vercel Changelog · 139 days ago · Release
Vercel announced in its Changelog that the AI SDK has officially added WhatsApp Adapter support. This update is a significant convenience for developers…
Google DeepMind launches improved Gemini audio models for more powerful voice interactions ★ 85
Google DeepMind Blog · 227 days ago · Release
Google DeepMind has announced a major upgrade to its Gemini audio models, aimed at delivering a more natural, fluid, and low-latency voice interaction…
Google launches the Gemini 2.5 Computer Use model: built on Gemini 2.5 Pro, available in API preview ★ 85
Google DeepMind Blog · 277 days ago · Release
Google DeepMind has officially launched the new dedicated "Gemini 2.5 Computer Use" model, which is now available in preview via API. This model is built on…
xAI officially opens the Grok 3 and Grok 3-mini APIs, emphasizing reasoning and very low pricing ★ 85
TLDR AI (Buttondown) · 465 days ago · Release
Grok 3, the flagship model from Elon Musk's xAI, has officially opened API access months after launch, and simultaneously surprised…
A new milestone for SOTA video generation: Google Veo 2 and Kuaishou Kling 2 reach general availability (GA) for developers ★ 85
TLDR AI (Buttondown) · 468 days ago · Release
AI video generation has reached a major milestone: Google's Veo 2 and Kuaishou's Kling 2, currently ranked at the top of the Artificial Analysis Video Arena…
[AINews] Undercurrents beneath the calm: OpenAI memory update and GPT-4.1 leak, xAI officially launches the Grok 3 API ★ 75
TLDR AI (Buttondown) · 473 days ago · Release
Although AINews characterized these two days as "a calm day," there were undercurrents among tech giants and the open-source community. First, on…
Stable Diffusion 3.5 officially arrives on Replicate: supports fast deployment and invocation via API ★ 75
Replicate Blog · 644 days ago · Release
Stability AI has announced its latest text-to-image model, Stable Diffusion 3.5. At the same time, the well-known AI model hosting platform Replicate has…
Replicate closes $40 million Series B led by a16z to accelerate enterprise development on open-source AI ★ 75
Replicate Blog · 966 days ago · Business
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
Llama 2 launch day: a roundup of the first 24 hours, a celebration for the open-source community and Replicate ★ 80
Replicate Blog · 1,105 days ago · Release
Meta officially launched the highly anticipated open-source large language model Llama 2 on July 18, 2023, immediately triggering a tsunami of cascading…