Latest in AI

Showing:real-timeResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Google DeepMind 發表 Gemini Omni：全新原生全模態模型，實現超低延遲即時影音與語音互動★ 95
Google DeepMind Blog71 days agoRelease
Google DeepMind has officially unveiled its latest flagship AI model, "Gemini Omni." This model represents a major breakthrough by Google in the field of…
OpenAI 推出 GPT-Realtime-2、GPT-Translate 與 GPT-Whisper：全新 SOTA 即時語音 API★ 85
Latent Space81 days agoRelease
OpenAI has continued to expand the reach of its GPT-5 technology, officially launching three new voice and audio APIs: GPT-Realtime-2, GPT-Translate, and…
Google DeepMind 推出 Gemini 3.1 Flash Live：讓語音 AI 更自然、更可靠★ 85
Google DeepMind Blog124 days agoRelease
Google DeepMind has officially unveiled its latest voice model, "Gemini 3.1 Flash Live." This model is positioned to deliver lower-latency, higher-precision…
Google DeepMind 推出全新改進版 Gemini 音訊模型，打造更強大的語音互動體驗★ 85
Google DeepMind Blog227 days agoRelease
Google DeepMind has announced a major upgrade to its Gemini audio models, aimed at delivering a more natural, fluid, and low-latency voice interaction…
在 Arm 架構上實現即時 AI 聲音生成：賦予創意自由的個人工具
Hugging Face Blog420 days agoRelease
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
Hugging Face 推出 FastRTC：專為 Python 開發者打造的即時語音與視訊 AI 應用庫★ 85
Hugging Face Blog518 days agoRelease
Hugging Face has officially launched FastRTC, an open-source real-time communication (RTC) library designed specifically for Python developers, aimed at…