Latest in AI

Showing:voice-agentDevelopersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Google DeepMind 發表 Gemini Omni：全新原生全模態模型，實現超低延遲即時影音與語音互動★ 95
Google DeepMind Blog71 days agoRelease
Google DeepMind has officially unveiled its latest flagship AI model, "Gemini Omni." This model represents a major breakthrough by Google in the field of…
Google DeepMind 推出全新改進版 Gemini 音訊模型，打造更強大的語音互動體驗★ 85
Google DeepMind Blog227 days agoRelease
Google DeepMind has announced a major upgrade to its Gemini audio models, aimed at delivering a more natural, fluid, and low-latency voice interaction…
Gemini 2.5 迎來全新突破：先進的語音對話與音訊生成功能★ 85
Google DeepMind Blog419 days agoRelease
Google DeepMind has announced that its latest-generation model, Gemini 2.5, has achieved new breakthroughs in AI-driven audio dialog and audio generation. This…
Hugging Face 推出 FastRTC：專為 Python 開發者打造的即時語音與視訊 AI 應用庫★ 85
Hugging Face Blog518 days agoRelease
Hugging Face has officially launched FastRTC, an open-source real-time communication (RTC) library designed specifically for Python developers, aimed at…