Google DeepMind Blog · Dec 12, 2025, 5:50 PM · important 85

Google DeepMind launches new, improved Gemini audio models for more powerful voice interaction experiences

Original: Improved Gemini audio models for powerful voice experiences

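Google DeepMind has announced improved Gemini audio models optimized for real-time voice experiences. The new models use a native multimodal architecture that understands and generates speech directly, without an intermediate speech-to-text step, which substantially reduces latency. The upgrade lets developers build more emotionally expressive and more responsive voice assistants and interactive applications, and it is rolling out on Google AI Studio and Vertex AI.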

Google DeepMind has announced a major upgrade to its Gemini audio models, aimed at delivering a more natural, fluid, and low-latency voice interaction experience for users and developers alike.

gemini google-ai-studio vertex-ai #audio #voice-agent #multimodal #real-time #api

Summaries are AI-generated; the original article is authoritative.