Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation
Google's Gemini 3.5 Live Translate delivers real-time voice translation that preserves speaker tone, pacing, and pitch, with SynthID watermarks for provenance.
Google has announced Gemini 3.5 Live Translate, a real-time voice-to-voice translation system that preserves the original speaker's tone, pacing, and pitch rather than producing flat synthetic output. The system embeds Google's SynthID watermarks into translated audio, enabling AI content provenance detection without affecting audio quality. This extends Google's Gemini Live multimodal API capabilities into cross-language communication scenarios such as meetings, live streams, and customer service.
Google 於 2026 年 6 月正式宣布推出 Gemini 3.5 Live Translate,這是一項建立在 Gemini 3.5 模型之上的即時語音對語音翻譯功能,目標是解決現有機器翻譯系統長期以來的一大痛點:翻譯後的聲音聽起來像機器人,失去了原說話者的個人特質。
Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.
See Pro plans →Want the original English / full article?
Read on Ars Technica AI →Summaries are AI-generated; the original article is authoritative.