Ars Technica AIJun 9, 2026, 6:57 PMRyan Whitwam

Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation

Google's Gemini 3.5 Live Translate delivers real-time voice translation that preserves speaker tone, pacing, and pitch, with SynthID watermarks for provenance.

Google has announced Gemini 3.5 Live Translate, a real-time voice-to-voice translation system that preserves the original speaker's tone, pacing, and pitch rather than producing flat synthetic output. The system embeds Google's SynthID watermarks into translated audio, enabling AI content provenance detection without affecting audio quality. This extends Google's Gemini Live multimodal API capabilities into cross-language communication scenarios such as meetings, live streams, and customer service.

In June 2026, Google officially announced the launch of Gemini 3.5 Live Translate, a real-time speech-to-speech translation feature built on the Gemini 3.5 model. Its goal is to address a long-standing pain point of existing machine translation systems: the translated voice sounds robotic, losing the original speaker's personal characteristics.

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on Ars Technica AI →

gemini gemini-live synthid #voice-translation #real-time-ai #synthid #watermarking #multimodal

Summaries are AI-generated; the original article is authoritative.