Back to news
ai Priority 4/5 6/12/2026, 11:05:16 AM

Google DeepMind Introduces Gemini 3.5 Live Translate for Real-Time Voice Translation

Google DeepMind Introduces Gemini 3.5 Live Translate for Real-Time Voice Translation

Google DeepMind has unveiled Gemini 3.5 Live Translate, an advanced model designed to facilitate highly fluid, bidirectional voice translation. By processing spoken input and generating translated speech with minimal latency, this update aims to make cross-lingual conversations feel as natural as face-to-face interactions.

Related tools

Recommended tools for this topic

These picks prioritize high-intent tools relevant to this topic. Some links may include partner or affiliate tracking.

#deepmind#ai#research#official

Comparison

AspectBefore / AlternativeAfter / This
Translation PipelineCascaded systems requiring separate Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS) stepsEnd-to-end multimodal processing within a single unified model architecture
Latency and SpeedHigh cumulative latency due to sequential processing of speech-to-text-to-speech translationNear real-time streaming translation with significantly reduced pause intervals between speaker turns
Tone and EmotionLoss of original vocal nuances, pitch, and emotional context during intermediate text translation phasesPreservation of the speaker's original emotional tone, inflection, and natural voice characteristics

Source: DeepMind Blog

This page summarizes the original source. Check the source for full details.

Related