Mistral AI NewsJun 18, 2026, 9:12 AM

Mistral AI Launches Voxtral: Audio Speech and Understanding Model

Original: Research Voxtral July 15, 2025 Mistral AI

Mistral AI releases Voxtral, its first audio-native language model for transcription and spoken-language understanding.

Mistral AI has announced Voxtral, its debut audio-native language model family targeting speech recognition, multilingual transcription, and audio comprehension. Available in two sizes via Mistral's La Plateforme API, it extends the company's portfolio decisively into multimodal AI. The release positions Mistral as a full-stack AI provider capable of handling voice and audio alongside its established text and code capabilities.

Mistral AI has unveiled Voxtral, its first dedicated audio language model, marking a significant expansion of the company's multimodal AI capabilities. Announced in mid-2025, Voxtral enters the competitive speech AI space with an architecture designed to handle voice and audio input natively — treating audio as a first-class modality rather than an afterthought appended to a text-centric model.

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on Mistral AI News →

mistral #audio-ai #speech-recognition #multimodal #voice #asr

Summaries are AI-generated; the original article is authoritative.