Tech Updates

Startups

Artificial Intelligence

Americas

AssemblyAI Launches Multilingual Real-Time Speech Recognition Across Six Languages

AssemblyAI's Universal-Streaming model now supports six languages in a unified architecture while undercutting competitors by up to 9.6x on pricing.

NewDecoded

Published Nov 13, 2025

Nov 13, 2025

3 min read

Image by AssemblyAI

AssemblyAI announced multilingual support for its Universal-Streaming speech-to-text model on November 12, 2025, expanding real-time transcription to six languages: English, Spanish, French, German, Italian, and Portuguese. The release eliminates a critical barrier for companies deploying AI voice agents globally by offering all languages through a single unified model at $0.15 per hour, significantly undercutting competitors like AWS Transcribe ($1.44/hour) and Google Cloud Speech-to-Text ($0.96/hour).

Unified Architecture, Not Language Detection

Unlike competing solutions that route requests through language detection gateways, Universal-Streaming processes all six languages through shared architecture trained simultaneously. This design enables native code-switching within single utterances without additional latency or complexity. The model handles mixed-language phrases like "Je voudrais un coffee, s'il vous plaît" without requiring special handling or preprocessing steps.

Performance Benchmarks on Real-World Audio

AssemblyAI tested the model on diverse audio including call center recordings with background noise, medical consultations with domain terminology, and multi-speaker business meetings. The company reports an average Word Error Rate of 11.77% compared to Deepgram Nova-3's 12.76%, with median latency of 303 milliseconds versus Deepgram's 449 milliseconds. These metrics directly impact user experience in production voice agent deployments where conversation flow depends on sub-400ms response times.

Production-Ready Integration

The model ships with punctuation, capitalization, and intelligent endpointing built in, eliminating post-processing pipelines for downstream LLM integration. Developers can integrate through native support in LiveKit, Vapi, Pipecat, and Daily, or directly via WebSocket API by setting a single parameter. The unified pricing model applies identical rates across all languages, removing variable costs that complicate international expansion planning.

Healthcare and Regulated Use Cases

The consistent accuracy across languages addresses cost concerns in regulated industries where transcription errors compound operational expenses. A 3% higher error rate translates to 5-10% increases in human quality assurance costs at scale, particularly problematic in healthcare documentation and financial services where precision requirements are non-negotiable. AssemblyAI positions the multilingual release as eliminating the traditional trade-off between market reach and product quality.

Decoded Take

This launch represents a strategic pricing and architecture play in the increasingly competitive real-time speech AI market. By offering unified pricing at $0.15/hour across all languages while AWS charges $1.44/hour and Google charges $0.96/hour, AssemblyAI is forcing incumbent cloud providers to defend margins or cede market share in the growing voice agent sector.

The unified model architecture creates a sustainable advantage where future improvements automatically benefit all languages rather than requiring per-language optimization cycles. This timing aligns with enterprise adoption of AI voice agents in customer support, healthcare documentation, and meeting assistants, where multilingual capability has shifted from nice-to-have to deployment requirement.

The real differentiation isn't just language support but the elimination of detection latency and code-switching handling, which matters critically in conversational AI where 100-200 millisecond delays break natural interaction patterns.

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Want to advertise your Data, Analytics, or AI here? Reach out!

NewDecoded

Share this article

News

Apr 22, 2026

NEURA Robotics and AWS Unite to Scale Physical AI Across Global Logistics Networks

News

Apr 22, 2026

NEURA Robotics and AWS Unite to Scale Physical AI Across Global Logistics Networks

News

Apr 22, 2026

NEURA Robotics and AWS Unite to Scale Physical AI Across Global Logistics Networks

News

Apr 22, 2026

VAST Data Hits $30 Billion Valuation as AI Infrastructure Becomes a Global Priority

News

Apr 22, 2026

VAST Data Hits $30 Billion Valuation as AI Infrastructure Becomes a Global Priority

News

Apr 22, 2026

VAST Data Hits $30 Billion Valuation as AI Infrastructure Becomes a Global Priority

News

Apr 23, 2026

Google Unveils Workspace Intelligence to Automate Multi-Step Workflows via Agentic AI

News

Apr 23, 2026

Google Unveils Workspace Intelligence to Automate Multi-Step Workflows via Agentic AI

News

Apr 23, 2026

Google Unveils Workspace Intelligence to Automate Multi-Step Workflows via Agentic AI

News

Apr 23, 2026

Salesforce and Google Cloud Unite AI Agents for Cross-Platform Enterprise Workflows

News

Apr 23, 2026

Salesforce and Google Cloud Unite AI Agents for Cross-Platform Enterprise Workflows

News

Apr 23, 2026

Salesforce and Google Cloud Unite AI Agents for Cross-Platform Enterprise Workflows

News

Apr 23, 2026

Google Cloud Next 2026 Debuts Specialized AI Chips and Gemini Agent Platform

News

Apr 23, 2026

Google Cloud Next 2026 Debuts Specialized AI Chips and Gemini Agent Platform

News

Apr 23, 2026

Google Cloud Next 2026 Debuts Specialized AI Chips and Gemini Agent Platform

News

Apr 23, 2026

PwC Singapore Launches S$4 Million AI-Powered Hub to Navigate Global Trade Complexity

News

Apr 23, 2026

PwC Singapore Launches S$4 Million AI-Powered Hub to Navigate Global Trade Complexity

News

Apr 23, 2026

PwC Singapore Launches S$4 Million AI-Powered Hub to Navigate Global Trade Complexity