What is Scribe V2?
Scribe V2 is ElevenLabs’ most accurate speech-to-text model for batch transcription, subtitling, and captioning, with realtime variant for low-latency live use in 90+ languages.
When was Scribe V2 released?
Scribe V2 was introduced on January 21, 2026, with availability through ElevenLabs API and Studio.
How accurate is Scribe V2?
It claims the lowest word error rate on industry benchmarks, outperforming competitors in diverse audio conditions, accents, and long-form content.
Does Scribe V2 support realtime transcription?
Yes, Scribe V2 Realtime variant delivers ultra-low 150ms latency for live agents, meetings, and conversational AI across 90+ languages.
How much does Scribe V2 cost?
Usage-based pricing starts around $0.40 per hour of audio (lower at scale/enterprise); no unlimited free tier, though limited web testing may be available.
What languages does Scribe V2 support?
Over 90 languages with automatic multi-language detection and transcription in mixed audio files.
What are the key enterprise features of Scribe V2?
Includes SOC 2, HIPAA, GDPR compliance, zero retention mode, data residency, and entity detection for PII/redaction.
How does Scribe V2 handle entities and keyterms?
Native detection for 56 categories with timestamps; supports up to 100 keyterm prompts for context-aware accuracy on specific terms/names.




