WhatsApp built-in
On-device transcription inside the chat bubble. Limited languages, no export.
Export the WhatsApp voice note from any chat, drop the .opus file in as-is. Text back in seconds — no MP3 conversion, no 'unsupported format' errors, 99 languages including code-switching.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
WhatsApp encodes voice notes as 16 kHz mono OPUS at roughly 24 kbps — compact, but not what generic transcription tools expect. We decode OPUS natively and skip the conversion dance.
Hey, quick one — I'm running late for dinner, traffic on the bridge is insane.
Can you tell Priya to start without me? I'll be there by 8:30 at the latest.
Y dile que pida lo de siempre, the pad thai with extra peanuts.
Okay, gotta go, calling you when I park.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
WhatsApp now transcribes voice notes inside the app on recent iOS and Android — but only in a handful of languages and only on-device. Generic transcribers refuse the .opus file. We take the export and run it.
On-device transcription inside the chat bubble. Limited languages, no export.
Drop the .opus or .ogg as exported. Multi-language, exportable, batch-friendly.
Won't accept .opus. Convert to MP3 first, lose quality, lose minutes.
WhatsApp built-in transcription availability and language list accurate as of 2026. Competitor OPUS support checked on free tiers.
Specific to WhatsApp
Most tools were built for meeting MP4s, not phone voice notes. The codec, the languages, and the burst length all trip them up.
Drop a .opus and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
WhatsApp's 16 kHz mono OPUS caps what's recoverable — no stereo, no high frequencies. Numbers below are from real customer voice notes (de-identified), not synthetic test sets. The recording environment matters more than the codec here.
Quiet room, phone held normally. Native speaker, no code-switching. This is the realistic ceiling for OPUS at 24 kbps.
Kitchen, café, kids in the background. Filler words and overlaps from background voices may sneak in but rarely change meaning.
Footsteps, traffic hum, occasional wind. Numbers and proper nouns are the first to slip. Expect a 1-minute proofread per 5-minute note.
Wind buffeting, road noise above the voice. Words usable in chunks, full sentences less reliable. Worst case in our WhatsApp dataset.
Common questions
30 free minutes every month. No card. OPUS and OGG accepted as-is, 99 languages, all exports included.
Start free