Voicemail to text.100+ languages, any carrier format.

Drop a voicemail recording from Google Voice, Twilio, RingCentral, or a mobile carrier. Get a timestamped transcript with phone numbers formatted, language auto-detected — MP3, WAV, OGG, or AMR.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Watch what comes out

Carrier audio in. Searchable text out.

Voicemail is single-speaker narrow-band audio — usually 8 kHz, often with traffic or wind behind it. We tune the recognizer for short, telephony-band recordings so callbacks and numbers actually land.

voicemail-0427-1142.mp3REC 1 speaker · 0:38
auto-detected en-US8 kHz mono · μ-law
~90s
Transcript · streaming89% accuracy
S1

Hi, this is Janet calling from Westfield Property Management about the lease renewal on the Larkin Street unit.

S1

We sent the paperwork over Tuesday — wanted to confirm you received it before the 30th.

S1

Best number to reach me is 415-555-0188, extension 204.

S1

Thanks, give me a call back when you get a chance.

89% on 8 kHz monoTXT · DOCX · JSON · SRT

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Three real options · honest comparison

Google Voice built-in. YouMail. Or us.

Google Voice ships free transcripts that are fine for a one-line gist. YouMail is a consumer visual-voicemail app. We process the file you export — any carrier, any format, with formatting and exports built for paste-into-CRM workflows.

Option 01

Google Voice / Gmail built-in

Free auto-transcript on every Google Voice message. English-only and a one-shot text dump.

RequiresGoogle Voice number
LanguagesEnglish only
Phone number formattingInline, often broken
Bulk uploadNo — per-message only
ExportEmail body text
CostFree
Best forSolo users on Google Voice who only need a rough English gist in their inbox.
Option 02

Transcription.Solutions

Drop the WAV, MP3, OGG, or AMR. Get formatted text back — any carrier, any language.

RequiresJust the audio file
Languages100+, auto-detected
Phone number formattingNormalized E.164 + local
Bulk uploadDrop a folder, runs parallel
ExportTXT · DOCX · JSON · SRT
Cost · per min$0.03
Best forAnyone batching voicemails out of a PBX, supporting non-English callers, or pushing text into a CRM.
Option 03

YouMail

Consumer visual-voicemail app. Replaces your carrier's voicemail entirely — not a file-based tool.

RequiresCarrier conditional forwarding
LanguagesEnglish-leaning
Phone number formattingCaller ID only
Bulk uploadNo — live forwarding only
ExportIn-app + email
Cost$5–18/mo per number
Best forMobile users who want to replace their carrier voicemail with a unified inbox.

Pricing accurate as of May 2026. Google Voice transcript availability varies by region and account type.

Specific to voicemail

Three things that bite people on generic transcription tools.

Voicemail isn't a meeting. The defaults that work for podcasts will mangle a 30-second callback.

What goes wrong

  1. 1Phone numbers spoken fast get transcribed as words ("four one five five five five oh one eight eight") instead of formatted digits — useless for CRM paste.
  2. 2Caller names are mumbled once at the start. Generic models miss the spelling and the rest of the message has no anchor.
  3. 3AMR / OGG files from IP-PBX systems get rejected outright by tools built around MP4 podcast audio.

What to flip here

  1. 1Turn on Phone number formatting in the job form. We normalize digits into E.164 (+14155550188) and a readable local format in the same line.
  2. 2Paste likely caller names and your company terms into Custom vocabulary. Even a 10-name list dramatically lifts proper-noun recall on short audio.
  3. 3Drop the file as-is. We accept WAV, MP3, OGG, AMR, M4A, FLAC, μ-law, A-law — no transcoding step needed.

Recommended job settings for voicemail

Upload a voicemail file and these flip on by default. Override per-job from the form.

Speaker model
Single speaker · monologue
Audio profile
Telephony 8 kHz narrow-band
Language
Auto-detect · 100+ languages
Phone numbers
Format as E.164 + local
Filler words
Kept (tone matters)
Export
TXT �� DOCX · JSON (CRM-ready)

Accuracy · real-world numbers

92% on clean VoIP. Holds up on PSTN landline too.

Voicemail is the hardest audio we see — 8 kHz narrow-band, single mic, often with road or café noise. These numbers are from real customer voicemail batches in production, not curated samples.

92%
Google Voice / Teams Phone MP3

Wideband 16 kHz capture, MP3 at 64 kbps+. Quiet indoor caller. Numbers and proper nouns land cleanly.

89%
Twilio / RingCentral WAV

Standard 8 kHz μ-law VoIP recording. Most business voicemail lands here. Phone numbers normalize correctly.

83%
Mobile carrier OGG / AMR

AMR-NB at 4.75–12.2 kbps from IP-PBX or carrier visual voicemail. Compression artifacts on sibilants and digits.

76%
PSTN landline, background noise

Older copper line, caller in a car or on speakerphone. Words usable, occasional misses on numbers and names.

Common questions

8 things people ask about voicemail transcription.

01Can you pull voicemails directly from Google Voice or Gmail?+
Not via API — Google doesn't expose voicemail audio that way. Download the MP3 attachment from the notification email, or use Google Voice's per-message download. Drop the file in our dashboard or batch-upload a folder.
02Do you support AMR files from old IP-PBX systems?+
Yes. AMR-NB and AMR-WB both work, along with WAV (μ-law, A-law, PCM), MP3, OGG, M4A, and FLAC. We handle the codec internally — no need to transcode to WAV first.
03Will phone numbers in the message be formatted correctly?+
Yes, when Phone number formatting is on. We detect spoken digits and output both E.164 (+14155550188) and a readable local format on the same line. Works in 40+ country dialing conventions.
04How do you handle very short messages — under 10 seconds?+
Fine. There's no minimum length. Sub-10-second messages bill at our 6-second floor ($0.003 per file). Accuracy holds because the model isn't waiting for context — voicemail is a monologue, not a conversation.
05What about Spanish or multilingual voicemails?+
Auto-detect runs across 100+ languages and picks the dominant one. For voicemails that switch mid-message (English greeting, Spanish body) toggle multilingual mode — we transcribe both segments in their own language without forcing one.
06Can I bulk upload 200 voicemails at once?+
Yes. Drag a folder into the dashboard or POST to our batch endpoint. Jobs run in parallel, you get a CSV index back with filename, language, duration, and a link to each transcript. No per-batch surcharge.
07Can transcripts be pushed straight into HubSpot or Salesforce?+
We don't ship a direct CRM connector yet. The JSON export includes caller ID (if you pass it in metadata), formatted phone numbers, and the full transcript — most teams pipe it through Zapier or a 20-line script to their CRM contact record.
08Voicemails often contain PII or medical info — how do you handle it?+
Source audio is permanently deleted within 24 hours. Transcripts live in your dashboard until you delete them. We're not a HIPAA Business Associate today — if you need a signed BAA, talk to us before uploading PHI.

Drop a voicemail file. See what comes out.

30 free minutes every month. No card. Phone-number formatting, 100+ languages, all exports included.

Start free