Voice Memos (iOS 18+)
On-device transcription baked into the Voice Memos app. Free, but very limited.
Drop the M4A recording straight from Voice Memos, QuickTime, or any Apple app. Speaker labels, timestamps, 99 languages — no convert-to-MP3 dance, no iCloud middleman.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
M4A is AAC inside an MPEG-4 container — iPhone Voice Memos and Mac QuickTime both default to it. We read the container directly, pull the AAC stream, and skip any re-encoding step that would degrade the audio.
Before we get into the funding round — can I record this for my notes?
Yeah, that's fine. Off the record on the board stuff though.
Understood. So walk me back to when you first met the lead investor.
That was March, at a dinner in Palo Alto. Completely cold intro.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
iOS 18 added on-device transcription to Voice Memos. Otter wants you to import every M4A into its app library. We take the file and give you the transcript — no library, no app install.
On-device transcription baked into the Voice Memos app. Free, but very limited.
Upload the M4A as-is. Speaker labels, timestamps, every export format.
Polished web app. Wants the file in its library, English-first, file caps on free tier.
Pricing and feature flags accurate as of May 2026. Voice Memos transcription availability depends on iOS version and device chip.
Specific to M4A
Most issues are about how the M4A was captured, not the format itself.
Drop an M4A and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
M4A's AAC codec is kind to speech — the ceiling is set by where the phone was, not the file format. Numbers below are from actual customer Voice Memo and QuickTime files, not synthetic benchmarks.
Classic 1-on-1 interview, phone 30 cm from the talker. Voice Memos at default 64 kbps AAC is enough — error is text-only.
Settings → Voice Memos → Audio Quality: Lossless. ALAC inside the M4A container at ~1 Mbps. Marginal gain over the default for speech.
Roundtable interview, phone in the middle. Acoustic diarization holds for distinct voices; nearby chairs and laptop fans bleed in.
Espresso machine, traffic, second conversation behind you. Words usable for quoting; expect a re-listen pass on numbers and names.
Common questions
30 free minutes every month. No card. Speaker labels, 99 languages, all exports included.
Start free