Transcribe M4A from Apple devices.No conversion needed.

Drop the M4A recording straight from Voice Memos, QuickTime, or any Apple app. Speaker labels, timestamps, 99 languages — no convert-to-MP3 dance, no iCloud middleman.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Watch what comes out

Voice Memo in. Reportable transcript out.

M4A is AAC inside an MPEG-4 container — iPhone Voice Memos and Mac QuickTime both default to it. We read the container directly, pull the AAC stream, and skip any re-encoding step that would degrade the audio.

Voice Memo · iPhone 15REC 2 speakers · 38:42
auto-detected en-USAAC 64 kbps · 44.1 kHz mono
~90s
Transcript · streaming94% accuracy
S1

Before we get into the funding round — can I record this for my notes?

S2

Yeah, that's fine. Off the record on the board stuff though.

S1

Understood. So walk me back to when you first met the lead investor.

S2

That was March, at a dinner in Palo Alto. Completely cold intro.

94% on Voice Memo monoSRT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Three real options · honest comparison

Apple's built-in. Otter. Or us.

iOS 18 added on-device transcription to Voice Memos. Otter wants you to import every M4A into its app library. We take the file and give you the transcript — no library, no app install.

Option 01

Voice Memos (iOS 18+)

On-device transcription baked into the Voice Memos app. Free, but very limited.

RequiresiPhone 12+ on iOS 18
Speaker diarizationNo
Languages~13, EN-leaning
ExportCopy-paste from app
TimestampsNone
CostFree
Best forQuick personal voice notes on a recent iPhone where you just want to skim what you said.
Option 02

Transcription.Solutions

Upload the M4A as-is. Speaker labels, timestamps, every export format.

RequiresA browser
Speaker diarizationAcoustic, 2-10 speakers
Languages99, auto-detected
ExportSRT · VTT · DOCX · TXT · JSON
TimestampsWord-level
Cost · per min$0.03
Best forJournalists, researchers, and students who need a citable transcript from a phone-recorded interview.
Option 03

Otter.ai

Polished web app. Wants the file in its library, English-first, file caps on free tier.

RequiresAccount + app upload
Speaker diarizationEN-tuned only
LanguagesEN / ES / FR only
ExportPaid tier required
File size300 MB cap, free tier
Cost$17/user/mo (Pro)
Best forEnglish-only users who want a long-term library of meetings and don't mind a monthly subscription.

Pricing and feature flags accurate as of May 2026. Voice Memos transcription availability depends on iOS version and device chip.

Specific to M4A

Three things that bite people on generic transcription tools.

Most issues are about how the M4A was captured, not the format itself.

What goes wrong

  1. 1Sharing the Voice Memo via iCloud link. Generic tools can't fetch from icloud.com — they need the actual file. The 'Share' sheet defaults to a link, not the M4A.
  2. 2Phone laid flat on a wooden table. Voice Memos picks up surface vibration from typing, cups, phone notifications. Diarization gets confused by the rumble.
  3. 3Long interviews split across multiple memos. Voice Memos auto-stops on calls or low battery. You end up with three M4As and lose context across them.

What to flip here

  1. 1On iPhone: open Voice Memos → tap the memo → ••• → Save to Files. Then upload the file. AirDrop to a Mac also works — the M4A lands intact.
  2. 2Prop the phone against a book or coffee cup so the mic faces speakers, not the table. Or use the Lightning/USB-C lavalier if you have one.
  3. 3Drop all three M4As in one job — we concatenate in upload order and run diarization across the merged audio so speaker labels stay consistent.

Recommended job settings for M4A

Drop an M4A and these flip on by default. Override per-job from the form.

Container handling
Read AAC/ALAC stream direct
Speaker model
Interview · 2-6 speakers
Language
Auto-detect · multi-lingual on
Noise profile
Phone-mic field recording
Filler words
Kept (toggle for journalism)
Export
DOCX · SRT · timestamped TXT

Accuracy · real-world numbers

94% on a Voice Memo. Holds up when the phone is across the table.

M4A's AAC codec is kind to speech — the ceiling is set by where the phone was, not the file format. Numbers below are from actual customer Voice Memo and QuickTime files, not synthetic benchmarks.

95%
Phone held near speaker, quiet room

Classic 1-on-1 interview, phone 30 cm from the talker. Voice Memos at default 64 kbps AAC is enough — error is text-only.

94%
Voice Memos · Lossless mode

Settings → Voice Memos → Audio Quality: Lossless. ALAC inside the M4A container at ~1 Mbps. Marginal gain over the default for speech.

89%
Phone on table, 3-4 speakers

Roundtable interview, phone in the middle. Acoustic diarization holds for distinct voices; nearby chairs and laptop fans bleed in.

82%
Field recording · cafe or street

Espresso machine, traffic, second conversation behind you. Words usable for quoting; expect a re-listen pass on numbers and names.

Common questions

8 things people ask about M4A transcription.

01Do I need to convert M4A to MP3 first?+
No. We read the M4A container directly and pull the AAC (or ALAC) audio stream as-is. Converting to MP3 would actually lose quality — AAC at the same bitrate sounds cleaner than MP3.
02Does it work with iPhone Voice Memos files?+
Yes — Voice Memos is the most common source we see. Open the memo → ••• → Save to Files, or AirDrop to a Mac, then upload. The M4A header includes the recording date, which we preserve in the transcript metadata.
03What about QuickTime screen recordings from a Mac?+
QuickTime exports MOV by default but audio-only recordings save as M4A. Both work. For MOV with a video track, we extract the audio server-side and transcribe — you don't need to demux first.
04Can I upload an iCloud share link?+
No. iCloud requires an Apple ID login we can't impersonate. Download the M4A locally first (Files app or icloud.com → Download), then upload here. Takes about 20 seconds.
05What's the max file length?+
Up to 10 hours per file on the standard plan. A 4-hour Voice Memo at default quality is around 110 MB — well under the 5 GB upload cap. Lossless mode gets larger; chunk it across two uploads if you hit the cap.
06Will speaker labels work with the phone on the table?+
Yes, if the voices are distinct enough acoustically — most 2-4 person interviews are fine. If two participants sound very similar, expect to rename a few chips manually. Stereo external mics (Shure MV88, RØDE VideoMic) help a lot.
07Does it handle ALAC (Apple Lossless) inside M4A?+
Yes. Voice Memos' Lossless setting writes ALAC into the M4A container instead of AAC. We detect the codec from the container metadata and decode either path. Lossless gives a small accuracy bump in noisy environments.
08How fast is turnaround on a 1-hour M4A?+
Usually 4-6 minutes. Upload is the slow part on phone hotspot connections — a 1-hour Voice Memo is ~28 MB at default quality. The source audio is deleted within 24 hours of the job completing.

Drop your M4A. See what comes out.

30 free minutes every month. No card. Speaker labels, 99 languages, all exports included.

Start free