Drop a Zoom call recording. Get a speaker-labeled transcript with timestamps in 99 languages. No paid Zoom plan, no dashboard lock-in.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
Zoom records each participant on a separate channel when that setting is on — we use it to split speakers without guessing. Mono cloud recording? Acoustic diarization handles the fallback.
Quick check — Marcus, did the vendor SOW come back signed?
It did, came in Tuesday. I'll forward after this call.
Perfect. And the Q3 forecast review — still Thursday?
Thursday at 2. Deck went out this morning.
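For the curious, here is roughly how per-participant audio makes speaker labels a lookup instead of a guess. This is a minimal sketch, not our production code: `frame_rms`, `label_by_track`, the frame size, and the silence threshold are all hypothetical names and values, and the audio is assumed to be already decoded into one list of float samples per participant.

```python
import math
from typing import Dict, List, Tuple

def frame_rms(samples: List[float], start: int, size: int) -> float:
    """Root-mean-square level of one frame; 0.0 if the track has ended."""
    window = samples[start:start + size]
    if not window:
        return 0.0
    return math.sqrt(sum(s * s for s in window) / len(window))

def label_by_track(
    tracks: Dict[str, List[float]],
    frame_size: int = 4,       # assumed value, tiny so the example is readable
    silence: float = 0.05,     # assumed threshold below which nobody is speaking
) -> List[Tuple[int, int, str]]:
    """Return (start_sample, end_sample, speaker) runs, one per stretch of speech."""
    length = max((len(s) for s in tracks.values()), default=0)
    segments: List[Tuple[int, int, str]] = []
    for start in range(0, length, frame_size):
        # The speaker for this frame is whichever track is loudest above silence.
        loudest, loudest_rms = None, silence
        for name, samples in tracks.items():
            rms = frame_rms(samples, start, frame_size)
            if rms > loudest_rms:
                loudest, loudest_rms = name, rms
        if loudest is None:
            continue  # silent frame, no segment emitted
        end = min(start + frame_size, length)
        # Extend the previous segment when the same speaker continues without a gap.
        if segments and segments[-1][2] == loudest and segments[-1][1] == start:
            segments[-1] = (segments[-1][0], end, loudest)
        else:
            segments.append((start, end, loudest))
    return segments
```

With one track per person, the speaker name comes straight from the track that carries the energy. On a mono mix there is only one track, so voices have to be told apart acoustically instead.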
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
Zoom ships its own transcript on paid tiers. Otter and Fireflies live in your calendar as a bot. We work with the file you already have, or send a bot when you want it live.
Auto-transcript inside the Zoom app. Locked to paid Zoom plans.
Drop the recording. Or send a bot. Works with any Zoom plan — including free.
A bot sits in your calendar. Pretty UI, English-first, hard cap on file size.
Pricing and feature flags accurate as of May 2026. Zoom AI Companion availability depends on regional rollout.
Specific to Zoom
Flip the right settings before you record and the transcript comes back cleaner.
Drop a Zoom file and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
The ceiling is set by what Zoom captured. Stereo per-channel cloud recording is the best case; phone dial-in participants degrade fastest. Numbers below are from actual customer Zoom files in production, not synthetic benchmarks.
Zoom's 'separate audio file per participant' setting on. Each speaker is isolated, so diarization is skipped and the only errors left are in the text itself.
Default cloud recording, 128 kbps. Stereo channel split distinguishes voices reliably. Most Zoom calls land here.
Mono recording. Acoustic diarization takes over, and similar voices may merge. Plan a 2-minute rename pass on the speaker chips.
8 kHz narrow-band audio. Words usable, occasional misses on numbers and proper nouns. Worst case in our data.
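The cases above boil down to a decision about what the file contains. A minimal sketch of that decision, assuming hypothetical field names (`participant_tracks`, `channels`, `sample_rate_hz`) and strategy labels; it illustrates the ordering described here, not the actual pipeline.

```python
from dataclasses import dataclass

@dataclass
class Recording:
    participant_tracks: int  # separate per-participant audio files found (hypothetical field)
    channels: int            # channels in the main audio stream
    sample_rate_hz: int      # e.g. 8000 for phone dial-in audio

def pick_strategy(rec: Recording) -> str:
    """Choose how speakers get labeled, best case first."""
    if rec.participant_tracks > 1:
        return "per_track"             # each speaker already isolated, no diarization
    if rec.channels >= 2:
        return "channel_split"         # split voices by stereo channel
    return "acoustic_diarization"      # mono fallback, similar voices may merge

def is_narrow_band(rec: Recording) -> bool:
    """Flag 8 kHz audio, where numbers and proper nouns are most at risk."""
    return rec.sample_rate_hz <= 8000
```

For example, `pick_strategy(Recording(participant_tracks=0, channels=1, sample_rate_hz=8000))` falls through to acoustic diarization, and `is_narrow_band` flags the same file for a closer read of numbers and names.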
Common questions
30 free minutes every month. No card. Speaker labels, 99 languages, all exports included.
Start free