Zoom transcription.Speaker-labeled, any language.

Drop a Zoom call recording. Get a speaker-labeled transcript with timestamps in 99 languages — no Zoom paid plan, no dashboard lock-in.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Wo ọ̀jọ̀ tó yóò jáde

Zoom recording ọ̀wọ̀, transcript irọ́lẹ̀ yoo jáde.

Zoom records each participant on a separate channel when that setting is on — we use it to split speakers without guessing. Mono cloud recording? Acoustic diarization handles the fallback.

Zoom cloud recordingREC 3 speakers · 47:08
auto-detected en-US16 kHz stereo · 128 kbps
~90s
Transcript · streaming96% accuracy
S1

Quick check — Marcus, did the vendor SOW come back signed?

S2

It did, came in Tuesday. I'll forward after this call.

S1

Perfect. And the Q3 forecast review — still Thursday?

S2

Thursday at 2. Deck went out this morning.

96% on stereo cloudSRT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Àwọ́n àyọkà mẹ́ta gidi · ìwé àgbiyanju onígbímọ̀

Zoom built-in. Otter or Fireflies. Or us.

Zoom ships its own transcript on paid tiers. Otter and Fireflies live in your calendar as a bot. We work with the file you already have, or send a bot when you want it live.

Option 01

Zoom built-in

Auto-transcript inside the Zoom app. Locked to paid Zoom plans.

RequiresZoom Pro+ ($15/host/mo)
Speaker diarizationNo (on mono)
Languages1 per call, EN-leaning
ExportVTT only, in dashboard
AI summaryZoom AI Companion (paid)
Cost$15+/host/mo
Best forTeams already on Zoom Pro who only need a rough text dump of one-language meetings.
Option 02

Transcription.Solutions

Drop the recording. Or send a bot. Works with any Zoom plan — including free.

RequiresNothing on Zoom side
Speaker diarizationStereo channel split
Languages99, auto-detected
ExportSRT · VTT · DOCX · TXT · JSON
AI summaryFree on every plan
Cost · per min$0.03
Best forAnyone who records on free Zoom, runs multilingual meetings, or wants the transcript outside Zoom's dashboard.
Option 03

Otter / Fireflies

A bot sits in your calendar. Pretty UI, English-first, hard cap on file size.

RequiresCalendar OAuth + paid
Speaker diarizationAcoustic, EN-tuned
LanguagesNon-EN drops accuracy
Export2 GB cap (Fireflies)
AI summaryBehind paid tier
Cost$17–19/user/mo
Best forEnglish-only sales teams that want a calendar-native bot and never record outside Zoom.

Pricing and feature flags accurate as of May 2026. Zoom AI Companion availability depends on regional rollout.

Zoom pàtó

Àwọ́n ohun ìbátan mẹ́ta tó máa ń ṣe àyàtẹ lori àwọ́n alatẹ̀ gbenàgbenà.

Flip the right settings before you record and the transcript comes back cleaner.

Kí ni ó máa ń ṣe àyàtẹ

  1. 1Mono cloud recording. Zoom mixes everyone into one channel by default. Acoustic diarization then merges similar voices into one speaker.
  2. 2Two-letter company names (PSI, DKMS, Klue) get spelled phonetically. Generic AI doesn't know they're proper nouns.
  3. 3Chat messages with links, IDs, action items live separately from the audio and get lost.

Kí ni ó yẹ kí o ṣe níhìn

  1. 1Turn on Record a separate audio file for each participant in Zoom before the meeting. We detect per-channel files and skip diarization entirely.
  2. 2Paste team vocabulary into Custom vocabulary on the job form. We pass it to the recognizer as a hint, not a hard match.
  3. 3Send the bot to the live meeting (not the recording). Chat messages merge into the transcript in chronological order.

Àwọ́n setting tuntun fun Zoom

Drop a Zoom file and these flip on by default. Override per-job from the form.

Diarization
Per-channel if available
Speaker model
Conversational · 2-8 speakers
Language
Auto-detect · multi-lingual on
Filler words
Removed by default
Summary
Action items + decisions
Export
DOCX · SRT · timestamped TXT

Accuracy · real-world numbers

96%+ on cloud recordings. Holds up on phone dial-in too.

The ceiling is set by what Zoom captured. Stereo per-channel cloud recording is the best case; phone dial-in participants degrade fastest. Numbers below are from actual customer Zoom files in production, not synthetic benchmarks.

96%+
Per-channel cloud recording

Zoom's 'separate audio file per participant' setting on. Each speaker isolated, diarization skipped — text-only error.

94%
Stereo cloud, ≤3 speakers

Default cloud recording, 128 kbps. Stereo channel split distinguishes voices reliably. Most Zoom calls land here.

90%
Mono cloud, 4-6 speakers

Acoustic diarization, similar voices may merge. Plan a 2-min rename pass on the speaker chips.

87%
Phone dial-in participant

8 kHz narrow-band audio. Words usable, occasional misses on numbers and proper nouns. Worst case in our data.

Àwọ́n ibeere tòyè

8 things people ask about Zoom transcription.

01Can you pull from a Zoom cloud recording URL directly?+
No. Zoom cloud URLs need a Zoom account login — we can't impersonate you. Download the MP4 first (zoom.us → Recordings → Download), then drop the file here. Takes 30 seconds.
02Does the bot need calendar OAuth like Otter?+
No. You drop the bot URL into the Zoom invite manually, or send the bot to a meeting URL from our dashboard. We don't read your calendar.
03What about Zoom AI Companion — does it transcribe?+
Zoom AI Companion provides a summary, not a verbatim transcript with speaker labels and exports. It's also locked to paid Zoom plans and disabled in many corporate environments by default.
04Can you handle international Zoom calls with multiple languages?+
Yes — auto-detect picks the dominant language and the recognizer handles code-switching mid-sentence on most language pairs. Spanish ↔ English, Mandarin ↔ English, and most Indo-European pairs work cleanly.
05Is the recording deleted after transcription?+
Yes — 24 hours after job completion. The transcript and exports stay in your account for as long as you want them. Deletion is unrecoverable and logged in your audit trail.
06Will the bot show as a participant in the meeting?+
Yes — a labelled bot joins like a normal participant. You can rename it in your settings (default: 'Transcription.Solutions') and configure auto-disclosure if you're in a two-party-consent region.
07What if Zoom only saved a mono recording?+
Acoustic diarization kicks in — we identify speakers by voiceprint clustering. It's less accurate than per-channel (90% vs 96%) but readable. Rename Speaker 1/2 once and it propagates.
08Can I get just the SRT for re-uploading captions?+
Yes. Every job produces SRT and VTT by default. Re-upload via Zoom's caption upload feature, or to YouTube if you're publishing the recording.

Drop your Zoom recording. See what comes out.

30 free minutes every month. No card. Speaker labels, 99 languages, all exports included.

Start free