100+ languages
From Mandarin to Maltese. Auto-detected — you don't pick a language up front.
Upload any audio or video. Get accurate .srt subtitles with speaker labels in 100+ languages. Free up to 5 minutes anonymously, 30 minutes per month with a free account.
MP3 · WAV · M4A · OGG · OPUS · FLAC · MP4 · MOV · MKV · WEBM
Anonymous: up to 5 min/ 100 MB. Sign up free for 30 min / month + bigger files.
What you get
Indexed cues. Hours:minutes:seconds,milliseconds timestamps. UTF-8 text. No proprietary wrappers. Open it in Notepad if you want — it’s just text.
product-meeting.mp3 ───────────────────────── duration 12:34 size 11.4 MB language auto-detect → en-US sample 48 kHz · 16-bit · mono codec MP3 / 192 kbps status uploaded → queued
1 00:00:00,420 → 00:00:03,180 Sarah: Alright, let’s lock the Q3 launch date. 2 00:00:03,310 → 00:00:06,840 Marcus: September 14 works — engineering signed off Friday. 3 00:00:07,020 → 00:00:09,560 Sarah: Great. I’ll get marketing on the launch brief today. 4 00:00:09,710 → 00:00:11,430 Marcus: One concern — pricing page copy is still in review.
How it works
No setup. No installs. No format conversion. Drop a file, get an SRT back — same workflow whether you’re subtitling a podcast clip, a Zoom call, or a one-hour lecture.
Drag and drop — or click choose. MP3, WAV, M4A, MP4, MOV and 6 more formats accepted. Up to 5 minutes anonymously, 30 minutes per month on the free plan, 600 on Pro.
Auto-detects language from the first seconds, transcribes every word, and aligns each line to a millisecond-precise timestamp. A 60-minute file finishes in roughly 90 seconds.
Get a standards-compliant .srt ready for YouTube, Premiere, DaVinci Resolve, VLC — anything that reads subtitles. VTT, DOCX, plain text, and PDF exports come with it.
What’s in the box
Each export is checked against the real format spec — players accept the file, no fix-up step needed.
From Mandarin to Maltese. Auto-detected — you don't pick a language up front.
Millisecond-precision cues. Drop the SRT straight into Premiere or DaVinci Resolve — every line is already aligned.
Native diarization tags each line with the speaker. Rename them in the editor and exports update everywhere.
SRT, VTT, plain TXT, DOCX with speakers, JSON with word timings, and a branded PDF — all from one transcription.
Real ratio on production. A one-hour podcast is done before you finish making coffee. No batch queues on paid plans.
Files auto-deleted in 24h. No training on your content. TLS in transit, encryption at rest, EU/US infrastructure.
Common questions
600 minutes a month, files up to 5 GB, native speaker diarization, AI summaries and action items, meeting bot for Zoom / Google Meet / Microsoft Teams.
See Pro planAlready a customer? Open the dashboard