YouTube auto-captions
Free with upload. No speaker labels, no Scripture formatting, English-leaning.
Drop a Sunday service recording — pulpit mic, live-stream rip, or YouTube URL. Get a speaker-labeled transcript with Scripture references intact, in 100+ languages.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
Most churches record a single pulpit mic plus an ambient room feed. We treat the pulpit channel as the primary speaker, suppress congregational noise, and keep amens and responsive readings on a separate label.
Turn with me to Romans chapter 8, verse 28 — and we'll read through 30 together.
Amen.
Paul says, 'And we know that for those who love God all things work together for good.'
Now hear me — all things. Not some things. All things.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
If you upload the service to YouTube, you get auto-captions for free — they're rough. Rev sends it to a human and bills $1.50/min. We sit in the middle: AI accuracy near-human on pulpit audio, priced like the free tools.
Free with upload. No speaker labels, no Scripture formatting, English-leaning.
Drop the file or paste the YouTube URL. Scripture references kept, congregation separated.
A person types it. Highest accuracy, slowest turnaround, expensive at sermon length.
Pricing approximate as of 2026. YouTube caption language list and Rev pricing tiers change without notice.
Specific to sermons
Sermons are not meetings. The vocabulary, the cadence, and the room are all different — flip these and the transcript stops looking like nonsense.
Drop a sermon file and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
The ceiling is set by what your sound booth captured. A direct feed from the pulpit lavalier is the best case; a phone on the back pew is the worst. Numbers below come from real customer sermon files, not synthetic benchmarks.
Pulpit lav or headset mic, mixed clean and pulled from the board. The preacher's voice isolated, no room reflections.
Compressed but mixed by the booth. Slight loss on consonants from AAC encoding. Most sermons land here.
Q&A or congregation prayer where the mic passes around. Some words clip when the mic moves off-axis.
Room recording from a single device. Echo, AC hum, distance from the preacher. Usable for study, expect a cleanup pass.
Common questions
30 free minutes every month. No card. Speaker labels, 100+ languages, all exports included.
Start free