Rev (human-typed)
A person types your audio. Highest accuracy on hard audio — pay for it in dollars and hours.
Drop a source interview — phone, lavalier, field recorder, or press conference. Get a speaker-labeled transcript with citation timestamps, then a DOCX your fact-checker can mark up.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
Every speaker turn gets a timestamp so you can jump back to the audio when your editor or fact-checker challenges a quote. Filler words stay in by default — quote integrity matters.
Walk me through when you first noticed the cost overrun on the housing project.
It was the March 14 finance committee. The number jumped from 22 to 31 million with no memo.
And nobody on the council asked where the extra nine million went?
One person did. It's on the recording. After that, the item moved to closed session.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
Rev's human-typed tier is the legacy newsroom default — accurate, expensive, slow. Trint built a newsroom-tuned AI editor. We do the AI transcript, kill the file in 24 hours, and stay out of the way.
A person types your audio. Highest accuracy on hard audio — pay for it in dollars and hours.
AI transcript in minutes. Source audio deleted in 24 hours. Custom vocabulary for source names and place names.
Newsroom-tuned AI editor with collaborative workflows. Strong product, subscription pricing.
Pricing approximate as of 2026. Rev's automated tier is separate from the human-typed tier compared here.
Specific to journalism
Before you upload the file, flip the right settings — the transcript comes back closer to publishable.
Drop an interview and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
Reporters record in conditions transcribers don't always plan for. The microphone and the room set the ceiling — not the model. Numbers below are from actual journalist files in production, not synthetic benchmarks.
Podcast-grade setup, one source across the table on a cardioid or lav. Proper nouns are the only failure mode.
Cooperative source on broadband. Some loss on numbers, bill IDs, and unfamiliar last names. Custom vocabulary closes most of it.
Tascam or phone in a coat pocket. HVAC, traffic, dishes. Words are usable for quote pulls — expect a verify pass on the audio.
Six reporters shouting questions, podium PA echo, no individual mics. Diarization will fuse some questioners. Worst case in our data.
Common questions
30 free minutes every month. No card. Speaker labels, citation timestamps, 24-hour audio deletion on every plan.
Start free