Drop a research interview or focus group recording. Get speaker-labelled, timestamped text ready for NVivo, Atlas.ti, or MaxQDA — with audio deleted within 24 hours.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
We mark each participant turn with a timestamp at the start, keep filler words if you ask for verbatim, and export DOCX with speaker styles your CAQDAS tool already recognises.
Interviewer: Can you walk me through the first time you noticed the change in the neighbourhood?
Participant: Um, it was probably 2019 — the bakery on the corner shut, and, yeah, that's when it hit me.
Interviewer: And what did that feel like, watching that happen over those months?
Participant: Honestly? Like the place I'd known for thirty years was vanishing, piece by piece.
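If you post-process transcripts yourself, the turn layout described above (a timestamp at the start of each turn, filler words kept only in verbatim mode) can be sketched in a few lines of Python. This is a minimal illustration, not our implementation: `Turn`, `format_turn`, the `[hh:mm:ss]` layout, and the filler list are hypothetical choices made for the example, and the start time in the usage line is made up.

```python
import re
from dataclasses import dataclass

# Hypothetical filler list for illustration; a real verbatim policy is study-specific.
FILLERS = ("um", "uh", "erm")

# Matches a whole-word filler plus any trailing comma/full stop and whitespace.
_FILLER_RE = re.compile(
    r"\b(?:" + "|".join(FILLERS) + r")\b[,.]?\s*", re.IGNORECASE
)


@dataclass
class Turn:
    speaker: str
    start_seconds: float
    text: str


def timestamp(seconds: float) -> str:
    """Render a start time in seconds as hh:mm:ss."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_turn(turn: Turn, verbatim: bool = True) -> str:
    """Format one turn as '[hh:mm:ss] Speaker: text'."""
    text = turn.text
    if not verbatim:
        # Clean mode: drop fillers, then restore the leading capital.
        text = _FILLER_RE.sub("", text).strip()
        if text:
            text = text[0].upper() + text[1:]
    return f"[{timestamp(turn.start_seconds)}] {turn.speaker}: {text}"


# Usage with an invented start time of 75.4 seconds.
turn = Turn("Participant", 75.4, "Um, it was probably 2019.")
print(format_turn(turn))                  # keeps "Um,"
print(format_turn(turn, verbatim=False))  # drops it
```

Keeping the verbatim switch at formatting time, rather than editing the stored text, means the same recording can yield both a verbatim and a cleaned transcript.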
↓ This is the dashboard
Same layout as the real dashboard: Summary, full Transcript, Speakers tab, and Exports. Key points and action items are extracted automatically, and every job is auto-tagged.
Sample preview from a founder interview about post-call workflow. Real transcripts use the same tabs, summary block, key-points / action-items split, and auto-tag chips.
Three real options · honest comparison
Rev's human service is the historical default for dissertation-grade quotes. NVivo bundles AI transcription inside the CAQDAS tool itself. We sit between the two: faster than Rev, and more accurate and IRB-friendlier than NVivo's built-in transcription.
Humans type it. Slow, expensive, but the gold standard for publishable verbatim.
AI transcript in minutes, audio deleted in 24h, DOCX styled for NVivo and Atlas.ti import.
AI transcription bundled inside your CAQDAS tool or note-taker. Convenient, but English-leaning and with less control.
Pricing and feature flags are accurate as of 2026. Rev's AI/human split and NVivo Transcription credit pricing vary by region and academic licensing.
Specific to qualitative research
Flip the right settings before you upload and the transcript imports straight into your CAQDAS project.
Drop a field recording and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
Field audio is the hard case in transcription: open rooms, accented English, overlapping speech in focus groups. Dyadic interviews recorded on lavalier mics reach the highest accuracy; ambient field recordings and large focus groups degrade fastest. Numbers below come from actual researcher uploads, not synthetic benchmarks.
Quiet room, single L2 or native speaker, recorder on the table. Best case for semi-structured interviews — most dyadic studies land here.
Zoom H4n or phone recorder mid-table. Speakers are told apart by the direction of their seat from the recorder. Plan a 5-minute relabel pass.
Café, market, walking interview. Background chatter and traffic affect short responses; main turns remain codable.
Overlapping speech and shared mic. Diarization will merge some quieter voices — expect to disambiguate at coding time.
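The relabel pass mentioned above amounts to mapping diarization labels onto participant names or pseudonyms before coding. A minimal sketch, assuming the transcript has been saved as plain text with each turn on its own line as `[hh:mm:ss] Speaker N: ...`; that line format and the `relabel` helper are assumptions for illustration, not a documented export format.

```python
import re

# Optional "[hh:mm:ss] " prefix, then a generic diarization label before the colon.
_LABEL_RE = re.compile(
    r"^(\[\d{2}:\d{2}:\d{2}\] )?(Speaker \d+)(?=:)", re.MULTILINE
)


def relabel(transcript: str, names: dict[str, str]) -> str:
    """Swap diarization labels at the start of each line for study pseudonyms.

    Labels missing from `names` are left unchanged so they stay easy to spot.
    """
    def swap(match: re.Match) -> str:
        prefix = match.group(1) or ""
        label = match.group(2)
        return prefix + names.get(label, label)

    return _LABEL_RE.sub(swap, transcript)


# Usage with invented timestamps and pseudonyms.
raw = (
    "[00:00:05] Speaker 1: Thanks for coming, everyone.\n"
    "[00:00:09] Speaker 2: Happy to be here.\n"
    "[00:00:12] Speaker 3: Same."
)
print(relabel(raw, {"Speaker 1": "Moderator", "Speaker 2": "P01"}))
```

Only labels at the start of a line are replaced, so a participant who says "Speaker 2" mid-sentence is not rewritten. Turns where diarization merged two quiet voices still need a manual check against the audio.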
Common questions
30 free minutes every month. No card. Verbatim mode, 100+ languages, CAQDAS-ready DOCX, audio deleted in 24h.
Start free