Rev human verbatim
A human types it. High accuracy, but 24-hour turnaround and the price scales linearly with hours.
Drop a focus group recording with 6, 8, even 10 voices. Get a verbatim transcript with each participant labelled, cross-talk tagged, and a DOCX that loads straight into NVivo.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
Focus groups are the hardest diarization case in our queue — similar demographics, similar voices, frequent cross-talk overlap. We tag the overlap inline instead of dropping it, then you rename Speaker 3 → 'Participant_F2' once and it propagates.
So when you first opened the packaging — walk me through what you noticed.
Honestly? The first thing was the smell. Like a hospital, kind of clinical —
Yeah, same. I thought it was supposed to be the lavender one.
Right, and the label says lavender but it really doesn't —
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
Researchers usually pick between paying a human transcriber (slow, accurate, expensive) or running the file through a generic AI tool that wasn't built for 8-voice rooms. We sit in between — AI speed, diarization tuned for research recordings, and a DOCX that drops into NVivo without surgery.
A human types it. High accuracy, but 24-hour turnaround and the price scales linearly with hours.
Diarization tuned for 6-10 voices, cross-talk tagged inline, DOCX export sized for NVivo, ATLAS.ti, and Dedoose.
Generic AI built for meetings. Decent on 2-3 speakers, falls apart past 5 — and exports don't anticipate QDA software.
Pricing accurate as of May 2026. Accuracy ranges come from our internal sample of customer focus group files, not synthetic benchmarks.
Specific to focus groups
Flip the right settings up front and the transcript drops into NVivo without a cleanup weekend.
Drop a focus group file with the 'research' template and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
Focus group accuracy is bottlenecked by microphone topology, not the model. A lavalier on every participant gives us clean per-speaker channels — diarization becomes trivial. One boundary mic on a conference table with 8 voices is the hard case. Numbers below come from real research recordings in our pipeline.
Each participant on their own track, mixed to multitrack WAV. Diarization skipped — text-only error. Best case for dissertation-grade work.
Boundary mic centred on the table, moderate room treatment. Voices distinguishable, occasional confusion between same-gender participants of similar age.
Cross-talk frequent, similar voices merge under acoustic diarization. Expect a 10-minute rename and merge pass on the speaker chips before analysis.
Compressed mono mix, no per-channel split available. Words still usable for thematic coding, but disfluency-level verbatim claims weaken here.
Common questions
30 free minutes every month. No card. Speaker labels, cross-talk tagging, QDA-ready DOCX export included on every plan.
Start free