Start free

Blog · · 7 min read

Legal transcription deposition: what AI can and can't do

AI transcription speeds up deposition prep and review — but it isn't the certified record. Where it fits, where it doesn't, and how to use it safely.

Where AI transcription fits in a deposition workflow — and where it doesn't

AI transcription is useful for legal transcription deposition prep and review — generating a fast searchable draft with speaker labels and timestamps — but it is not a certified transcript and cannot replace a stenographer or licensed court reporter. The official record still comes from a human authorised in your jurisdiction. Use AI to locate key passages, build cite-checked outlines, and turn 6-hour recordings into 9-minute searchable drafts. Then order the certified version for the courtroom.

Why AI transcription is not the certified record

A certified deposition transcript carries a court reporter's seal, oath, and signature attesting that the transcript is a true and accurate record of the proceeding. That certification is what makes the transcript admissible and citable in motions, trial, and appeal. Software does not swear an oath.

In most US jurisdictions, the rules (e.g. FRCP 30(b)(5), state equivalents) require an officer authorised to administer oaths to record the testimony. The deponent is sworn in by that person. Even when the deposition is recorded by audio or video, a certified transcriber or court reporter produces the final transcript.

AI transcription sits outside this chain. A draft from an automated tool has no legal weight on its own. What it has is speed: a 4-hour deposition becomes a searchable text file in under 45 minutes, with each turn labelled and timestamped.

What AI transcription actually does well for depositions

Three jobs, all happening before or after the certified transcript exists:

  1. Same-day review draft. You record the deposition (audio or video, with all parties' consent and on the record). Upload the file. You have a labelled draft before you leave the building — useful for the de-brief with your client or co-counsel that evening.

  2. Searchable archive of older depositions. If you have a hard drive of MP3s from past cases, audio-to-text transcription turns them into grep-able text. Find every time a witness mentioned a specific date, product, or person across years of depositions in seconds.

  3. Cite-checking against the certified transcript. When the certified copy arrives weeks later, you can diff your working notes against it. Paragraph-level timestamps make it trivial to jump back to the audio and verify a quote.

What it does not replace: the court reporter's real-time stenography, the certified transcript's admissibility, the rough draft a reporter can deliver same-day on request, or the formal exhibit-marking workflow.

A realistic deposition prep workflow

Here is the workflow most litigation teams settle into once they stop trying to make AI replace the reporter:

Before the deposition. Pull every prior recorded statement from the deponent — earlier depositions, recorded interviews, public talks, podcast appearances. Run them through interview-style transcription with diarization so each speaker is labelled. Search for inconsistencies, prior positions, and verbatim phrases you want to confront the witness with.

During the deposition. The court reporter handles the official record. Your team's recording (if permitted and on the record) is a backup and the source for your working draft.

Same day. Upload your recording. Speaker diarization automatically separates Counsel, Witness, and Opposing Counsel into distinct labels (speaker_0, speaker_1, speaker_2) that you rename in the dashboard. Export to DOCX for your annotation pass.

Definition for clarity:

Speaker diarization is the process of segmenting an audio recording by speaker turn — identifying who spoke when — so a transcript can label each utterance with a distinct speaker, even before any names are assigned.

The week after. When the certified transcript arrives, use your AI draft as a navigation index. Page numbers in the certified version, timestamps in yours — both point at the same moments.

Try it on your audio

Start free →

30 minutes a month, no card.

Chain of custody and data handling

This is where most legal teams get stuck, reasonably. A deposition recording is privileged work product and may contain confidential client information. You cannot upload it to a service that trains on user data or retains files indefinitely.

The relevant facts for Transcription.Solutions:

  • Source audio is permanently deleted from our infrastructure within 24 hours of job completion. Not "after 30 days", not "when you ask us to" — automatically, within 24 hours.
  • Transcripts remain in your account until you delete them. You control retention.
  • We do not train models on your data. Your recordings and transcripts are not used to improve the ASR system or any downstream model.
  • Exports cover the standard legal review chain: DOCX (for redlining), TXT (for archival), JSON (for e-discovery ingestion), MD (for note-taking systems), PDF (for client distribution), plus SRT and VTT if the deposition was video-recorded. Same seven formats on every plan.
  • REST API with signed webhooks if your firm's case management system needs programmatic ingestion — see the API reference.

For matters under protective order or with heightened confidentiality requirements, document the workflow in your engagement letter and confirm the third-party tool clause covers automated transcription services. This is general guidance, not legal advice — verify with your jurisdiction's ethics opinions on cloud-based legal tech before adopting in any client matter.

Accuracy — what to expect on real deposition audio

Depositions are a near-best-case scenario for AI transcription: quiet conference rooms, lapel or table mics, professional speakers who don't talk over each other (much). On clean conference-room recordings at 128 kbps or higher, expect ~92% word accuracy, often higher. That is the plateau for AI ASR on real-world audio.

Where accuracy drops:

  • Phone or video-call depositions at 8 kHz (telephony bandwidth): ~82%. Order a higher-bitrate recording from the platform if possible.
  • Heavy technical vocabulary (patent disputes, medical malpractice): proper nouns and domain terms will need a manual pass. The transcript is a draft, not a final.
  • Heavy cross-talk during objections: any system, human or AI, struggles when three lawyers speak simultaneously. Diarization may collapse overlapping turns.

A 92% transcript is plenty for navigation and review. It is not plenty for citation in a brief — that's what the certified version is for.

Choosing between AI draft and certified transcript

NeedUse
Same-day review of what was saidAI draft
Searchable archive of past depositionsAI draft
Cite-checking witness inconsistenciesAI draft + certified
Quote in a motion or briefCertified only
Read into the record at trialCertified only
Exhibit attachment for summary judgmentCertified only
Internal client debrief notesAI draft
Witness prep with prior testimonyAI draft

The AI draft and the certified transcript are not competitors. They serve different points in the same workflow.

FAQ

Is an AI-generated transcript admissible in court?

Generally no, not on its own. Admissibility of a deposition transcript depends on certification by an authorised officer (typically a court reporter or notary). An automated transcript has no oath behind it. It can support work product, witness prep, and internal review, but the document filed with the court or read into the record needs to be the certified version. Check your jurisdiction's specific rules. General guidance, not legal advice.

Can I record a deposition myself and skip the court reporter?

In almost all US jurisdictions, no — at least not for the official record. FRCP 30(b)(5) and state equivalents require the deposition to be conducted before an officer authorised to administer oaths. You can typically make a backup audio or video recording (with notice and consent on the record), and that backup is what you'd run through AI transcription for your own working draft. State-specific consent and notice rules vary (California, Illinois, and Florida have additional requirements) — confirm against your local rules before recording.

How fast is AI transcription for a multi-hour deposition?

Approximately 6× faster than realtime — a 60-minute file completes in 9–11 minutes. A 4-hour deposition is typically ready in 35–45 minutes. The system uses parallel chunking, so doubling the file length does not double the wait. On the Pro plan, the maximum single-file duration is 10 hours, which covers all but the longest day-long depositions.

What about privilege and confidentiality?

Source audio is deleted from our infrastructure within 24 hours of completion. Transcripts stay in your account until you delete them. We do not train models on user data. For matters under protective order, document the use of automated transcription in your workflow and confirm it satisfies your firm's outside vendor policies. The REST API supports automated deletion via your case management system.

Can it tell the difference between Counsel and Witness automatically?

Yes — diarization labels each speaker turn as speaker_0, speaker_1, speaker_2, and so on. The labels are anonymous by default; you rename them ("Counsel Q.", "Witness", "Opposing Counsel") with one click in the dashboard. On stereo recordings where counsel and witness are on separate channels, speaker assignment is by channel split — effectively 100% accurate.

What file formats can I upload?

MP3, WAV, M4A, AAC, OGG, OPUS, FLAC, and WEBM for audio; MP4, MOV, MKV, AVI, and WMV for video. Most deposition rooms record to MP4 (video) or WAV/MP3 (audio backup) — all work directly without conversion. Up to 2 GB per file on Pro, 5 GB on Business.

What if the deposition is in a language other than English?

99 languages are auto-detected from the first 30 seconds of audio, with one price across all of them. Diarization works the same way. For a multilingual deposition with translation by an interpreter, the transcript will reflect what was actually spoken on the record — including both the original language and the interpreted English. Review carefully.

How does pricing compare to ordering rough drafts from the court reporter?

Court reporter rough drafts typically run $2.00–$3.50 per page as a surcharge on top of the final certified transcript (rates vary by jurisdiction and reporting firm; same-day or next-day expedites add another 50–100%). For a 4-hour deposition producing ~250 pages, that's roughly $500–$875 in rough-draft fees on top of the certified rate. Our Pro plan is $19/month for 600 minutes (10 hours) of transcription. A typical 4-hour deposition uses 240 minutes of your monthly quota. Overage on Pro is $0.04/minute on Pro and $0.02/minute on Business. The AI draft is a complement to the rough or certified transcript, not a replacement for either.

Related reading