Descript
Audio editor with built-in transcription. Great for editing-by-text workflows, heavier than you need if you just want a transcript.
Drop your podcast episode master — MP3, WAV, or a YouTube link. Get a speaker-labeled transcript, AI show notes with key points and tags, plus an SRT for the video cut.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ One file in, four artifacts out
Most podcasts come in as a post-production stereo MP3 with host and guest already mixed together. We split them acoustically, detect the music intro, and start the transcript at the first spoken word.
Welcome back to the show. Today I'm talking with Priya Anand about her new book on supply chains.
Thanks for having me, Jordan. It's been a wild three years since we last spoke.
So the book opens with the Suez blockage — why start there?
Because it was the moment everyone non-logistics suddenly cared about containers.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
Descript is an editor first, transcription second. Castmagic is show notes first, transcript second. We focus on the file → transcript → show notes pipeline and stay out of your editor.
Audio editor with built-in transcription. Great for editing-by-text workflows, heavier than you need if you just want a transcript.
Drop the episode master. Transcript, show notes, tags, SRT — all four in one pass. No editor, no lock-in.
Show-notes-as-a-service. Drag in the file, get a slick content pack. Transcript is more of a byproduct.
Pricing approximate as of 2026 and changes per vendor. Free tiers and add-on AI features rotate frequently.
Specific to podcasting
Tell us a few things about the episode on upload and the output stops needing a cleanup pass.
Drop an episode and these defaults flip on. Override per-job from the form.
Accuracy · real-world numbers
Podcast accuracy depends mostly on how the guest was recorded, not the host. A studio host paired with a Zoom-only guest behaves like the worst leg. Numbers below come from real customer episodes, not lab audio.
Each speaker on a separate WAV. We treat each track independently and skip diarization. Cleanest possible case.
Host left, guest right, after mastering. The most common podcast shape. Diarization is essentially free from the stereo split.
Roundtable shows or panel format mixed to mono. Similar voices may merge once or twice per hour — a 2-min cleanup pass fixes it.
Guest on AirPods through a hotel wifi call. Numbers and proper nouns suffer most. Custom vocabulary recovers most of it.
Common questions
30 free minutes every month. No card. Speaker labels, show notes, chapters, and every export included.
Start free