Podcast transcription for podcasters — podcast transcript, show notes, SRT in one pass

Transcription for podcasters.Show notes and SRT in one pass.

Drop your podcast episode master — MP3, WAV, or a YouTube link. Get a speaker-labeled transcript, AI show notes with key points and tags, plus an SRT for the video cut.

Drop your audio or video

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-delete in 24h

Episode master in. Transcript, show notes, SRT, tags out.

Most podcasts come in as a post-production stereo MP3 with host and guest already mixed together. We split them acoustically, detect the music intro, and start the transcript at the first spoken word.

Episode 142 masterREC 2 speakers · 48:21 · MP3 192 kbps

auto-detected en-US44.1 kHz stereo · post-mix

~90s

Transcript · streaming95% accuracy

Welcome back to the show. Today I'm talking with Priya Anand about her new book on supply chains.

Thanks for having me, Jordan. It's been a wild three years since we last spoke.

So the book opens with the Suez blockage — why start there?

Because it was the moment everyone non-logistics suddenly cared about containers.

95% on stereo post-mixSRT · DOCX · TXT · Show notes MD

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

app.transcription.solutions / interview-202.mp3Export

Summary 5Transcript 1,420Speakers 2Exports

interview-202.mp347:08128 kbps CBR2 speakersen-US auto-detected

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.

Key points

Gap exists between raw recordings and shippable content — tools stop at transcript.

Show notes, social clips, blog drafts all expected by call's end, not next-day.

Current tooling fragmented across 5 apps — no single pipeline.

Conversion-rate signal flipped a buyer-segment assumption at week 3.

40% of original hypothesis survived — the shape held, mechanics rebuilt.

Action items

Speaker 1Investigate single-pipeline approach to replace 5-app stitch.

Speaker 2Mock how show-notes draft could flow from the transcript.

Speaker 2Pull conversion-rate by segment, Monday EOD.

Speaker 1Map the 5-app stitch & list which steps actually need a human.

Auto-taggedfounder interviewpost-call contenttooling fragmentationsingle pipeline

Try it on your own file — it's free

Option 01

Descript

Audio editor with built-in transcription. Great for editing-by-text workflows, heavier than you need if you just want a transcript.

Primary useDAW + word-edit

Speaker diarizationAcoustic, EN-strong

Show notesUnderlord AI add-on

ExportSRT · TXT · project file

Free tier1 hr/mo transcription

Cost$24/user/mo (Creator)

Best forSolo podcasters who edit episodes by deleting words from a transcript and want one app for everything.

Option 02

Transcription.Solutions

Drop the episode master. Transcript, show notes, tags, SRT — all four in one pass. No editor, no lock-in.

Primary useTranscript + show notes

Speaker diarizationAcoustic + per-track upload

Show notesFree on every plan

ExportSRT · VTT · DOCX · MD · JSON

Free tier30 min/mo, no card

Cost · per min$0.03

Best forShows that already have an editor (Logic, Hindenburg, Reaper) and just want clean text + notes after the episode is mixed.

Option 03

Castmagic

Show-notes-as-a-service. Drag in the file, get a slick content pack. Transcript is more of a byproduct.

Primary useContent repurposing

Speaker diarizationYes, EN-tuned

Show notesMany templates, paid only

ExportSRT · TXT · template MD

Free tierTrial only

Cost~$23+/mo (Starter)

Best forMarketing-heavy shows that need 12 social posts, 4 newsletter drafts, and a LinkedIn carousel per episode.

Pricing approximate as of 2026 and changes per vendor. Free tiers and add-on AI features rotate frequently.

8 things people ask about podcast transcription.

01Can I just paste my YouTube or SoundCloud link?+

Yes. Paste a public YouTube URL or a hosted episode link (SoundCloud, Buzzsprout, Transistor, Libsyn direct MP3) and we pull the audio on our side. For private feeds, download the file and upload it.

02Will the music intro be transcribed as 'la la la' nonsense?+

Not if Skip music intro/outro is on (it is by default). We detect non-speech audio and start the transcript at the first spoken word. Timestamps in the SRT shift to match so YouTube captions still sync.

03What's in the show notes file exactly?+

A 2-4 sentence episode summary, 3-7 key points as a bulleted list, action items if any were mentioned, and 3-8 topic tags. Rendered as markdown so you can paste straight into WordPress, Ghost, Substack, or your podcast host's episode page.

04Can you generate chapter markers for Apple Podcasts and Spotify?+

Yes — chapters are generated from the key points with timestamps. Export as a separate chapters.txt or embed in the WAV/M4A. Note that Spotify only honors chapters on Anchor-hosted shows, so the txt file is your fallback.

05I have per-track files from Riverside / SquadCast — should I upload those?+

Yes, please do. Upload each speaker's WAV separately and tag them with names. We transcribe each track independently and merge by timestamp. Accuracy lands around 97% on this setup — the cleanest case we see.

06Can it flag sponsor reads or ad breaks?+

Not automatically yet — that's on the roadmap. For now, drop a marker in your edit (a brief silence or chime) and we'll surface it as a timestamp in the transcript. You can also tag ad segments by paste-finding the sponsor brand name afterward.

07How long can the episode be?+

Up to 6 hours per file in one upload. Most shows run 30-90 minutes, which finishes in 4-8 minutes wall-clock. For a 3-hour interview episode, expect roughly 12-15 minutes from upload to all four artifacts ready.

08Will the SRT replace YouTube's auto-captions cleanly?+

Yes. The SRT is line-broken at ~42 chars with proper punctuation and speaker prefixes optional. Upload it in YouTube Studio → Subtitles → Add language → SRT. It overrides the auto-generated caption track entirely.

Transcription for podcasters.Show notes and SRT in one pass.

Drop your audio or video

Paste a link, we’ll fetch the audio

Record straight from your browser

Episode master in. Transcript, show notes, SRT, tags out.

This is what loads when the job finishes.

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

Descript. Castmagic. Or us.

Descript

Transcription.Solutions

Castmagic

Three things that bite podcasters on generic transcription tools.

What goes wrong

What to flip here

Recommended job settings for podcasts

97% on studio-mic episodes. Holds up on remote-guest calls too.

8 things people ask about podcast transcription.

Drop your episode. Get the transcript, notes, and SRT.