Instagram auto-captions
Built-in caption sticker on the Reel. No file you can download or edit outside the app.
Paste a public Instagram Reel link. Get a verbatim transcript, SRT and VTT subtitle files, and 100+ language detection — no download, no Instagram login, no creator account needed.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
We resolve the public Reel URL server-side, pull the audio track, and duck the music bed before recognition. Vocal-isolated ASR means trending audio doesn't eat your hook line.
Okay if you're using retinol and your skin is peeling — stop.
You're using too much. Pea-sized amount, two nights a week, that's it.
Buffer with moisturizer first. Link to the one I use is in my bio.
Save this so you stop wasting product. Follow for part two.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
Instagram bakes a caption sticker into the Reel itself. Submagic and CapCut style captions inside their editor and re-export your video. We give you the raw SRT/VTT/transcript text — burn it in your editor of choice, or repurpose it as a blog post.
Built-in caption sticker on the Reel. No file you can download or edit outside the app.
Paste a public Reel URL. Get the transcript and subtitle files. Bring your own editor.
Captions styled inside a video editor. Re-exports the Reel with burned-in text.
Pricing and feature flags accurate as of 2026. Submagic and CapCut tier names change frequently — check their site for current plans.
Specific to Instagram Reels
Reels aren't podcasts. Short, loud, music-heavy, and full of platform-specific tokens. Flip the right settings and the caption file comes back ready to drop into a timeline.
Paste a Reel URL and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
Reels are mixed loud, and trending music sits inches under the voice. The ceiling depends on how much the music bed competes with the talker. Numbers below are from real creator Reels processed in production — talking-head beauty, finance, food, and skit content.
Recorded into a mic, music added in post or absent. Cleanest case — error mostly on brand names and hashtag spellings.
Voice mixed 8-12 dB over a background loop. Vocal isolation handles it. The bulk of creator Reels land here.
Music within 3 dB of the voice. Lyrics occasionally leak into the transcript as words. Plan a quick clean-up pass.
Phone mic, ambient noise, no lavalier. Words usable, proper nouns suffer. Worst case in our Reel data.
Common questions
30 free minutes every month. No card. SRT, VTT, 100+ languages, every export included.
Start free