Instagram Reel transcription.Paste a URL, get captions.

Paste a public Instagram Reel link. Get a verbatim transcript, SRT and VTT subtitle files, and 100+ language detection — no download, no Instagram login, no creator account needed.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Watch what comes out

Reel URL in. Captions out.

We resolve the public Reel URL server-side, pull the audio track, and duck the music bed before recognition. Vocal-isolated ASR means trending audio doesn't eat your hook line.

Instagram Reel URLREC 1 voice · 0:47
auto-detected en-US44.1 kHz · music bed -14 LUFS
~90s
Captions · streaming94% accuracy
S1

Okay if you're using retinol and your skin is peeling — stop.

S1

You're using too much. Pea-sized amount, two nights a week, that's it.

S1

Buffer with moisturizer first. Link to the one I use is in my bio.

S1

Save this so you stop wasting product. Follow for part two.

94% with music bed onSRT · VTT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Three real options · honest comparison

Instagram auto-captions. Submagic or CapCut. Or us.

Instagram bakes a caption sticker into the Reel itself. Submagic and CapCut style captions inside their editor and re-export your video. We give you the raw SRT/VTT/transcript text — burn it in your editor of choice, or repurpose it as a blog post.

Option 01

Instagram auto-captions

Built-in caption sticker on the Reel. No file you can download or edit outside the app.

RequiresPosting from Instagram app
Languages~17 supported
ExportNone — sticker only
Edit transcriptTap-edit, single line
Repurpose to TikTok/ShortsRe-do per platform
CostFree
Best forCreators who only post inside Instagram and never need the text outside the app.
Option 02

Transcription.Solutions

Paste a public Reel URL. Get the transcript and subtitle files. Bring your own editor.

RequiresPublic Reel URL
Languages100+, auto-detected
ExportSRT · VTT · DOCX · TXT · JSON
Edit transcriptWord-level editor
Repurpose to TikTok/ShortsOne SRT, every platform
Cost · per min$0.03
Best forCreators who post to Reels, TikTok and Shorts; agencies captioning client content; anyone repurposing Reels into blog posts or newsletters.
Option 03

Submagic / CapCut

Captions styled inside a video editor. Re-exports the Reel with burned-in text.

RequiresUpload the MP4
LanguagesSubmagic ~50, CapCut ~30
ExportMP4 with burn-in; SRT on paid
Edit transcriptInside their editor only
Repurpose to TikTok/ShortsRe-render per aspect ratio
Cost$10–24/mo (Submagic)
Best forSolo creators who want animated word-by-word captions baked into the video and don't need the raw text elsewhere.

Pricing and feature flags accurate as of 2026. Submagic and CapCut tier names change frequently — check their site for current plans.

Specific to Instagram Reels

Three things that bite people on generic transcription tools.

Reels aren't podcasts. Short, loud, music-heavy, and full of platform-specific tokens. Flip the right settings and the caption file comes back ready to drop into a timeline.

What goes wrong

  1. 1Trending audio bleeds into the transcript. Generic ASR transcribes lyrics as if the singer were a speaker — your hook gets buried under a chorus line.
  2. 2@-handles and #hashtags become word salad. "At sephora" instead of @sephora, "hashtag get ready with me" instead of #grwm.
  3. 3Caption timing breaks the safe zone. Default 42-character lines wrap onto 3 lines on a 9:16 frame and collide with the Instagram UI chrome at the bottom.

What to flip here

  1. 1Turn on Music suppression on the job form. We separate vocals from the bed before recognition, so lyrics don't sneak into your transcript.
  2. 2Paste your handles and recurring hashtags into Custom vocabulary. We keep them as tokens — @sephora stays @sephora, #grwm stays #grwm.
  3. 3Set Caption style: Reel — 2 lines max, 32 characters per line, top-aligned. Drops into CapCut, Premiere or DaVinci without re-flowing.

Recommended job settings for Reels

Paste a Reel URL and these flip on by default. Override per-job from the form.

Music suppression
On (vocal isolation)
Speaker model
Single-voice creator (override for skits)
Language
Auto-detect · 100+ supported
Filler words
Kept (creator pacing)
Caption style
Reel · 2 lines · 32 char max
Export
SRT · VTT · TXT · JSON

Accuracy · real-world numbers

94% with music bed on. Drops on loud trending audio.

Reels are mixed loud, and trending music sits inches under the voice. The ceiling depends on how much the music bed competes with the talker. Numbers below are from real creator Reels processed in production — talking-head beauty, finance, food, and skit content.

96%
Studio voiceover, no music

Recorded into a mic, music added in post or absent. Cleanest case — error mostly on brand names and hashtag spellings.

94%
Talking head + light music bed

Voice mixed 8-12 dB over a background loop. Vocal isolation handles it. The bulk of creator Reels land here.

85%
Loud trending audio + voiceover

Music within 3 dB of the voice. Lyrics occasionally leak into the transcript as words. Plan a quick clean-up pass.

78%
Field audio (street, café, wind)

Phone mic, ambient noise, no lavalier. Words usable, proper nouns suffer. Worst case in our Reel data.

Common questions

8 things people ask about Reel transcription.

01Do I need to download the Reel first?+
No. Paste the public Reel URL (instagram.com/reel/...) and we resolve the audio server-side. No Instagram login, no MP4 export step, no browser extension.
02Does it work on private accounts or close-friends Reels?+
No. We only resolve publicly viewable Reels — same rule Instagram applies to anyone not logged in. For private content, download the Reel from your own account and upload the file directly.
03Can you burn the captions into the video?+
Not yet. We return SRT, VTT and a word-level transcript. Drop the SRT into CapCut, Premiere, DaVinci or Descript to burn it in. Burn-in is on the roadmap but isn't shipping today.
04How does it handle background music and trending audio?+
Music suppression is on by default for Reels — we run vocal isolation before ASR, which keeps lyrics from leaking into your transcript. On loud trending audio (music within 3 dB of voice) accuracy still drops to around 85%.
05What about emojis, hashtags and @-handles?+
Spoken hashtags and handles come through as #tag and @handle if you add them to Custom vocabulary on the job form. Emojis are not inserted — they're visual, not spoken — though we leave space for them in the SRT if you want to add them manually.
06Does it work on Reels longer than 90 seconds?+
Yes. Reels can run up to 3 minutes (15 minutes for some creator accounts). We process whatever the URL points to. Cost is per audio minute, so a 3-minute Reel runs about $0.09.
07Can I get the transcript translated into another language?+
Yes. After transcription, run the job through translation — we support 100+ language pairs. You get back a translated SRT with timing preserved, ready to caption international audiences.
08Will it pick up on-screen text that's already burned into the Reel?+
No. We transcribe audio only. If the creator burned in text overlays without voicing them, those won't appear in the transcript. OCR for on-screen text is a separate pipeline we don't ship yet.

Paste a Reel URL. See the captions.

30 free minutes every month. No card. SRT, VTT, 100+ languages, every export included.

Start free