TikTok auto-captions
Built into the TikTok editor. Toggle on, captions appear. No file you can take elsewhere.
Drop a TikTok video URL. We pull the audio server-side and return timestamped text plus SRT and VTT caption files — ready to re-upload or burn in.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Watch what comes out
Paste any public TikTok video link. We fetch the audio track, run language detection, and stream back captions while background music keeps playing under the voice.
Okay so the secret to crispy tofu nobody tells you — press it for ten minutes, not two.
Then cornstarch, not flour. Toss it, don't dust it.
Air fryer at 400 for twelve minutes, flip halfway.
Comment 'tofu' and I'll send the full sauce recipe.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · honest comparison
TikTok ships auto-captions in the editor. CapCut and Submagic add styled, animated captions for re-upload. We give you the raw transcript plus clean SRT/VTT — bring your own editor.
Built into the TikTok editor. Toggle on, captions appear. No file you can take elsewhere.
Paste the public URL. Get a transcript file plus SRT/VTT you can drop into any editor or re-upload anywhere.
Styled, animated captions tuned for short-form. Locked to their editor, English-first.
Pricing approximate as of May 2026. Language counts based on each vendor's published support pages.
Specific to TikTok
TikTok audio isn't podcast audio. These are the differences worth flipping before you queue the job.
Paste a TikTok URL and these flip on by default. Override per-job from the form.
Accuracy · real-world numbers
The ceiling is set by how loud the music bed is and how fast the creator talks. Voice-over recorded separately and dropped over a quiet bed is the best case; lip-sync trends and duets are the worst. Numbers below come from real TikTok URLs run through our pipeline.
Creator recorded on mic, music sits 15-20 dB below voice. Talking-head educational and recipe content lands here.
Selfie-style talking head, no backing track. Phone mic and room reverb cost a few points versus voice-over.
Voice and music within 6 dB. Fast hooks and brand names take hits — expect a 1-minute clean-up pass.
Two audio tracks overlapping or song lyrics being mouthed. We transcribe what's spoken; song lyrics are flagged, not retyped.
Common questions
30 free minutes every month. No card. SRT, VTT, 100+ languages, all exports included.
Start free