TikTok transcript — transcribe TikTok video to text, free

TikTok transcription.Paste a link, get captions.

Drop a TikTok video URL. We pull the audio server-side and return timestamped text plus SRT and VTT caption files — ready to re-upload or burn in.

Drop your audio or video

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-delete in 24h

Public URL in. Captions out.

Paste any public TikTok video link. We fetch the audio track, run language detection, and stream back captions while background music keeps playing under the voice.

TikTok video URLREC 1 voice · 0:47 · vertical 9:16

auto-detected en-US44.1 kHz · music bed -18 dB

~90s

Captions · streaming94% accuracy

Okay so the secret to crispy tofu nobody tells you — press it for ten minutes, not two.

Then cornstarch, not flour. Toss it, don't dust it.

Air fryer at 400 for twelve minutes, flip halfway.

Comment 'tofu' and I'll send the full sauce recipe.

94% on creator voice-overSRT · VTT · TXT · DOCX · JSON

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

app.transcription.solutions / interview-202.mp3Export

Summary 5Transcript 1,420Speakers 2Exports

interview-202.mp347:08128 kbps CBR2 speakersen-US auto-detected

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.

Key points

Gap exists between raw recordings and shippable content — tools stop at transcript.

Show notes, social clips, blog drafts all expected by call's end, not next-day.

Current tooling fragmented across 5 apps — no single pipeline.

Conversion-rate signal flipped a buyer-segment assumption at week 3.

40% of original hypothesis survived — the shape held, mechanics rebuilt.

Action items

Speaker 1Investigate single-pipeline approach to replace 5-app stitch.

Speaker 2Mock how show-notes draft could flow from the transcript.

Speaker 2Pull conversion-rate by segment, Monday EOD.

Speaker 1Map the 5-app stitch & list which steps actually need a human.

Auto-taggedfounder interviewpost-call contenttooling fragmentationsingle pipeline

Try it on your own file — it's free

Option 01

TikTok auto-captions

Built into the TikTok editor. Toggle on, captions appear. No file you can take elsewhere.

RequiresUpload through TikTok app

Language coverage~40 languages, EN strongest

ExportNone — burned in only

Edit before publishIn-app text editor

Music handlingMisses lyrics, garbles voice over loud beds

CostFree

Best forCreators who only need captions inside TikTok and never repost to Reels or Shorts.

Option 02

Transcription.Solutions

Paste the public URL. Get a transcript file plus SRT/VTT you can drop into any editor or re-upload anywhere.

RequiresPublic TikTok URL — no login

Language coverage100+ with auto-detect

ExportSRT · VTT · DOCX · TXT · JSON

Edit before publishWeb editor, then re-export

Music handlingVoice isolation on noisy beds

Cost · per min$0.03

Best forCreators cross-posting to Reels/Shorts/YouTube, agencies repurposing client TikToks, researchers archiving trends.

Option 03

CapCut / Submagic

Styled, animated captions tuned for short-form. Locked to their editor, English-first.

RequiresApp install + paid for export

Language coverage~20 strong, others spotty

ExportMP4 with burn-in, SRT on paid

Edit before publishInside their timeline only

Music handlingEN-tuned, drops on accented voice

Cost$10–24/mo (approximate, 2026)

Best forSolo creators who want animated word-pop captions and never leave the CapCut/Submagic editor.

Pricing approximate as of May 2026. Language counts based on each vendor's published support pages.

8 things people ask about TikTok transcription.

01Do I need to download the TikTok first?+

No. Paste the public video URL (the share link from the TikTok app) and we extract the audio server-side. If the video is private or region-blocked, you'll need to download the MP4 yourself and upload it — we can't bypass TikTok's access rules.

02Will you transcribe the song lyrics or just the creator's voice?+

Just the spoken voice. Voice isolation suppresses the music bed before transcription, and trending-audio lyrics get flagged in the JSON output rather than written into the caption track. You can flip isolation off if you specifically want lyrics.

03Can I get an SRT formatted for vertical short-form video?+

Yes. The short-form caption preset breaks cues at roughly 3 words per line and 1.2 seconds per cue — the rhythm that fits the 9:16 safe zone without overlapping UI. Standard SRT (one sentence per cue) is also available.

04What about duets and stitches with two voices?+

Acoustic diarization separates the two voices and labels them Speaker 1 and Speaker 2. Accuracy drops 5-10 points when the audio tracks overlap heavily — that's the worst case in our data.

05Does it handle non-English creators?+

Yes — 100+ languages with auto-detect. Spanish, Portuguese, Indonesian, Vietnamese, and Arabic creators come back at roughly the same accuracy band as English. Code-switching (mixing two languages mid-sentence) is detected and labeled per segment.

06How long until the transcript is ready?+

Under five minutes for a standard 30-90 second TikTok, usually under two. Longer-form TikToks (3-10 minutes) finish in roughly 1/10 of real-time.

07Can I bulk-process a creator's whole feed?+

Yes, via the API or by pasting a list of URLs into the dashboard. We rate-limit the URL fetcher politely so TikTok doesn't block us — expect ~30 videos in the first batch, then steady throughput from there.

08Is this allowed under TikTok's terms?+

We only fetch public videos via their public share endpoints — the same way a browser preview does. We don't bypass private accounts or login walls. If you're transcribing someone else's content for commercial use, fair-use and platform rules are on you to check.

TikTok transcription.Paste a link, get captions.

Drop your audio or video

Paste a link, we’ll fetch the audio

Record straight from your browser

Public URL in. Captions out.

This is what loads when the job finishes.

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

TikTok auto-captions. CapCut or Submagic. Or us.

TikTok auto-captions

Transcription.Solutions

CapCut / Submagic

Three things that bite people on generic transcription tools.

What goes wrong

What to flip here

Recommended job settings for TikTok

94% on clean voice-over. Music-heavy clips drop predictably.

8 things people ask about TikTok transcription.

Paste a TikTok URL. See what comes out.