Faasalalauina YouTube. Sili atu i automatic captions.Mas itiiti i le togi o tagata.

Fe'au le YouTube video URL. Maua se 95%+ sa'o faamatalaga faʻatasi ai speaker labels, chapter timestamps, ma SRT/VTT captions e mafai ona toe uploadina — leai le Premium, leai le Chrome extension.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Matamata i le mea e sau ai

URL i ai. Captions ma le maʻa faamatalaga e sau ai.

Fe'au le youtu.be poʻo youtube.com link. Tete'a e matou, toto'e le highest-bitrate audio track i le server-side, taumafai diarization, ma toe foʻi ai se timestamped faamatalaga faʻatasi SRT/VTT o le mafai ona uploadina e fai ma community captions.

youtu.be/dQw4w9WgXcQREC Interview · speakers 2 · 28:14
auto-detected en-USopus 160 kbps · 48 kHz
~90s
Faamatalaga · streaming96% sa'o
S1

Ua tula'i le channel i 100k subs i le valu masina — o le a le upu na toe folau ai?

S2

Ioe, le posting Shorts i aso uma mo le ono itula — na muamua ai le long-form watch time.

S1

Ma le thumbnail rework — na fuafuaina ea i le YouTube Studio?

S2

Ioe, le Test & Compare tool fou. E lua nai le tolu winners na leai se fofoga i luga.

96% i le talking-head audioSRT · VTT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Tolu ava moni · faatulagaga tonu

YouTube automatic captions. Rev tagata. Poʻo i matou.

Lua e YouTube automatic captions i le video uma leai se togi — na'o le eseʻese ale sa'o ma leai le speaker labels. Faʻatau Rev i le faamatalaga o tagata i le $1.50/min. Nofolau e matou i le aoao: AI i le 95%+, speaker labels, tolu minute turnaround.

Option 01

YouTube automatic captions

Leai se togi, faʻataʻia i le video fa'apublic uma. E leai le punctuation pass, e leai le speaker labels.

TogiLeai se togi
Sa'o~80% i le leo maʻa
Speaker labelsLeai
PunctuationItiiti, e leai le paragraphs
TusitusiCopy-paste mai le transcript panel
Galue i lePublic videos na o le mea lea
Best forVave matamata i le video e te le pele ai pe leai le talitonuga i le sa'o.
Option 02

Transcription.Solutions

Fe'au le URL. Tolu minute mulimuli: maʻa faamatalaga, SRT/VTT, AI summary faʻatasi ai le chapter links.

Togi · i le min$0.03 i le Pro
Sa'o95%+ i le talking-head
Speaker labelsIoe (Pro ma Business)
PunctuationFa'atasi, faʻatasi ai le paragraphs
TusitusiSRT · VTT · DOCX · TXT · JSON
Galue i lePublic + unlisted URLs
Best forCreators toe uploadina captions, podcasters fai maloʻa video i blog, researchers toto'e fesili mai interviews.
Option 03

Rev tagata transcription

O le tagata e tusi ai. Sa'o tele tele, faʻaleʻi turnaround, togi i le minuta.

Togi · i le min$1.50
Sa'o99%+ mautinoa
Speaker labelsIoe
PunctuationFa'atasi, editorial-grade
Turnaround12-24 itula te'a
Galue i lePea fail u'u
Best forCourt-admissible content, broadcast subtitles, poʻo interviews na o le tasi upu na leʻo e petʻai ai le quote.

Pricing sa'o pei o le 2026. Rev rates e faiʻai le latou standard service tier; AI-only tiers mai competitors e le o faatulagaina i iinei.

Fa'apitoa i YouTube

Tolu mea na te popoia i matou i le generic transcription tools.

O le YouTube audio e iai pea mea e eseʻese na generic transcribers e le gaoiina. E soʻa le tonu settings ma toe sau le faamatalaga o le mafai ona re-upload e fai ma captions.

O le mea na tupito ai

  1. 1O le music beds e faʻapolopo ai le recognizer. Intro stings ma le background music ua faasalalauina e fai ma garbled upu. O le generic AI e le iloa e lamuina.
  2. 2O le SRT line lengths e le o le measina i YouTube caption rules. O le subtitles e tele i le safe area i luga mobile, poʻo le tutasi mid-word ona e le o le chunker na fuafuaina i le video.
  3. 3O le channel-specific names (sponsor brands, game titles, guest handles e pei o @MKBHD) ua tusi mai phonetically. O le tasi tipitapi ma ua le mafai ona sailia le quote.

O le mea e soʻa i iinei

  1. 1Soʻu le Music-aware segmentation i le job form. Fa'atagua e matou music regions faʻatasi le `[music]` ae o le tatau lea o le falo lyrics, ma toe amata le transcription maʻa pe a toe sau le leo.
  2. 2Pilia le YouTube-safe SRT e fai mo le tusitusi. O le lines e muamua i le 42 characters, tele 2 lines i le cue, ma o le breaks na saʻo i le phrase boundaries — tutusi le file tonu i YouTube Studio.
  3. 3Tusi le channel vocabulary (sponsor names, recurring guests, game titles) i le Custom vocabulary. Avatu e matou i le recognizer e fai ma hint koia na o le brand spellings e le soʻo.

Fautuaga job settings mo YouTube

Fe'au le YouTube URL ma o nei e soʻa i lalo i le faʻamanuia. Suia per-job mai le form.

Source
URL paste · auto-resolve youtu.be
Diarization
Acoustic · 1-4 speakers
Music handling
Tag [music], skip lyrics
Filler words
Removed by default
Summary
Chapter timestamps + key moments
Export
YouTube-safe SRT · VTT · DOCX

Accuracy · real-world numbers

95%+ i le talking-head videos. Music ma game audio e faʻaiti ai.

YouTube content e eseʻese tele — o le podcast studio ma le Fortnite stream e le o le fa'afitauli e tasi. Lapel-mic talking-head o le toe lelei, ma le background music ma le overlapping game audio e sofai ai le sa'o i le vave. O numera i lalo mai le YouTube URLs tupu moni a clients i production.

97%
Studio podcast · per-guest mic

Joe Rogan-style setup: o le tasi tagata i le separate boom mic, laumei lite, leai le music bed. Diarization e faʻalilolilo pe a le ta'u leo i le tasi.

95%
Single talking-head · lapel/USB mic

Standard tutorial poʻo le video essay. O le tasi speaker, leo i lalo, intro music ducked i lalo leo. O le tele o YouTube uploads e tula'i iinei.

89%
Vlog faʻatasi B-roll · leo i fafo

Matagi, tamalie, ambient music i lalo voiceover. O upu e galue pea; ta'u le occasional misses i proper nouns ma brand names.

84%
Gaming stream · leo i luga game audio

Game SFX, music, ma chat-reading i le tele tele volume. O leo o le streamer te maʻa; teammates i luga Discord ua sopoia le vave. O le toe faʻai iti i matou data.

Fesili masani

8 mea e fesili ai tagata i le YouTube transcription.

01E na o le tusi URL, poʻo download e te le video muamua?+
Na o le tusi URL. E tatanua e matou youtube.com/watch, youtu.be short links, ma unlisted video URLs. Tete'a e matou i le server-side, toto'e le audio track na o le mea lea (e le le video), ma toe saua le transcription — te'a i le 10 seconds o le tusi.
02E galue ea i le private poʻo unlisted videos?+
Unlisted ioe, private leai. O le unlisted URLs e fa'atulagaina i le publicate pe o iai le link, koia na mafai ai e matou maua. O le private videos e mana'o tele a sini i le Google account — e le mafai e matou taofia oe. Download le MP4 mai YouTube Studio muamua, ona upload le file.
03Aisea o le transcript a matou e sili atu nai lo YouTube auto-captions?+
O le YouTube auto-captions e faʻatau i le streaming model fuafuaina mo le togi-at-scale i luga le billion o videos. Tatu e matou i le model tele faʻatasi ai le full-context decoding, custom vocabulary, ma le soʻo diarization pass. Iʻuga: ~95% vs ~80%, faʻatasi ai speaker labels ma le regular punctuation.
04E mafai ona upload le SRT toe i YouTube e fai ma community captions?+
Ioe. Tusitusi e fai YouTube-safe SRT, bukluka YouTube Studio → Subtitles → Add → Upload file. O le line lengths ma le timing a matou e fa'asalalau i YouTube display rules, koia na le overflow cues i le mobile poʻo le tutasi mid-word.
05O le copyright — e soifua ea le faasalalauina le video a se tasi?+
O le transcribing mo le personal use, research, journalism, poʻo le commentary e fa'asalalau i le fair use i le US. Re-publishing le full transcript fa'amea e muamua. E le o le audio poʻo le video e fa'atagua e matou, o le text e avatu ai — o le mea e te faia ai o oe le tanu. E le legal advice.
06E mafai ea koe gaoiina le long videos e pei o 4-hour podcast episodes?+
Ioe. O le hard cap a matou e 8 hours mo le file. O le 4-hour Lex Fridman episode e faasalalauina i le 8-12 minutes wall-clock ma o le tala mai $7.20 i le Pro pricing. Speaker diarization e fa'asalalau i le aofa atoa.
07E gaoiina ea koe non-English YouTube videos?+
Ioe — 99 languages auto-detected. O le Spanish, Hindi, Portuguese, ma Japanese e tula'i uma i le 2-3 points o le English sa'o i le leo maʻa. Code-switching (English + Spanish i le sentense e tasi) e galue ae faʻaiti i le ~5 points.
08E mafai ea maua le chapter timestamps e pei o YouTube auto-chapters?+
Ioe. O le AI summary e aofia ai le chapter-style timestamps i le topic transitions faʻatasi ai le key-moment links. Paste faʻo i le video description e fai `00:00 Intro / 03:42 Setup / …` — YouTube e fai atu ai e fai ma clickable chapters faʻapenei.

Fe'au le YouTube URL. Matamata i le mea e sau ai.

30 free minutes i le masina uma. E leai le card. Speaker labels, YouTube-safe SRT, AI summary faʻatasi ai le chapter timestamps — ua aofia uma.

Amata se leai se togi