ffmpeg + Whisper
Mahala, ntleng, o na le math ata. O na le pipeline le bug e ka mong ho eona.
Lahla MP4 file mmala o bona — re ntsha setsi sa audio server-side, re buisa transcript e nang le nako, le re phetha SRT e buang ka botlalo YouTube, Vimeo, kapa NLE ya hao.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Bona se e busang
MP4 ke sethako — re bala setsi sa audio ka botlalo, re sa ntšhe video ka tsela e ncha. Dinako di dumelanong le diframo ho timeline ya hao ya pele, ka tsela SRT e dumelanong ha e kena pele.
Lumela, module ye re ka lebona workflow ya refund ho tloha qalolong le go ya ho pele.
Potso e foufaneng pele re simologa — na se se amanang le refund ya karolo-karolo?
E le hantle. Refund ya karolo-karolo e sebelisa screen tse ts'oanang empa ka khoele e fapaneng.
Ke e utloisitse. Le kgetlo ya tiiso e ntse e le madi a palo e 200 dollars?
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Dikgetsi tse tharo tse meta · papiso e ne nete
O ka ntsha audio leihlo le o hlale Whisper. O ka lahla MP4 ho Descript kapa VEED le o dule kahare ho editor ea bona. Kapa o ka lahla file apha le o fumane transcript + SRT, nang le ho ikuta ho editor.
Mahala, ntleng, o na le math ata. O na le pipeline le bug e ka mong ho eona.
Lahla MP4. Audio extraction, diarization, SRT, summary — ketsahalo e le nngwe.
Lokela MP4 ho editor. Transcript e hlaha e le karolo ya timeline UI.
Pricing and feature caps approximate as of 2026. Descript and VEED tier names change frequently — check their site for current limits.
Specific to MP4
MP4 ke sethako, e seng codec — le transcription tools tse ntsi di e rata e le blob ya audio e le nngwe. Eo e matla ho tswa.
Lahla MP4 le tsena di bontshwa ka default. Override per-job ho tswa ho form.
Accuracy · real-world numbers
MP4 accuracy e hlalosiwa ke mic, e seng codec. Mic ya lav ho set e hlotshwane e fapana le camera ya 4K e nang le audio e etsoang ka ho ka sa leloko. Dinomoro tsa tlase li tsoang ho MP4 tsa customer tse nete, tse hlophisitsoe ke se eleng se fumanwate ho audio.
Lapel kapa boom ho recorder, 48 kHz AAC at 192+ kbps, room e lokisitswe. Nyakiso ea holimo. Speaker labels di tlatsang ho shoot ya batho ba babeli.
Mic ya mokgosi 2-4 feet ho tswa basalapali. Room tone e le nngwe empa puo e a utloahala. YouTube creator footage e ntso e fihla apha.
OBS, Loom, Camtasia exports. Mic e atamela empa room ha e lokisitswe, hangata e na le system audio bleed. E le hantle haholo ho transcripts tsa tutorial.
Mic ya phone e etsoang, leqale kapa handling noise, hole e fapana shot le shot. Mantsoe a kgonehala, ipaakante 1-2 fixes per minute ho ditshwantshiso tse nepahetseng.
Dipotso tse tloaelehileng
30 mahala metsotsoana khoeli e ka mong. Ha ho card. Audio e ntshoa server-side, speaker labels, frame-accurate SRT — tse ka mong tse sekamoseng.
Start free