ffmpeg + Whisper
Yamahhala, ekhaya, nokuqangamba. Uyawami umsebenzi kanye nogqaji ngamunye.
Lahla i-MP4 file njengoba linjalo — sithwala inzwai ye-audio i-server-side, sabulele incwadi enezitlaka zosikhathi, bese sethula i-SRT efanele ngubuntu ku-YouTube, Vimeo, noma kuNLE yakho.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Bheka okuvela
I-MP4 iyisikhumbuzi — sifunda i-audio stream ngqo, asikho nomkulwa-nhlelo ovidiyo. Izitlaka zosikhathi zihlala zifanele nendlela yakho engena, kanjalo i-SRT ihambe kahle usemi lwangokuqala.
Kulungile, kule yunithi siyahamba ngomsebenzi wokubuyisa mali kumbuzi.
Umbuzo omkhulu ngaphambi kokuthi siqale — ingabe lokhu kusebenza kumabuyiso anqoba?
Oku kukhethekile. Amabuyiso anqoba asebenzisa isikrini esifanayo kodwa ikhosi yankinga ehlukile.
Ngiyakuqonda. Kanye umkelo wokuvuma uyasalela izigidi ezimbili za-dollars?
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Izinketho ezintathu eziyiqiniso · ukuqhathanisa ngobuqotho
Ungakhipha i-audio ngokwako bese ukulala u-Whisper. Ungawaleka i-MP4 ku-Descript noma VEED bese uhlala ngaphakathi kumhleli wawo. Noma lahla ifayela lapha bese uthola incwadi + i-SRT kabusha, ayikho inkompilo ka-editor.
Yamahhala, ekhaya, nokuqangamba. Uyawami umsebenzi kanye nogqaji ngamunye.
Lahla i-MP4. Okukhiphwa kwe-audio, ukuhlukahluka, i-SRT, isifiqelo — isiyonke.
Lalela i-MP4 kumhleli. Incwadi ivela njengenxalenye ye-timeline UI.
Intengo nezilolo zingathe amaphuzu okuthi zonke isayithi yabo.
Okuthile qukungenisa kwe-MP4
I-MP4 iyisikhumbuzi, hhayi i-codec — futhi amathuluzi we-transcription angajwayelekile amthuthukise njengebhlob elilodwa le-audio. Lapho amaphuzu evela khona.
Lahla i-MP4 futhi lokhu kulolane ngoku. Boshiyela ngamunye ngezinqubela zosebenzako.
Accuracy · real-world numbers
Ukunemba kwe-MP4 kusethelwa ngumakinasi, hhayi i-codec. Imakinasi ye-lav ku-set enomeqe idlule ikhamera engu-4K enomzwa okhethiwe ngokuqala. Amanomboro angezansi avela ku-MP4 yabaphakeli abangcwele, ahlukaniswe ngalolisile inzwai.
Ilapele noma iphalamende ye-recorder, 48 kHz AAC ngo-192+ kbps, indawo eluluniselwe. Umkhakha okhulu. Izilabel zabakhulumi zibambe kahle kuabakhulumi ababili.
Imakinasi ephezulu kwebhalakhon ekunwe kuze kune-4 izinyawo kumkhulumi. Umthwali wemithwalo kodwa okukhulumwa kucacile. Isiqunywa segumbi le-YouTube sihlala lapha.
I-OBS, I-Loom, i-Camtasia exports. Imakinasi isonse kodwa indawo ayinakugcinwa kahle, evame ukuba nokukunepuka kwi-audio yesistimu. Kakuhle kancane para i-transcript yesifundo.
Imakinasi ye-fone engaphakathi, umoya noma ukunenyakela, ibanga lishintshe isithombe. Amagama ayasebenza, ashanele i-1-2 ukusungulwa isigamu lemizuzu engumkhulumi.
Imibuzo ejwayelekile
30 amaminithini yamahhala ngenyanga. Akukho ikhadi. I-audio ikhiphwa i-server-side, izilabel zabakhulumi, i-frame-accurate i-SRT — konke akusethwa.
Qala yamahhala