Local / open-source Whisper
Free if you have a GPU and an hour or so to spare. No speaker diarization by default.
Drop an MP3 file at any bitrate from 64 to 320 kbps. Get a timestamped transcript with speaker labels in 99 languages: no file conversion, no re-encoding, no waiting in a queue.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ See what comes out
We read the MP3 stream directly: VBR, CBR, joint stereo, any encoder (LAME, Fraunhofer, FFmpeg). If the file is true stereo with a different speaker on each channel, we use that to separate the voices. Mono recordings fall back to acoustic clustering.
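The channel logic described above can be sketched roughly as follows. This is a minimal illustration, not the service's actual implementation: the function names, the correlation heuristic, and the 0.9 threshold are all assumptions introduced here. It treats a stereo file whose two channels are nearly identical as effectively mono.

```python
import math
from typing import Sequence


def channel_correlation(left: Sequence[float], right: Sequence[float]) -> float:
    """Pearson correlation between two channels (hypothetical helper)."""
    n = min(len(left), len(right))
    if n == 0:
        return 1.0  # nothing to compare, treat as mono
    mean_l = sum(left[:n]) / n
    mean_r = sum(right[:n]) / n
    cov = sum((l - mean_l) * (r - mean_r) for l, r in zip(left, right))
    var_l = sum((l - mean_l) ** 2 for l in left[:n])
    var_r = sum((r - mean_r) ** 2 for r in right[:n])
    if var_l == 0 or var_r == 0:
        return 1.0  # one channel is constant (silent), treat as mono
    return cov / math.sqrt(var_l * var_r)


def pick_diarization_strategy(
    channels: Sequence[Sequence[float]], threshold: float = 0.9
) -> str:
    """Choose per-channel separation or acoustic clustering.

    The threshold is an assumed value, chosen only for illustration.
    """
    if len(channels) < 2:
        return "acoustic-clustering"  # mono input
    if abs(channel_correlation(channels[0], channels[1])) >= threshold:
        return "acoustic-clustering"  # both channels carry the same signal
    return "per-channel"  # true stereo, one speaker per channel


# Two toy channels that never overlap, standing in for two speakers.
speaker_a = [0.0, 1.0, 0.0, -1.0] * 10
speaker_b = [1.0, 0.0, -1.0, 0.0] * 10
print(pick_diarization_strategy([speaker_a, speaker_b]))
print(pick_diarization_strategy([speaker_a, speaker_a]))
```

A real pipeline would work on decoded PCM frames rather than toy lists, and would likely look at the whole recording in windows instead of one global correlation.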
So not every one of them has been entered yet?
It's better since Mkhehlwane 2019, but the record-keeping before that is disappointing.
And the missing records, were they all over the place?
There's a paper ledger from '78, but part of it is water-damaged.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three honest options · an honest comparison
You can run Whisper on your own laptop if you're comfortable doing the engineering work. Otter and Sonix let you upload MP3s inside their paid dashboards. We take the file, return the transcript, and don't make you live inside a UI.
Free if you have a GPU and an hour or so to spare. No speaker diarization by default.
Drop an MP3. Get a speaker-labeled transcript in roughly real time × 0.025.
Polished dashboard, monthly subscription tiers, English-first. File upload feels like a bolted-on feature.
Pricing and feature availability are accurate as of May 2026. Whisper performance depends on model size and hardware.
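To make the "real time × 0.025" figure concrete, here is a small sketch of the arithmetic. It assumes turnaround scales linearly with audio length and ignores upload time; the function name is hypothetical and only the 0.025 factor comes from this page.

```python
def estimated_processing_seconds(
    audio_seconds: float, realtime_factor: float = 0.025
) -> float:
    """Estimate turnaround as audio duration times a real-time factor."""
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    return audio_seconds * realtime_factor


# A 60-minute recording is 3600 seconds of audio.
one_hour = estimated_processing_seconds(60 * 60)
print(f"{one_hour:.0f} s")  # 90 s
```

Under that assumption, a one-hour MP3 comes back in about a minute and a half.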
MP3-specific
MP3 is a codec, not a recording method, which means the failure modes come from the encoder, not from the voice.
The defaults are right for roughly 80% of MP3s. Adjust them per job where needed.
Accuracy · real-world numbers
MP3 accuracy is determined by what the encoder threw away, not by the format itself. Reliable compression above ~96 kbps preserves the human voice very well; below 64 kbps, consonants and the sharpness of the voice start to smear. The numbers below come from real customer MP3s in production.
Effectively transparent for speech. Podcast masters and station archive recordings. Speakers separate well when they are on different channels.
The most common tier for voice recordings. Zoom exports, Riverside recordings, the default for most encoders. Compression artifacts are invisible to the speech recognizer.
Voice-memo presets on most phones. Acoustic separation copes with 2-4 speakers. Numbers and proper names sometimes need a second look.
Old recordings, voicemail recordings, heavily compressed streams. Sibilant consonants (f/s/sh) blur together. We still transcribe them, but plan to review the output.
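The two bitrate thresholds mentioned above (~96 kbps and 64 kbps) can be turned into a quick pre-flight check. This is a sketch only: the function name and the three labels are assumptions made for illustration, and the page does not say how the 64 to 96 kbps range behaves, so it is marked as borderline here.

```python
def expected_speech_quality(bitrate_kbps: float) -> str:
    """Map an MP3 bitrate to a rough expectation for transcription input."""
    if bitrate_kbps <= 0:
        raise ValueError("bitrate_kbps must be positive")
    if bitrate_kbps >= 96:
        return "well preserved"  # voice survives compression very well
    if bitrate_kbps < 64:
        return "degraded"  # consonants and sharpness start to smear
    return "borderline"  # assumed label for the range in between


for kbps in (320, 128, 64, 48):
    print(kbps, expected_speech_quality(kbps))
```

A "degraded" result does not mean the file is rejected, only that the transcript is more likely to need a review pass.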
Frequently asked questions
30 free minutes per month. No card required. Speaker labels, 99 languages, and everything else included.
Start free