Self-hosted Whisper / open source
Free if you have a GPU and a spare afternoon. No built-in speaker diarization.
Upload an MP3 file at any bitrate from 64 to 320 kbps. Get a timestamped, diarized transcript in 99 languages, with no format conversion and no re-encoding.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
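To get a feel for how much audio fits under the 100 MB anonymous limit, here is a rough sketch of the arithmetic for constant-bitrate files. It assumes "MB" means 1,000,000 bytes and ignores ID3 tags and frame overhead; the function name `cbr_duration_minutes` is illustrative, not part of any product API.

```python
def cbr_duration_minutes(size_bytes: int, bitrate_kbps: int) -> float:
    """Approximate playing time of a constant-bitrate MP3 (tags ignored)."""
    if bitrate_kbps <= 0:
        raise ValueError("bitrate_kbps must be positive")
    total_bits = size_bytes * 8
    seconds = total_bits / (bitrate_kbps * 1000)  # kbps = 1000 bits per second
    return seconds / 60


# Assumption: the upload limit is counted in decimal megabytes.
LIMIT_BYTES = 100 * 1_000_000

for kbps in (64, 128, 320):
    minutes = cbr_duration_minutes(LIMIT_BYTES, kbps)
    print(f"{kbps} kbps: about {minutes:.0f} minutes")
```

Under these assumptions, 100 MB holds roughly 208 minutes at 64 kbps, 104 minutes at 128 kbps, and 42 minutes at 320 kbps. VBR files will differ, since their bitrate changes from frame to frame.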
↓ See what you'll get
We read MP3 frame headers directly: VBR, CBR, joint-stereo, any encoder (LAME, Fraunhofer, FFmpeg). If the file is true stereo with the speakers on separate channels, we use that to split the audio. Mono mix-downs fall back to acoustic diarization.
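For readers curious what "reading the frame header" involves, the sketch below decodes the 4-byte header of an MPEG-1 Layer III frame and reports bitrate, sample rate, and channel mode. It is a minimal illustration of the public header layout, not the service's actual parser; it handles only MPEG-1 Layer III, and the function name is hypothetical.

```python
import struct

# Lookup tables for MPEG-1 Layer III; index 0 (free format) and 15 are unsupported here.
MPEG1_L3_BITRATES_KBPS = [None, 32, 40, 48, 56, 64, 80, 96, 112,
                          128, 160, 192, 224, 256, 320, None]
MPEG1_SAMPLE_RATES_HZ = [44100, 48000, 32000, None]
CHANNEL_MODES = ["stereo", "joint_stereo", "dual_channel", "mono"]


def parse_mpeg1_layer3_header(header: bytes) -> dict:
    """Decode bitrate, sample rate and channel mode from one frame header."""
    if len(header) < 4:
        raise ValueError("need at least 4 bytes")
    (word,) = struct.unpack(">I", header[:4])  # big-endian 32-bit word

    if (word >> 21) & 0x7FF != 0x7FF:          # 11 sync bits, all set
        raise ValueError("no frame sync")
    version = (word >> 19) & 0x3               # 0b11 = MPEG-1
    layer = (word >> 17) & 0x3                 # 0b01 = Layer III
    if version != 0b11 or layer != 0b01:
        raise ValueError("only MPEG-1 Layer III is handled in this sketch")

    bitrate = MPEG1_L3_BITRATES_KBPS[(word >> 12) & 0xF]
    sample_rate = MPEG1_SAMPLE_RATES_HZ[(word >> 10) & 0x3]
    if bitrate is None or sample_rate is None:
        raise ValueError("free-format or reserved header value")

    return {
        "bitrate_kbps": bitrate,
        "sample_rate_hz": sample_rate,
        "channel_mode": CHANNEL_MODES[(word >> 6) & 0x3],
    }


# A common header: 128 kbps, 44.1 kHz, joint stereo.
print(parse_mpeg1_layer3_header(bytes([0xFF, 0xFB, 0x90, 0x64])))
```

In a VBR file the bitrate field changes from frame to frame, so a real parser would walk every frame rather than trust the first one. The channel mode only says how the audio is stored; whether each speaker actually sits on their own channel still has to be judged from the audio itself.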
So when did you first notice that the archive was already deteriorating?
Probably around 2019, when we started buying the reel-to-reels.
And the tapes that still exist, were they ever cataloged anywhere?
There's a paper index from '78, but half of it is water-damaged.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Three real options · one clear trade-off
You can run Whisper on your own hardware for free if you have the engineering skills. Otter and Sonix accept MP3 uploads inside subscription dashboards. We take the file, return the transcript, and stay out of your way.
Upload an MP3. Get a transcript with speaker labels in about 0.025 × the audio's running time.
Polished dashboard, large monthly minute allowances, English-tuned. File upload feels like an afterthought.
Pricing and feature availability are accurate as of May 2026. Whisper performance varies with model size and hardware.
Specific to MP3
MP3 is a format, not a use case, which means the failure modes come from the encoder, not from the speech.
These defaults fit ~80% of MP3 files. Override them per job from the form.
Accuracy · real-world numbers
MP3 accuracy is limited by what the encoder preserved, not by us. Perceptual compression above ~96 kbps preserves the speech signal well; below 64 kbps, sibilants and consonants start to degrade. The numbers below come from real customer MP3s.
Nearly lossless for speech. Podcast masters, dictation app exports, professional interview rigs. Clean diarization if the speakers are split across separate channels.
The most common bitrate for spoken-word MP3s. Zoom exports, Riverside downloads, voice recorder defaults. Compression artifacts are inaudible to the recognizer.
The voice memo default on many phones. Acoustic diarization handles 2-4 speakers. Numbers and proper nouns always need a second look.
Worst case: old answering-machine rips, lecture archives, narrow-band sources. High-frequency consonants (f/s/sh) blur together. Still usable, but plan on a proofreading pass.
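The two thresholds quoted above (roughly 96 kbps and 64 kbps) can be turned into a quick pre-upload check. The sketch below is only an illustration of that rule of thumb: the band names are made up for this example, and the text does not say how the range between the two thresholds behaves, so it is labeled "intermediate" here as an assumption.

```python
def speech_quality_band(bitrate_kbps: int) -> str:
    """Classify an MP3 bitrate using the ~96 kbps and 64 kbps rule of thumb."""
    if bitrate_kbps <= 0:
        raise ValueError("bitrate_kbps must be positive")
    if bitrate_kbps >= 96:
        return "well_preserved"   # speech signal largely intact
    if bitrate_kbps >= 64:
        return "intermediate"     # assumption: not described in the text
    return "degraded"             # sibilants and consonants start to suffer


for kbps in (320, 128, 64, 32):
    print(kbps, speech_quality_band(kbps))
```

A file that lands in the lowest band is still worth transcribing; it simply signals that a manual review pass is more likely to be needed.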
Frequently asked questions
30 minutes free every month. No card required. Speaker labels, 99 languages, and all export formats are included.
Start free