Whisper local / open source
Boikgang fa o na le GPU le letsatsi le lengwe. Ga go na diarization ya mosaatsanyi mo go go akoya.
Akoma faele ya MP3 mo go neng ka neng mo 64 go 320 kbps. Amogetsa phetoleling e e nang le nako le mawala a mosaatsanyi mo go dipuo e 99 — ga le tlhoke go badila mokgatlho, ga o re-encode, ga le hemedi mo listeng.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ Leba seleo se se etsang
Re balela diretswa tša diframe tša MP3 ka go ithe — VBR, CBR, joint-stereo, encoder neng (LAME, Fraunhofer, FFmpeg). Fa faele e le stereo e e boiteanong go lebakabaka le ba mosaatsanyi go le different channels, re dirisa seo go go rarola diphathi. Mono mix-down e wa go fallback go diarization e e nang le kumo.
So when did you first realise the archive was incomplete?
Probably around 2019, when we started digitising the reel-to-reels.
And the missing tapes — were they catalogued anywhere at all?
There's a paper index from '78, but half of it's water-damaged.
↓ This is the dashboard
Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.
Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.
Dikgetsi tše tharo tše di mo gare · kutlo ya boitshoko
O ka diranela Whisper mo lapeng la gago fela fa o bo technical. Otter le Sonix di amogetsa MP3 uploads mo go dashboards tša subscription. Ree amogetsa faele, re go bega phetoleling, le ga o ka gona mo UI.
Boikgang fa o na le GPU le letsatsi le lengwe. Ga go na diarization ya mosaatsanyi mo go go akoya.
Akoma MP3. Amogetsa sengwalwa se se nang le mawala a mosaatsanyi ka nako e e swanetseng.
Dashboard e e lokileng, monthly minutes cap, English-tuned. File upload e utwa jaaka setheo se se sekondari.
Pricing le feature availability di nepele go ya go May 2026. Whisper performance e fapanika go ya go model size le hardware.
E le seethemogo sa MP3
MP3 ke mokgatlho, e se recording style — go duma seleo sa go botlhokwa go tswa mo encoder, e se puo.
Defaults tšee di swanela ~80% ya mafaele a MP3. Override per-job go forma.
Accuracy · real-world numbers
MP3 accuracy e kopantšwa ke seelo se compressed keeper e sa se gatile, e se ke a. Perceptual compression mo go feta ~96 kbps e sireletša intelligibility ya puo gabotlane; tlase ga 64 kbps, sibilants le consonants ba simolola go bolawa. Dinomoro tlase e tšwa mo MP3s e e bonolo ya customer mo go go dirago tikologo.
Go kakaretso-lossless go puo. Podcast masters, dictation app exports, professional interview rigs. Diarization e nepelo fa ba mosaatsanyi mo separate channels.
Bitrate e e lešoro ka kakaretso go MP3s tšee di batlangaang puo. Zoom exports, Riverside downloads, voice recorders e simolola mo default. Compression artifacts ga di utolole go recognizer.
Voice memo defaults mo difoneng tšee di le ntši. Acoustic diarization e lametša 2-4 speakers. Dinomoro le proper nouns ka nako e nngwe di tlhoka leba.
Old answering-machine rips, lecture archives, narrow-band sources. High-frequency consonants (f/s/sh) ba bolawa. Le boela ba legible — beakanya proofread.
Dipotšo tšee di e kgonegile
Metsotso e 30 e le boikgang mo kgwedi e nngwe. Ga le tlhoke karata. Mawala a mosaatsanyi, dipuo tše 99, export format nngwe le nngwe e akantswe.
Simolola boikgang