Transcribe WAV files with speaker labels. Lossless quality.

Upload the WAV recording as-is from a field recorder, a DAW bounce, or an interview kit. We keep the 24-bit depth, run diarization on clean PCM, and return a timestamped transcript and SRT within minutes.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required · ~90s per 60-min file · SRT · VTT · DOCX · TXT · Files auto-deleted in 24h

↓ See what comes out

Uncompressed PCM in. Clean transcript out.

WAV's lossless quality means transients, sibilants, and sharp consonants are transcribed cleanly, with no MP3 encoding smearing the sibilants. The engine hears what the microphone heard. If the file is multichannel (one speaker per channel), we skip acoustic diarization and split by channel layout instead.

WAV · 48 kHz / 24-bit · REC · 2 channels · 1h 12m · 1.24 GB
auto-detected en-GB · stereo PCM · uncompressed
~90s
Transcript · streaming · Accuracy 97%
S1

Take me through that morning. What time did the phone call come?

S2

Early on, or near enough. The kettle was on, I remember that.

S1

And what did you do right after that?

S2

I went straight outside. The lines were still up when I left.

97% accuracy on per-channel WAV · SRT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Three options, honestly compared

Adobe Audition. Descript. Or us.

Audition's Speech to Text is bundled with Creative Cloud and lives inside the timeline. Descript pulls the WAV into its own editor. We take the file as-is, return standard export formats, and don't ask you to move your workflow anywhere else.

Option 01

Adobe Audition / Premiere

In-timeline transcription inside Adobe. Tied to Creative Cloud and the project file.

Requires: Creative Cloud subscription
Speaker diarization: Yes, on the mixed-down track only
Multichannel WAV: Flattened before STT
Exports: SRT · CSV · XML
Languages: 18, selected manually
Price: ~$23/month (single app)
Best for: Editors already working in Premiere or Audition who want captions linked to the timeline.
Option 02

Transcription.Solutions

Upload the WAV. Per-channel diarization if it is multichannel. Source file deleted in 24h.

Requires: Nothing, just the file
Speaker diarization: Per-channel or acoustic
Multichannel WAV: Up to 16 channels
Exports: SRT · VTT · DOCX · TXT · JSON
Languages: 99, auto-detected
Price per minute: $0.03
Best for: Anyone working with clean WAV: field sound recordists, podcasters working in a DAW, oral historians, researchers.
Option 03

Descript

Imports the WAV into the Descript editor. Powerful, but you have to do the work inside it.

Requires: Descript account + import
Speaker diarization: Acoustic, EN-tuned
Multichannel WAV: Imported as a single clip
Exports: TXT · SRT · DOCX
Languages: 23, accuracy varies
Price: $16–24/user/month
Best for: Podcasters who want to edit audio by editing the transcript, which is Descript's signature strength.

Pricing is accurate as of 2026. Adobe and Descript change feature availability often; check their current documentation before you commit.

Specific to WAV

Three things that trip people up in general-purpose transcription tools.

Many uploaders that accept WAV squash it before sending it to the engine. We don't.

What goes wrong elsewhere

  1. Multichannel WAV gets mixed down. A 4-channel recording from a Sound Devices MixPre is summed to mono before STT, and the per-mic separation you recorded for is thrown away.
  2. 32-bit float WAVs from a Zoom F-series or MixPre are rejected, or truncated to 16-bit, which loses the headroom recovery.
  3. 96 kHz / 24-bit interviews take a long time to upload because the tool re-encodes to MP3 in the browser before sending.

What we do instead

  1. Upload the multichannel WAV as-is (up to 16 channels). We read the channel layout from the WAV header and assign one speaker per track, with no acoustic guessing.
  2. 32-bit float is accepted natively. We preserve the float headroom when normalizing for the engine, so peaks above 0 dBFS are not clipped.
  3. Direct binary upload, with no transcode in the browser. A 2 GB WAV goes up at your full bandwidth and starts processing as soon as the last byte arrives.
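
As a rough illustration of what "one speaker per track" can look like, the sketch below reads the channel count from the WAV header with Python's standard `wave` module and de-interleaves the frames. It is not the service's actual pipeline: the helper names (`split_channels`, `make_test_wav`) and the `S1`..`Sn` labels are assumptions made for the example, and it only handles 16-bit PCM.

```python
import io
import struct
import wave


def split_channels(wav_bytes: bytes) -> list[list[int]]:
    """Return one list of samples per channel from a 16-bit PCM WAV."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        n_channels = wav.getnchannels()  # taken from the WAV header
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch handles 16-bit PCM only")
        raw = wav.readframes(wav.getnframes())
    samples = struct.unpack(f"<{len(raw) // 2}h", raw)
    # Frames are interleaved: ch0, ch1, ..., chN-1, ch0, ch1, ...
    return [list(samples[ch::n_channels]) for ch in range(n_channels)]


def make_test_wav(n_channels: int, n_frames: int) -> bytes:
    """Build a tiny WAV in memory where channel c always holds the value c + 1."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(n_channels)
        wav.setsampwidth(2)
        wav.setframerate(48000)
        frame = struct.pack(f"<{n_channels}h", *range(1, n_channels + 1))
        wav.writeframes(frame * n_frames)
    return buf.getvalue()


# Hypothetical 4-mic recording: each track becomes its own speaker label.
tracks = split_channels(make_test_wav(n_channels=4, n_frames=3))
speakers = {f"S{i + 1}": track for i, track in enumerate(tracks)}
```

Because the routing comes from the file layout rather than from voice characteristics, two similar-sounding speakers on separate mics cannot be confused with each other.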

Settings pre-tuned for WAV

Upload a WAV and these are the defaults. Change any of them per job in the form.

Sample rate: Native (no downsample)
Bit depth: 24-bit / 32-bit float preserved
Diarization: Per-channel if the file is multichannel
Speaker model: Interview · 2-8 speakers
Filler words: Kept (switch off if needed)
Exports: DOCX · SRT · word-level timestamps
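
For readers who post-process the exports, here is a minimal sketch of how timestamped segments map onto SRT cues. The input shape (a list of `(start, end, speaker, text)` tuples) and both function names are assumptions for illustration, not the service's export schema; only the SRT cue layout itself (index, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) is standard.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms_total = int(round(seconds * 1000))
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues) -> str:
    """Render (start, end, speaker, text) tuples as numbered SRT cues."""
    blocks = []
    for index, (start, end, speaker, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{speaker}: {text}\n"
        )
    return "\n".join(blocks)


# Hypothetical two-speaker excerpt.
example = to_srt([
    (0.0, 2.5, "S1", "What time did the call come?"),
    (2.5, 5.0, "S2", "Early. The kettle was on."),
])
```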

Accuracy · real-world numbers

97%+ on per-channel WAV. WAV gives the engine the most signal to work with.

Because WAV stores clean, uncompressed PCM, sibilants and transients are not smeared the way MP3 encoding smears them. The engine hears what the microphone heard. The figures below come from WAV files in real production use.

98%
Studio WAV · single speaker

48 kHz / 24-bit, large-diaphragm condenser, treated room. Narration, audiobooks, and voice-over bookings land here.

96%
Multichannel interview WAV

One channel per speaker (lavs or boom mics). Diarization is just channel routing, so the only errors left are word errors.

92%
Handheld field recorder WAV

Zoom H5, Tascam DR-40, and similar. Stereo XY pickup, 2-3 speakers, and moderate background noise. Most podcast WAVs land here.

85%
Noisy location WAV

Outdoors, cafes, and similar busy spaces. The clean capture helps (the noise is real noise, not a codec artefact), but accuracy still drops when speakers overlap.

Frequently asked questions

8 things people ask about WAV transcription.

01 What is the maximum WAV file size?
5 GB per file on the standard plan, which is roughly 5 hours of stereo 48 kHz / 24-bit or 2.5 hours of stereo 96 kHz / 24-bit. Larger files are possible on the team plan; just contact us before uploading.
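
The relationship between a size limit and recording length is plain arithmetic: bytes = sample rate × bytes per sample × channels × seconds. A minimal sketch, assuming decimal gigabytes (1 GB = 10^9 bytes) and ignoring the small WAV header; the function names are illustrative:

```python
def pcm_bytes(sample_rate_hz: int, bit_depth: int, channels: int, seconds: float) -> int:
    """Size of uncompressed PCM audio in bytes (header not included)."""
    return int(sample_rate_hz * (bit_depth // 8) * channels * seconds)


def max_hours(limit_bytes: int, sample_rate_hz: int, bit_depth: int, channels: int) -> float:
    """Longest recording, in hours, that fits under a byte limit."""
    bytes_per_second = sample_rate_hz * (bit_depth // 8) * channels
    return limit_bytes / bytes_per_second / 3600


LIMIT = 5_000_000_000  # 5 GB, decimal (assumption)

one_hour_stereo_48k = pcm_bytes(48_000, 24, 2, 3600)
hours_at_48k = max_hours(LIMIT, 48_000, 24, 2)
hours_at_96k = max_hours(LIMIT, 96_000, 24, 2)
```

Doubling the sample rate or the channel count halves the duration that fits under the same limit.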
02 Do you support 32-bit float WAV from a Zoom F-series or MixPre?
Yes, natively. We read float samples above 0 dBFS, so loud transients that would have clipped on a fixed-point recording come through cleanly. Many other tools silently down-cast to 16-bit first.
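
To make the headroom point concrete, the sketch below contrasts a naive 16-bit down-cast, which clamps everything above full scale, with scaling by the float peak first. It is a simplified illustration, not the service's normalizer; the function names and the sample values are made up for the example.

```python
def to_int16_naive(samples):
    """Down-cast float samples to 16-bit by clamping to [-1.0, 1.0]."""
    out = []
    for s in samples:
        clamped = max(-1.0, min(1.0, s))  # anything over 0 dBFS is flattened
        out.append(int(round(clamped * 32767)))
    return out


def normalize_then_int16(samples):
    """Scale by the float peak first, so over-full-scale peaks keep their shape."""
    peak = max((abs(s) for s in samples), default=0.0)
    gain = 1.0 / peak if peak > 1.0 else 1.0
    return [int(round(s * gain * 32767)) for s in samples]


# Hypothetical float take with two transients above 0 dBFS (|s| > 1.0).
take = [0.25, 1.5, -2.0, 0.5]
clipped = to_int16_naive(take)
recovered = normalize_then_int16(take)
```

In `clipped`, the 1.5 and -2.0 peaks end up at the same magnitude, so their relative level is lost. In `recovered`, the louder peak sits at full scale and the quieter one stays proportionally below it.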
03 I have a 4-channel WAV from a field recorder, one mic per person. Will diarization use that?
It will. Upload the polyphonic WAV as-is (don't bounce it to stereo first). We read the channel layout from the WAV header and assign one speaker per track, which is far more reliable than acoustic diarization on similar-sounding voices.
04 Will you downsample my 96 kHz WAV?
The engine works at 16 kHz internally, which is the standard rate for speech recognition. But we keep your original file untouched and use it for post-processing such as gating. Your exports follow the original timeline.
05 Is WAV really more accurate than MP3 for transcription?
Really, yes: typically 1-2 points of WER on clean speech. The larger differences show up on sibilants and sharp consonants, where MP3's psychoacoustic compression discards detail the engine would have used. For oral history or forensic work, WAV is the standard.
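
WER (word error rate) is the word-level edit distance between a reference transcript and the engine's output, divided by the number of reference words. A minimal sketch of the standard calculation follows; the example sentences are invented, and real evaluations usually normalize case and punctuation first, which this version does not.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One sibilant-heavy word misheard out of four.
score = wer("she sells sea shells", "she sells see shells")
```

A difference of 1-2 WER points means one or two more wrong words per hundred words of reference text.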
06 Are BWF metadata and timecode preserved?
We read the BWF chunks (bext, iXML) and use the start timecode to align the transcript to your session timeline. The original WAV is never modified; we work on a processing copy that is deleted in 24h.
07 Can I upload a batch of WAV files from a DAW session export?
Yes. Batch upload accepts up to 50 files at a time. Each WAV gets its own job and its own transcript. If the files are stems from a single session, you can combine them into one multichannel WAV before uploading and we will diarize by channel.
08 How long does one hour of stereo WAV really take?
Upload is the slowest part: a 1-hour 48 kHz / 24-bit stereo WAV is about 1 GB and takes 2-5 minutes on typical broadband. Once the upload finishes, transcription itself runs in about 4-6 minutes on the standard queue.

Upload your WAV. Keep the clean quality. See what comes out.

30 free minutes per month. No card. Per-channel diarization, 32-bit float supported, source audio deleted in 24h.

Start now