Fakasipi WAV files mo ngaahi mahina.Lossless quality.

Toʻa 'a e WAV pī mei he field rig, DAW bounce, pe he interview kit. Ke taki ʻiate koe 'a e 24-bit headroom, ke fai diarization 'i he raw PCM, mo fa'u ki he kupu 'etau taimi mo SRT 'i he miniti.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-deleted in 24h

↓ Sipi hange ha ia ka fakatupu

Raw PCM ki lalo. Kupu mahina'o ki tuʻu.

Lossless WAV e fai ia 'o e sibilant, plosive, mo e kupu 'ē ke toe mai — 'ikai fakalolotoga MP3 'i he consonants. Kapau 'e hoko 'a e file multi-track (speaker taha 'i ha channel), te toʻo ʻiate koe 'a e acoustic diarization p mo ke toʻa 'i he channel layout.

WAV · 48 kHz / 24-bitREC 2 tracks · 1h 12m · 743 MB
auto-detected en-GBstereo PCM · uncompressed
~90s
Kupu · streaming97% lelei
S1

Toʻo au ki 'ia taimi ma'a 'i seventy-eight — ʻe hā taimi na hoko ai e telefoni?

S2

Quarter to five, give or take. Kettle was on, I remember that much.

S1

Mo meʻa ia na koe fai ʻal ki he boatyard?

S2

Straight to the boatyard. Lights were still on when I pulled in.

97% 'i he per-track WAVSRT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Ngaahi hako tokotolu 'o lelei lelei · fakahihifo 'o lelei lelei

Adobe Audition. Descript. Pe ʻiate koe.

Adobe Audition Speech to Text e poupou mo e Creative Cloud p e tui 'i he timeline. Descript e toʻo 'a e WAV ki hona editor. Ke toʻa 'iate koe 'a e file 'o hange 'ia, fakatupu ʻiate ʻoku ngaahi fa'u, p 'ē faʻi koe ke malolo 'a e project mo'o.

Option 01

Adobe Audition / Premiere

Kupu panel 'i he Adobe timeline. Tuʻu ʻi he Creative Cloud mo e project file.

Ne'i mahaloCreative Cloud subscription
Speaker diarizationYes, mixed-down only
Multi-track WAVFlattened before STT
FakatupuSRT · CSV · XML
Ngaahi lea18, manual select
Koloa~$23/mo (single app)
Best forNgaahi fakatotolo 'ē kuta 'i he Premiere pe Audition 'ē mahalo ngaahi kaptions tuʻi 'i he timeline.
Option 02

Transcription.Solutions

Taʻo 'a e WAV. Per-channel diarization kapau multi-track. Koloa 'o toʻa 'i he houa 24.

Ne'i mahaloNothing — 'o 'e file mo'o
Speaker diarizationPer-track pe acoustic
Multi-track WAVUp to 16 channels
FakatupuSRT · VTT · DOCX · TXT · JSON
Ngaahi lea99, auto-detected
Koloa · per min$0.03
Best forKau ʻe mahu 'a e raw WAV — field recordists, podcasters bouncing mei he DAW, oral history archivists, researchers.
Option 03

Descript

Toʻo mai ho WAV ki he Descript editor. Kau 'o lelei, kae tauʻi ke fai 'i he meʻa 'o ia.

Ne'i mahaloDescript account + import
Speaker diarizationAcoustic, EN-tuned
Multi-track WAVImport as separate clips
FakatupuTXT · SRT · DOCX
Ngaahi lea23, accuracy varies
Koloa$16–24/user/mo
Best forPodcast editors 'ē mahalo ke fakatotolo 'a e leo fakasipi 'a e kupu — 'o ia 'a e Descript superpower 'o ia.

Pricing accurate as of 2026. Adobe mo Descript feature flags e suia ʻa e taimi; sipi 'a e ngaahi kupu koloa kuo 'aonga 'i he toki.

Specific ki he WAV

Ngaahi meʻa tokotolu 'ē ha'u 'i ha ngaahi fakasipi tools 'o sipi lelei.

Most uploaders e sipi liʻo 'a e WAV mo'o kuo ka toʻo 'i ha recognizer. 'Ē fai ʻiate koe.

Hange ha 'e fakapōpō

  1. 1Multi-track WAV e tui. 'A e 4-channel field recording mei he Sound Devices MixPre e tui ki he mono before STT. 'O e per-mic separation 'ē nofo mo'o 'e toli'i.
  2. 232-bit float WAVs mei he Zoom F-series pe MixPre 'e toʻo toʻo, pe clip ki he 16-bit mo toli'i 'a e headroom recovery.
  3. 396 kHz / 24-bit interviews e toe 'aonga ʻe lolotoga 'a e toʻo koeʻuhi 'a e meʻa e fakahoko ki he MP3 'i he browser kuo ka toʻo.

Hange ha 'e toʻo 'i he meʻa 'o ia

  1. 1Toʻo 'a e multi-track WAV 'o hange 'ia (up to 16 channels). Te sipi ʻiate koe 'a e channel layout mei he WAV header mo toʻa speaker taha per track — 'ē mahalo acoustic guessing.
  2. 232-bit float e 'aonga ʻa e nativu. Te taki ʻiate koe 'a e float headroom kuo ka fakanofo ke recognizer, koe peaks above 0 dBFS 'ē toʻo.
  3. 3Direct binary upload, 'ē transcode 'i he browser. 'A e 2 GB WAV e fai 'i ho full bandwidth mo fakasipi 'a e momeʻa mei he byte tuʻu taha.

Recommended job settings ki he WAV

Taʻo 'a e WAV mo 'ia toʻa 'i he default. Override per-job mei he form.

Sample rate
Native (no downsample)
Bit depth
24-bit / 32-float preserved
Diarization
Per-channel kapau multi-track
Speaker model
Interview · 2-8 speakers
Filler words
Kept (toggle off kapau mahalo)
Fakatupu
DOCX · SRT · timestamped TXT

Accuracy · real-world numbers

97%+ 'i he per-track WAV. WAV e ha'u mai he recognizer 'o e kupu mahina'o lelei.

Koeʻuhi WAV e tukituki 'a e raw PCM 'e 'ikai ma'u fakasipi koloa, 'ē 'ikai tu'u 'a e consonants mo sibilants 'o ia hange 'ia MP3 tu'u — 'ē 'ikai ma'u e recognizer 'a e fakasipi 'o ia. Ngaahi kino 'i lalo ne'i mei he ngaahi customer WAV jobs 'i he fakatotolo koloa.

98%
Studio WAV · speaker taha

48 kHz / 24-bit, large-diaphragm condenser, treated room. Narration, audiobook, voice-over bookings land here.

96%
Multi-track interview WAV

Channel taha per speaker (lavs pe boundary mics). Diarization 'o 'e channel routing mo'o — text-only error.

92%
Handheld field recorder

Zoom H5, Tascam DR-40, similar. Stereo XY pickup, 2-3 speakers, some room reflection. Most podcast WAVs land here.

85%
Noisy environment field WAV

Outdoor, café, vehicle. Lossless capture e fai ia — 'o e leo 'o ia 'o ia, 'ikai codec artefact — kae lelei 'e toʻo 'i he overlapping speech.

Ngaahi 'autafa founga

Ngaahi meʻa 8 'ē 'autafa 'i he WAV fakasipi.

01Hange ha 'e maximum WAV file size?+
5 GB per file 'i he standard plan, 'o ia 'e hake 'e 8 hours 'o e stereo 48 kHz / 24-bit, pe 2.5 hours 'o e 96 kHz / 24-bit. Large files 'e lelei 'i he team plan — e faʻi 'iate koe kuo ka toʻo 'a e upload.
02Te 'aonga koe 32-bit float WAV mei he Zoom F-series pe MixPre?+
Yes, nativu. Te sipi ʻiate koe 'a e float samples 'e 'ikai toʻo 'i 0 dBFS, koe loud transients 'ē nelekehe ke toʻo 'i lalo 'i he post 'e toʻo fakasipi. Most generic uploaders e sipi liʻo down-cast ki he 16-bit first.
03ʻoku ʻi a'u 'a e 4-channel WAV mei ha field recorder — speaker taha 'i ha channel. 'E mahalo ha ia e diarization 'i he meʻa 'o ia?+
'E mahalo ia. Toʻo 'a e polyphonic WAV 'o liʻo ('ē bounce ki stereo first). Te sipi ʻiate koe 'a e channel layout mei he WAV header mo toʻa speaker taha per track — lelei lahi 'e hake acoustic diarization 'i he similar voices.
04'E toʻo koe 'a e 96 kHz WAV mo'o?+
The recognizer e fai 'i 16 kHz internal — 'o ia 'a e ceiling 'o e human speech intelligibility. Kae taki ʻiate koe 'a e original file 'e hōʻo mo e post-processing like noise gating. 'O e exports 'e fakahihifo 'i he original timeline.
05Is WAV actually more accurate than MP3 ki he fakasipi?+
Marginal, yes — usually 1-2 points 'o WER 'i he clean speech. 'O e gap lelei 'e hōʻo 'i he sibilants mo quiet passages, 'o ia MP3 psychoacoustic compression e toli'i 'a e fakasipi 'ē mahalo 'a e recognizer. Ki he archival pe forensic meʻa, WAV 'o ia 'a e right call.
06Are BWF metadata mo timecode preserved?+
Te sipi ʻiate koe 'a e BWF chunks (bext, iXML) mo fai 'a e start timecode ke 'au 'a e kupu ki ho session timeline. 'O e original WAV 'ē toʻo 'a me'a mo'o — te fai ʻiate koe kuo ka copy 'ē kaveoa 'i he houa 24.
07Can I toʻa 'a e folder 'o WAV files mei DAW session fakatupu?+
Yes. Batch upload e mahalo up to 50 files 'i he taha. Each WAV e fa'u hona job mo kupu. Kapau 'e stems mei he session taha, e mahalo 'a koe e merge 'i he faʻu multi-track WAV kuo ka toʻo mo diarize per channel.
08Hange ha e 1-hour stereo WAV 'o ia 'e 'aonga?+
Upload 'o ia 'a e slowest meʻa — 'a e 1-hour 48 kHz / 24-bit stereo WAV 'o ia 'e hake 'e 600 MB mo 2-5 miniti 'i he halo broadband. Kuo ka toʻo, fakasipi mo'o e toe 'i ha 4-6 miniti 'i he standard queue.

Taʻo 'a e WAV mo'o. Tauʻi 'a e lossless quality. Sipi hange ha ia ka fakatupu.

30 free miniti every month. 'Ē card. Per-track diarization, 32-bit float supported, source audio 'e kaveoa 'i he houa 24.

Fakasipi free