Transcribe MP4 video to text. The audio is extracted automatically.

Drop the MP4 file in as it is: we extract the audio track on the server, return a timestamped transcript, and hand you an SRT to load back into YouTube, Vimeo, or your NLE.

Drop a file, or pick one

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

Sign up takes 30 seconds — recording opens right after, in the dashboard.

No card required · ~90s per 60-min file · SRT · VTT · DOCX · TXT · Files auto-deleted in 24h

↓ See what comes out

MP4 in. Transcript + SRT out.

MP4 is a container: we read the audio stream directly and never re-encode the video. Timing stays frame-accurate against your original timeline, so the SRT lines up on first use.

training-module-04.mp4 · REC 1080p · 22:14 · 412 MB
auto-detected en-US · AAC 48 kHz stereo · 192 kbps
~90s
Transcript · streaming · 95% accuracy
S1

So, in this module we walk through the core refund workflow.

S2

Quick question before we start: does this apply to partial refunds?

S1

Good question. Partials use the same tool, but the reason code is different.

S2

Got it. And the approval threshold stays at three hundred dollars?

95% on clean dialog · SRT · VTT · DOCX · TXT · JSON

↓ This is the dashboard

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

Try it on your own file — it's free

Three real options · honest comparison

DIY with ffmpeg. A video editor. Or us.

You can extract the audio yourself and run Whisper. You can drop the MP4 into Descript or VEED and live inside their editor. Or you can drop the file here and get back a transcript + SRT, with no editor lock-in.
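For readers weighing the DIY route, the extraction step can be sketched as below. This is a minimal illustration, not this service's pipeline: `build_extract_command` is a hypothetical helper, and the 16 kHz mono WAV target is an assumption chosen because Whisper resamples its input to 16 kHz anyway. The ffmpeg options used (`-map`, `-vn`, `-ac`, `-ar`, `-c:a`) are standard.

```python
from pathlib import Path


def build_extract_command(video: str, audio_stream: int = 0) -> list[str]:
    """Build an ffmpeg argv that writes one audio stream of `video` as a WAV file.

    Hypothetical helper for illustration; the video stream is dropped,
    not re-encoded.
    """
    out = str(Path(video).with_suffix(".wav"))
    return [
        "ffmpeg",
        "-i", video,
        "-map", f"0:a:{audio_stream}",  # choose one audio stream by its audio index
        "-vn",                          # ignore the video stream entirely
        "-ac", "1",                     # downmix to mono (assumption for Whisper)
        "-ar", "16000",                 # resample to 16 kHz (assumption for Whisper)
        "-c:a", "pcm_s16le",            # 16-bit PCM WAV
        out,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_extract_command("training-module-04.mp4")))
```

To actually run it, pass the list to `subprocess.run(cmd, check=True)` and then feed the WAV to Whisper. Diarization remains a separate step on this route.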

Option 01

ffmpeg + Whisper

Free, local, fiddly. You own the pipeline and every detail in it.

Requires: CLI + 10 GB model + GPU
Speaker diarization: Separate tool (pyannote)
SRT output: Yes, manual flag
Time on a 1-hour MP4: 20–90 minutes on CPU
Multi-track audio: You pick the stream
Price: $0 + your hardware
Best for: Engineers who already run Whisper locally and don't mind bolting on diarization.
Option 02

Transcription.Solutions

Drop the MP4. Audio extraction, diarization, SRT, summary: all in one pass.

Requires: Browser, that's it
Speaker diarization: Built in, every job
SRT output: Frame-aligned to the original
Time on a 1-hour MP4: ~4 minutes, streamed
Multi-track audio: We detect all the streams
Price · per minute: $0.03
Best for: Anyone who has an MP4 and wants a transcript and SRT without learning a video editor or a CLI.
Option 03

Descript / VEED

Drop the MP4 into the editor. The transcript appears as part of the timeline UI.

Requires: Account + editor learning curve
Speaker diarization: Yes, EN-tuned
SRT output: Export-gated by plan
Upload cap: 5 GB (Descript free)
Multi-track audio: First track only
Price: $12–24/user/month
Best for: Editors who want video editing and the transcript in the same workspace.

Pricing and feature caps are as of 2026. Descript and VEED tier names change often; check their sites for current details.

MP4-specific

Three things that trip people up on generic transcription tools.

MP4 is a container, not a codec, and most transcription tools treat it as an audio blob to push through. That is where the trouble starts.

What goes wrong

  1. Multi-track MP4 with boom + lav. Generic tools grab track 1 and ignore the rest, even when the clean mic is on another track. Common in FCP and Premiere exports.
  2. Background music in vlogs and ads produces phantom words. The recognizer latches onto vocals in the music bed.
  3. SRT timestamps drift when the tool re-encodes the video internally. By minute 40 the captions are off by a second.

How it is handled here

  1. Upload: we detect all audio streams and let you pick one to transcribe. The default is the highest-bitrate track.
  2. Turn on Music suppression in the job form. We gate the recognizer on speech VAD so instrumental sections stay out of the transcript.
  3. We never re-encode the video. Audio is extracted at the native sample rate and timestamps reference the container's edit list, so the SRT stays frame-accurate.
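The "highest-bitrate track" default from step 1 can be approximated locally. The sketch below is an assumption about how such a rule might look, not the service's actual code: `pick_default_audio_stream` is a hypothetical name, and the input is the JSON that `ffprobe -v error -show_streams -of json input.mp4` prints.

```python
import json


def _bit_rate(stream: dict) -> int:
    # ffprobe reports bit_rate as a string and may omit it; treat that as 0.
    value = str(stream.get("bit_rate", "0"))
    return int(value) if value.isdigit() else 0


def pick_default_audio_stream(ffprobe_json: str) -> int:
    """Return the container index of the audio stream with the highest bit rate."""
    streams = json.loads(ffprobe_json).get("streams", [])
    audio = [s for s in streams if s.get("codec_type") == "audio"]
    if not audio:
        raise ValueError("no audio streams found")
    return max(audio, key=_bit_rate)["index"]


if __name__ == "__main__":
    # Made-up probe result: one video stream and two audio tracks.
    sample = json.dumps({"streams": [
        {"index": 0, "codec_type": "video", "bit_rate": "4500000"},
        {"index": 1, "codec_type": "audio", "bit_rate": "128000"},
        {"index": 2, "codec_type": "audio", "bit_rate": "192000"},
    ]})
    print(pick_default_audio_stream(sample))
```

Bit rate is only a proxy for "the good mic", which is why a manual stream picker still matters for boom + lav exports.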

Default job settings for MP4

Drop an MP4 and these are applied by default. Override per job from the form.

Audio extraction: Native sample rate, no re-encoding
Track selection: Highest-bitrate stream
Diarization: Acoustic · 1-6 speakers
Music suppression: On for vlog/ad presets
SRT format: ≤42 chars/line, 2 lines max
Export: SRT · VTT · DOCX · timestamped TXT
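To make the SRT format setting concrete, here is a minimal sketch of wrapping caption text to 42 characters per line with at most 2 lines per cue. The function names are hypothetical and the greedy word wrap is an assumption; the page does not say how line breaks are actually chosen.

```python
import textwrap


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in SRT files."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def wrap_caption(text: str, width: int = 42, max_lines: int = 2) -> list[str]:
    """Split text into blocks of at most `max_lines` lines of `width` chars."""
    lines = textwrap.wrap(text, width=width)
    return ["\n".join(lines[i:i + max_lines])
            for i in range(0, len(lines), max_lines)]


def format_cue(index: int, start: float, end: float, block: str) -> str:
    """Render one SRT cue: index line, timing line, then the text block."""
    return f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{block}\n"


if __name__ == "__main__":
    text = "Quick question before we start: does this apply to partial refunds?"
    for n, block in enumerate(wrap_caption(text), start=1):
        # Timings here are placeholders; real cues take them from the transcript.
        print(format_cue(n, 0.0, 2.5, block))
```

Text that needs more than two lines comes back as several blocks, each of which would need its own time range.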

Accuracy · real-world numbers

95% on a clean shoot. Honest numbers when the audio gets rough.

MP4 accuracy is set by the mic, not the codec. A lav mic on a quiet set beats a 4K camera relying on its own onboard audio. The numbers below come from real customer MP4s, grouped by how the audio was captured.

96%+
Studio shoot, lav or shotgun mic

Lapel or boom into a recorder, 48 kHz AAC at 192+ kbps, quiet room. Top of the range. Speaker diarization holds up on two-person shoots.

93%
DSLR with on-camera shotgun

Camera-top mic 2-4 feet from the speaker. Some room sound, but the speech is clear. Most YouTube creator footage lands here.

89%
Screen recording with USB mic

OBS, Loom, Camtasia exports. The mic is close but the room is untreated, and system audio bleed is common. Still fine for tutorial transcripts.

84%
Phone-shot vlog, internal mic

The phone mic is thin, there is wind or crowd noise, and the distance to the speaker keeps changing. The words are usable; expect 1-2 corrections per minute in the rough stretches.
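The page does not define how these percentages are measured. A common convention, assumed here purely for illustration, is word accuracy = 1 − WER, where WER is the word-level edit distance between a human reference transcript and the machine output, divided by the reference length. A minimal sketch:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + insertions + deletions) / reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript is empty")
    # Classic dynamic-programming edit distance, computed row by row over words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution, or match at no cost
            ))
        prev = cur
    return prev[-1] / len(ref)


if __name__ == "__main__":
    ref = "the approval threshold stays at three hundred dollars"
    hyp = "the approval threshold stays at free hundred dollars"
    print(f"accuracy: {1 - word_error_rate(ref, hyp):.1%}")
```

Running the same check on a minute of your own footage is a quick way to see which of the four bands a given recording setup falls into.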

Common questions

8 things people ask about MP4 transcription.

01 Do you re-encode the video?
No. We read the audio stream straight out of the MP4 container. The video stream is never decoded, never re-encoded, and not kept once the job is done; you keep the original file.
02 Which codecs inside the MP4 are supported?
Standard H.264 + AAC is the default path. We also support HEVC/H.265, ProRes-in-MP4, and audio in MP3, Opus, ALAC, or PCM. If ffmpeg can probe it, we can transcribe it.
03 What is the file size cap?
10 GB per upload in the web uploader, 50 GB via the API with resumable chunks. A 1-hour 1080p MP4 is 1-3 GB, so most files go through the web path without a second thought.
04 Will the SRT line up with the original video?
Yes: timing references the MP4's edit list and native sample rate. We do not re-encode, so the SRT lines up. Load the SRT alongside the MP4 in any player or NLE and it syncs on first use.
05 Can I burn the subtitles into the video?
Not in our pipeline: we return the SRT and leave burn-in to your editor. An ffmpeg one-liner, HandBrake, Premiere, DaVinci, and Kapwing all accept the SRT we produce. We have no wish to become an encoding tool as well.
06 What about MOV, MKV, M4V, WebM?
All supported by the same pipeline. MOV in particular belongs to the same MPEG-4 family, so the extraction path is identical. MKV with multiple audio streams uses the same stream-picker UI as multi-track MP4.
07 Can I paste a YouTube or Vimeo URL?
Yes for YouTube: paste the public URL on the upload screen and we pull the audio directly, with no MP4 download. Vimeo needs a file upload or a signed download link, because their player gates the stream.
08 What if there is no spoken dialog, or only music and B-roll?
The VAD detects silent and music-only sections and skips them, so you are not charged for ambient footage. The transcript marks those spans as `[music]` or `[no speech]` rather than inventing words.
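For question 05, the ffmpeg burn-in route can be sketched as follows. This is an illustration under stated assumptions: `build_burn_in_command` is a hypothetical helper, the ffmpeg build must include the `subtitles` filter (libass), and the file names must be simple, since paths containing characters such as `:` or `'` need extra escaping inside a filter argument.

```python
def build_burn_in_command(video: str, srt: str, output: str) -> list[str]:
    """Build an ffmpeg argv that renders an SRT onto the picture.

    Burn-in re-encodes the video by definition; the audio is copied as-is.
    """
    return [
        "ffmpeg",
        "-i", video,
        "-vf", f"subtitles={srt}",  # draw the captions onto every frame
        "-c:a", "copy",             # leave the audio stream untouched
        output,
    ]


if __name__ == "__main__":
    cmd = build_burn_in_command(
        "training-module-04.mp4",
        "training-module-04.srt",
        "training-module-04-captioned.mp4",
    )
    print(" ".join(cmd))
```

Because this step re-encodes the picture, it is best done once, at the end, on the final edit.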

Drop your MP4. Get back a transcript and SRT.

30 free minutes per month. No card. Server-side audio extraction, speaker diarization, frame-accurate SRT: all included.

Start free